Commit Graph

19 Commits (8645591d664f9e059113900281a715f8f83ae93c)

Author SHA1 Message Date
mapingshuo 7d4002e06a
restrict block num of layer_norm_grad cuda block to 128 (#23878)
5 years ago
Pei Yang 0a51098a71
Add TRT support for BERT (#21135)
5 years ago
Yihua Xu 69dd5152cf Fix the crash issue when scale or bias was null-pointer. (#21284)
5 years ago
danleifeng 0e7baabe59 extend elementwise broadcast function (#20957)
5 years ago
sneaxiy 023a3a3d62 fix op grad maker
6 years ago
tensor-tang 14a764c930 simplify the jitkernel templates and tests
6 years ago
tensor-tang 802f362ac4 unify the kernelfuncs cache and add unit test
6 years ago
tensor-tang 1aaec571c2 fix enum style
6 years ago
tensor-tang 6648995f53 fix build
6 years ago
tensor-tang 720b55cbcf enable crf decoding and layer norm refer code
6 years ago
Yihua Xu f4c869d872 Optimize the layer_norm operator with AVX intrinsic function (#14417)
6 years ago
Wu Yi a2d9b34417
Refine operator cmake (#14413)
6 years ago
Yu Yang ef6ea790dc Clean and extract blas
7 years ago
Xin Pan 1a4be55a47 Pass cpu build
7 years ago
Xin Pan 904fa05f46 Improve layer_norm speed
7 years ago
Yi Wang cfffb1a362
Update tensor_util.h (#8422)
7 years ago
qingqing01 24509f4af9 Fix the grammar in copyright. (#8403)
7 years ago
Yi Wang fc374821dd Correct #include path
7 years ago
Yi Wang 90648f336d Move file to fluid/; Edit CMakeLists.txt
7 years ago