Commit Graph

1341 Commits (fef3654b4e76f5e2cc9a5f71c1c047cef82192e5)

Author SHA1 Message Date
jakpiase f8da5536ed
REUPLOAD Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30719)
4 years ago
Tao Luo 824a79d383
Revert "Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30661)" (#30708)
4 years ago
jakpiase d834f4e6e8
Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30661)
4 years ago
Qi Li 846ce40604
[ROCM] update eigen cmake and patch, test=develop (#30602)
4 years ago
石晓伟 39fac847cd
delete the lite meta info because of ccache, test=develop (#30644)
4 years ago
Qi Li 1f5841c2a0
[ROCM] update cmake and dockerfile, test=develop (#30598)
4 years ago
石晓伟 33bf6eb753
revert external gflags, test=develop (#30623)
4 years ago
QingshuChen d849ecc0ae
update kunlun dependence for aarch64 & sunway platform (#30516)
4 years ago
hutuxian 40ede12631
Ascend Framework Part1: OP & Wrapper (#30281)
4 years ago
wanghuancoder bd97192274
if pybind.cc changed, generate total report, test=develop (#30514)
4 years ago
石晓伟 715d862868
export global google flags to users, test=develop (#30448)
4 years ago
taixiurong 6a3c8725b0
support transformer v2.0 (#30381)
4 years ago
Zhou Wei c94a4b9468
Separate AVX and NO_AVX compilation, enhance installation error message (#30413)
4 years ago
houj04 dc12b5eedf
resolve #30141 (#30145)
4 years ago
Wojciech Uss fc42faffc2
Wojtuss/upgrade one dnn 2.0 (#30295)
4 years ago
tangwei12 25f80fd304
Fix/distributed proto (#29981)
4 years ago
wuhuanzhou 1eeba9802f
fix the problem of Unity Build with incremental compilation, test=develop (#30232)
4 years ago
tianshuo78520a 7564d43bbc
down openssl (#29958)
4 years ago
QingshuChen 8e1c3ddf15
add aarch64 and sunway kunlun lib (#30027)
4 years ago
Wilber 66e16b7e99
update lite subgraph. (#30056)
4 years ago
石晓伟 181ea1870b
flush denormals to zero, test=develop (#29924)
4 years ago
Wilber 332da133a1
Support mips arch (#29903)
4 years ago
liuyuhui 4427df37cf
[Kunlun] PR2: Support MultiDevicePass and BKCL in parallel executor (#29574)
4 years ago
wanghuancoder 26f9ab70f7
if PR have no .py files, do not use 'python coverage run', to speedup unit test (#29739)
4 years ago
tangwei12 032414ca2a
[Feature] one ps (3/4) (#29604)
4 years ago
Wojciech Uss 6ef8129dcc
upgrade oneDNN with GRU INT8 optimizations (#28420)
4 years ago
TTerror af8ded773a
update activation op on kunlun (#29577)
4 years ago
Y_Xuan 76738504ad
添加rocm平台支持代码 (#29342)
4 years ago
YUNSHEN XIE 2926e74326
New UT should not exceed 15s (#29492)
4 years ago
QingshuChen 79a41a9ed6
support roi_align & affine_channel for kunlun (#29561)
4 years ago
Wilber 740c0d58c3
update for xpu ci. (#29568)
4 years ago
LoveAn b5d4a1f33d
Add the strategy of skipping cc/cu test compilation and execution in CI (#29499)
4 years ago
Wilber 5fe1f8aff7
update lite tag (#29517)
4 years ago
procr 3a0558339d
support mobilenet for kunlun (#29458)
4 years ago
taixiurong ecca6585cd
1. fix elementwise ops'bug 2. fix softmax_with_cross_entropy_op 3. add biliner_interp_op (#29448)
4 years ago
Wilber ad01658e36
fix cmake error message. (#29421)
4 years ago
LoveAn 671555ed32
Compiling operator libraries with Unity build (#29130)
4 years ago
Wilber 6cb688865a
update lite tag. (#29392)
4 years ago
Wilber cff93b52a7
update cmake for FT openbals version. (#29382)
4 years ago
wanghuancoder 3765da98c7
add coverage incremental switch, test=develop (#29290)
4 years ago
Shang Zhizhou fc80d2e09c
add compile option WITH_TENSORRT (#29208)
4 years ago
QingshuChen 64f29fbb70
update kunlun conv2d/softmax/elementwise implemetation (#29229)
4 years ago
wanghuancoder 2b2cd1864a
revert python file coverage, delete coverage run --include, test=develop (#29230)
4 years ago
Wilber 4fec182d24
[Lite-Subgraph] Fix compile error for lite subgraph. (#29146)
4 years ago
wanghuancoder 0239f79695
Generate code coverage reports only for incremental files (#28508)
4 years ago
Zhou Wei e668cb07fb
fix CUDA 11 error on windows (#29101)
4 years ago
Shang Zhizhou b9e76a0103
detect tensorRT plugin fp16 in runtime (#27933)
4 years ago
YUNSHEN XIE 5cb8e17a18
restore timeout value (#29027)
4 years ago
taixiurong d3d1a6b6e0
add kunlun kernel: slice, slice_grad, top_k, cast. *test=kunlun (#28542)
4 years ago
Zhou Wei 93c39779b4
open a part of GPU unittest for windows (#28378)
4 years ago