1345 Commits (bc7a3afa687696541b032d56d1e9a8ca8e101c77)

Author SHA1 Message Date
lw921014 c594f57685
add c_reduce_sum op (#31793)
4 years ago
lw921014 15823bb0df
[NPU] add npu kernel for communication op (#31437)
4 years ago
Leo Chen c956c035dc
fix cmake of cryptopp to avoid downloading every time (#31451)
4 years ago
lw921014 9fcdaeba5e
add allreduce and broadcast without test (#31024)
4 years ago
Leo Chen 678a3e8fed
support adding correct npu op in pybind.h (#31143)
4 years ago
Leo Chen 85cbd55648
Fix compilation problem (#31100)
4 years ago
Leo Chen 1201cd2ef2
[feature] support npu allocator, part 2 (#30972)
4 years ago
Leo Chen 7e049108c5
[feature] support npu operator (#30951)
4 years ago
Leo Chen 81138239db
[feature] support npu allocator (#30840)
4 years ago
Leo Chen 500f28ec37
pass cxx_flags to gloo cmake (#30857)
4 years ago
Leo Chen 6eabbc8076
fix compilation on ascend-20.1 (#30722)
4 years ago
gongweibao f9c97dd728
Add distribution supported (#30578)
4 years ago
gongweibao 1882f2ce2d
Fix compilation on CANN20.1 and older (#30494)
4 years ago
hutuxian 6dd52c5b25
Ascend rc (#30483)
4 years ago
石晓伟 715d862868
export global google flags to users, test=develop (#30448)
4 years ago
taixiurong 6a3c8725b0
support transformer v2.0 (#30381)
4 years ago
Zhou Wei c94a4b9468
Separate AVX and NO_AVX compilation, enhance installation error message (#30413)
4 years ago
houj04 dc12b5eedf
resolve #30141 (#30145)
4 years ago
Wojciech Uss fc42faffc2
Wojtuss/upgrade one dnn 2.0 (#30295)
4 years ago
tangwei12 25f80fd304
Fix/distributed proto (#29981)
4 years ago
wuhuanzhou 1eeba9802f
fix the problem of Unity Build with incremental compilation, test=develop (#30232)
4 years ago
tianshuo78520a 7564d43bbc
down openssl (#29958)
4 years ago
QingshuChen 8e1c3ddf15
add aarch64 and sunway kunlun lib (#30027)
4 years ago
Wilber 66e16b7e99
update lite subgraph. (#30056)
4 years ago
石晓伟 181ea1870b
flush denormals to zero, test=develop (#29924)
5 years ago
Wilber 332da133a1
Support mips arch (#29903)
5 years ago
liuyuhui 4427df37cf
[Kunlun] PR2: Support MultiDevicePass and BKCL in parallel executor (#29574)
5 years ago
wanghuancoder 26f9ab70f7
if PR have no .py files, do not use 'python coverage run', to speedup unit test (#29739)
5 years ago
tangwei12 032414ca2a
[Feature] one ps (3/4) (#29604)
5 years ago
Wojciech Uss 6ef8129dcc
upgrade oneDNN with GRU INT8 optimizations (#28420)
5 years ago
TTerror af8ded773a
update activation op on kunlun (#29577)
5 years ago
Y_Xuan 76738504ad
Add ROCm platform support code (#29342)
5 years ago
YUNSHEN XIE 2926e74326
New UT should not exceed 15s (#29492)
5 years ago
QingshuChen 79a41a9ed6
support roi_align & affine_channel for kunlun (#29561)
5 years ago
Wilber 740c0d58c3
update for xpu ci. (#29568)
5 years ago
LoveAn b5d4a1f33d
Add the strategy of skipping cc/cu test compilation and execution in CI (#29499)
5 years ago
Wilber 5fe1f8aff7
update lite tag (#29517)
5 years ago
procr 3a0558339d
support mobilenet for kunlun (#29458)
5 years ago
taixiurong ecca6585cd
1. fix elementwise ops' bug 2. fix softmax_with_cross_entropy_op 3. add biliner_interp_op (#29448)
5 years ago
Wilber ad01658e36
fix cmake error message. (#29421)
5 years ago
LoveAn 671555ed32
Compiling operator libraries with Unity build (#29130)
5 years ago
Wilber 6cb688865a
update lite tag. (#29392)
5 years ago
Wilber cff93b52a7
update cmake for FT openblas version. (#29382)
5 years ago
wanghuancoder 3765da98c7
add coverage incremental switch, test=develop (#29290)
5 years ago
Shang Zhizhou fc80d2e09c
add compile option WITH_TENSORRT (#29208)
5 years ago
QingshuChen 64f29fbb70
update kunlun conv2d/softmax/elementwise implementation (#29229)
5 years ago
wanghuancoder 2b2cd1864a
revert python file coverage, delete coverage run --include, test=develop (#29230)
5 years ago
Wilber 4fec182d24
[Lite-Subgraph] Fix compile error for lite subgraph. (#29146)
5 years ago
wanghuancoder 0239f79695
Generate code coverage reports only for incremental files (#28508)
5 years ago
Zhou Wei e668cb07fb
fix CUDA 11 error on windows (#29101)
5 years ago