29796 Commits (c956c035dc9dd3587882b53f65c374cdce1edeb2)

Author SHA1 Message Date
Leo Chen c956c035dc
fix cmake of cryptopp to avoid downloading every time (#31451)
4 years ago
Leo Chen 83f81eb573
Fix pow, refine code (#31440)
4 years ago
Leo Chen 5fe3d596e4
Fix pow, use fillD instead of broadcast (#31433)
4 years ago
zhang wenhui ecc6e213d7
fix endif (#31431)
4 years ago
zhang wenhui b3c88e961c
[NPU] Support npu kernel for shape op (#31427)
4 years ago
Leo Chen ac3d821bc0
[NPU] add npu kernel for equal op (#31393)
4 years ago
Leo Chen 0310945f5c
[NPU] Support npu op layer_norm and layer_norm_grad (#31310)
4 years ago
Void Main 45765d6eb6
Refactor HCCLCommContext to be compatible with Paddle (#31359)
4 years ago
Leo Chen 8497e2aad3
[NPU] add npu kernel for elementwise_add_grad (#31347)
4 years ago
lw921014 9fcdaeba5e
add allreduce and broadcast without test (#31024)
4 years ago
Leo Chen 5618f14047
fix reading flags from env (#31329)
4 years ago
liym27 a1ddff81e3
[NPU] Support npu op: (1) slice (2) slice_grad (#31275)
4 years ago
Leo Chen d23bf89cf6
support list of list attribute for NPU (#31299)
4 years ago
liym27 77a0c41cb2
Fix pow npu fp16 test (#31256)
4 years ago
liym27 187248f568
[NPU] Support npu op pow and pow grad (#31247)
4 years ago
xiayanming 821c2f4ef8
add ascend unittest (#31249)
4 years ago
Leo Chen d45f5d787e
Fix typo of selected_npus (#31230)
4 years ago
xiayanming 387c1db4f1
Ascendrc (#31065)
4 years ago
Leo Chen ff4654e216
refactor npu device manager (#31154)
4 years ago
liym27 1435b4c096
[NPU] Support executor with NPU (#31057)
4 years ago
Leo Chen 678a3e8fed
support adding correct npu op in pybind.h (#31143)
4 years ago
Leo Chen 85cbd55648
Fix compilation problem (#31100)
4 years ago
Leo Chen 5cb20f30fc
add npu kernel for elementwise_sub and elementwise_sub_grad (#30973)
4 years ago
gongweibao c687edecd8
Fix reshape on GE graph. (#31084)
4 years ago
xiayanming a6edbc478b
support parsing ascend rank table file (#31000)
4 years ago
Leo Chen 1201cd2ef2
[feature] support npu allocator, part 2 (#30972)
4 years ago
Leo Chen 7e049108c5
[feature] support npu operator (#30951)
4 years ago
Leo Chen 81138239db
[feature] support npu allocator (#30840)
4 years ago
gongweibao ebef6601d5
Destroy session first. (#30954)
4 years ago
Leo Chen 500f28ec37
pass cxx_flags to gloo cmake (#30857)
4 years ago
gongweibao de42d19336
Add paddle ascend distribution training supported (#30796)
4 years ago
OleNet ebb5d181e8
Ascendrc add converted op : [range/equal/range/uniform_random/expand/squeeze], fix cast op bug (#30797)
4 years ago
dingsiyu 4a26729540
Merge ascend_optimizer and ascend_parser. (#30776)
4 years ago
gongweibao 636fefd9f8
code style (#30781)
4 years ago
Leo Chen 88dfd067bf
Dev/fix ascend string (#30749)
4 years ago
Leo Chen 6eabbc8076
fix compilation on ascend-20.1 (#30722)
4 years ago
Void Main 904cc44349
[Feature] Build parser to support distributed training (#30658)
4 years ago
gongweibao 5b77b259d8
cleanup (#30646)
4 years ago
gongweibao 7158061a29
Add startup bash files of test_ascend_group. (#30645)
4 years ago
gongweibao e4287ca60b
Add Hccl program group (#30642)
4 years ago
gongweibao f5aca8fbb4
Pass device_ids info from launch to trainer. (#30632)
4 years ago
Void Main d2404da768
Build praser for Hcom* operators (#30627)
4 years ago
gongweibao f9c97dd728
Add distribution supported (#30578)
4 years ago
gongweibao 1882f2ce2d
Fix compilcation on CANN20.1 and older (#30494)
4 years ago
hutuxian 6dd52c5b25
Ascend rc (#30483)
4 years ago
石晓伟 715d862868
export global google flags to users, test=develop (#30448)
4 years ago
Wojciech Uss 88fc7a7d68
fix cache key for inplaced elementwise ops (#30404)
4 years ago
WeiXin e5bb4edb2c
perfect 'var_list' of static.load/fluid.load (#30457)
4 years ago
123malin 05f06d9ae1
test=develop, fix fleet.metric (#30438)
4 years ago
wawltor 3d49882e2c
fix the rnn mask memory bug for out of read (#30459)
4 years ago