29611 Commits (ab04997846bdc7497772987604e30889ed60cc88)

Author SHA1 Message Date
WangXi ab04997846
[fleet] combine amp and gradient merge, test=develop (#30086)
4 years ago
wanghuancoder 88e6dc4ac5
optimize momentum to speedup dygraph, a little, test=develop (#30099)
4 years ago
liuyuhui 254ad61959
fix xpu pe sync, test=notest (#30095)
4 years ago
Thunderbrook 0b8e1fadc5
add topo-aware in heter-ps (#30087)
4 years ago
hong 297fff1a79
support dygraph in xpu place (#30051)
4 years ago
gongweibao eea7090c26
fix selected_gpus test=develop (#30044)
4 years ago
wangchaochaohu d0a5620575
fix the compiler error when gcc4 cuda9.0 (#29997)
4 years ago
cc 1fa863da40
Support dygraph quant model (#29927)
4 years ago
Chen Weihang 46c4695421
Set FLAGS_selected_gpus for spawn (#29962)
4 years ago
WangXi ee16006b5d
Optimization grad merge performance (#29784)
4 years ago
yongqiangma e891f4da1b
Add p_norm op version info (#30042)
4 years ago
tangwei12 7d1c149e09
for inference checkpoint (#30081)
4 years ago
tangwei12 7d4bdff07d
fix large scale memory (#30035)
4 years ago
Shang Zhizhou 08dc5bc27e
fix op version checker of pass bug (#30028)
4 years ago
cc 68398abce9
[Inference] zero_copy_tensor supports int8_t (#30053)
4 years ago
whs 1b999d2b5d
Add version checking (#30040)
4 years ago
xiaoting 4d395203a2
Add alias for upsample (#29983)
4 years ago
ceci3 85b2f05ab0
register ModifyAttr for instance_norm, test=op_version (#30065)
4 years ago
channings ddcff254db
fix op_register_version for compare ops, test=op_version (#30007)
4 years ago
Wilber 66e16b7e99
update lite subgraph. (#30056)
4 years ago
GaoWei8 a64822589f
add REGISTER_OP_VERSION for LSTM (#30038)
4 years ago
yinhaofeng 6e93fb92f9
Register op version for linspace,test=op_version (#30025)
4 years ago
lilong12 9e51e3833f
update, test=develop (#30047)
4 years ago
123malin d0056c324d
test=develop, add op_register_version for roll_op (#30023)
4 years ago
chentianyu03 e012930aa3
complex gradient matmul (#29966)
4 years ago
lilong12 b0bd93de00
Disable gloo by default (#29805)
4 years ago
ShenLiang b6fd262951
fix gather nd for untest (#30037)
4 years ago
Leo Chen a253a78a85
fix error message (#30020)
4 years ago
ShenLiang 893d37e5c6
Fix rank_attention op_version, test=op_version (#30006)
4 years ago
lilong12 2bc5121da8
add the paddle.distributed.split api (#29970)
4 years ago
Adam Osewski 13aef97043
operator checkpoints for new attributes. (#29832)
4 years ago
wangguanzhong 844d8e0c2c
add REGISTER_OP_VERSION for generate_proposals, roi_align, roi_pool test=op_version (#30034)
4 years ago
cc c3c064a8fc
Add mkldnn nearest_interp and bilinear_interp op (#30016)
4 years ago
zhupengyang 65d4ff753b
hardsigmoid add attr slope and offset (#29999)
4 years ago
chalsliu c053bf2a57
Revert "register ModifyAttr for instance_norm, test=op_version (#29938)"
4 years ago
wawltor cc2f94620c
add the support the op version check for matmul, test=op_version (#30011)
4 years ago
wawltor b33aaea86c
add the op version check for the elementwise ops, test=op_version (#30010)
4 years ago
tangwei12 ed856d254e
fix ut (#29989)
4 years ago
Chengmo 4cbcc9b6da
fix momentum op register (#29941)
4 years ago
hutuxian 7c1f69bdf0
add op_version for flip op [test=op_version] (#30019)
4 years ago
ceci3 77c1684397
register ModifyAttr for instance_norm, test=op_version (#29938)
4 years ago
cc 62f455e023
Support quantizing program_desc (#29526)
4 years ago
Leo Chen 47d10c55d5
Enhance debugging (#30001)
4 years ago
Chen Long 453a57b448
Readme update (#30009)
4 years ago
Chen Long af37285870
fix code bugs (#29932)
4 years ago
FlyingQianMM d42f93e504
add op_register_version for allclose op; test=op_version (#29968)
4 years ago
wawltor 8f49f9d5c9
change the elementwise ops version check, test=op_version
4 years ago
guofei b23faf37be
Add moving_average_abs_max_scale op_register_version test=develop (#29957)
4 years ago
Thunderbrook 0ca6de171f
add include (#29952)
4 years ago
wuhuanzhou 898486dd46
Add direction info log and filter disabled ops in PR-CI-OP-benchmark (#29946)
4 years ago