Commit Graph

29834 Commits (fef3654b4e76f5e2cc9a5f71c1c047cef82192e5)
 

Author SHA1 Message Date
WangXi 572c466d19
[Prepare for MultiProcess xpu] unified gen nccl id, refine imperative reducer (#30455)
4 years ago
ykkk2333 549855ac20
add rmsprop_op_xpu test=kunlun (#30493)
4 years ago
Zhou Wei fb20ec9a4e
fix bug of multicard grad ncclAllReduce (#30553)
4 years ago
Zhen Wang f30d00553a
Fix the compiling error of update_loss_scaling when using cuda9. (#30538)
4 years ago
Leo Chen 81217a94d8
unify calling cudaSetDevice (#30470)
4 years ago
pangyoki 00554b3f6b
fix error message of Inplace strategy (#30520)
4 years ago
QingshuChen d849ecc0ae
update kunlun dependence for aarch64 & sunway platform (#30516)
4 years ago
Leo Chen 7043b8cfc6
support layer_norm fp16 in dygraph amp (#30430)
4 years ago
wuhuanzhou 28eb7b6589
fix logs dir error with auto retry, test=document_fix (#30466)
4 years ago
Zhang Ting 66c514ce83
[2.0 API] device guard (#30307)
4 years ago
WangXi 7a0a576e51
fix adamw lr_to_coeff is fixed when dygraph (#30526)
4 years ago
wanghuancoder 59ad6ff3e3
delete empty line of pybing.cc, test=develop (#30529)
4 years ago
cc ce6777fcdf
Fix bug of supporting channelwise dygraph quantized model, test=develop (#30531)
4 years ago
WeiXin c0fb03a0dc
Supplement PR29988(https://github.com/PaddlePaddle/Paddle/pull/29988) (#30507)
4 years ago
hutuxian 9fec1618d2
Ascend Framework Part3: Ascend Parser (#30391)
4 years ago
hutuxian e207fe6385
Ascend Framework Part2: pybind files (#30410)
4 years ago
hutuxian 40ede12631
Ascend Framework Part1: OP & Wrapper (#30281)
4 years ago
Zhang Ting 34bf8dfc40
avoid calling cast twice (#30527)
4 years ago
gongweibao bdae7ed326
Fix potential port conflicts. (#30508)
4 years ago
liuyuhui 843dc3cdbd
[Kunlun]PR3: add xpu executor, multi xpu card train function optimization (#30317)
4 years ago
QingshuChen 8489d4f76f
optimize batch_norm & pool op for kunlun (#30490)
4 years ago
wanghuancoder bd97192274
if pybind.cc changed, generate total report, test=develop (#30514)
4 years ago
taixiurong 5e5c2827a3
fix range op crash in dygraph xpu place (#30469)
4 years ago
WeiXin 18ecd433f5
Avoid bug on 'MAC python3.5/6'. (#30485)
4 years ago
JZ-LIANG 16ba0abc79
Recompute Offload: fixed bug in memcpy (#30484)
4 years ago
lijianshe02 d8a9ba56ef
fix random seed in nll_loss unittest test=develop (#30468)
4 years ago
cc 5d8d463cf7
Collect weight threshold for lstm op in post_training_quantization (#28701)
4 years ago
guofei 11e78ebaa3
Modify the calculation logic of LambOptimizer (#29313)
4 years ago
Adam Osewski c5ffad126c
[oneDNN] Refactor fuse pass helper functions to one place. (#30460)
4 years ago
LielinJiang 1d7bf1de2b
Update voc dataset url (#30450)
4 years ago
Zhang Ting c9a334e1b3
add VecCastCUDAKernel (#30296)
4 years ago
pangyoki 13d757362c
Add Inplace strategy (Output reuse Input Varbase) in dygraph (#30103)
4 years ago
Yang Zhang 008b0a8b56
Fix float64 bug in layer norm (#30452)
4 years ago
石晓伟 715d862868
export global google flags to users, test=develop (#30448)
4 years ago
Wojciech Uss 88fc7a7d68
fix cache key for inplaced elementwise ops (#30404)
4 years ago
WeiXin e5bb4edb2c
perfect 'var_list' of static.load/fluid.load (#30457)
4 years ago
123malin 05f06d9ae1
test=develop, fix fleet.metric (#30438)
4 years ago
wawltor 3d49882e2c
fix the rnn mask memory bug for out of read (#30459)
4 years ago
tianshuo78520a f090066e85
Clean dockerfiles (#30401)
4 years ago
taixiurong 6a3c8725b0
support transformer v2.0 (#30381)
4 years ago
ShenLiang e85be1b1b2
fix flatten api grad (#30426)
4 years ago
Zhou Wei c94a4b9468
Separate AVX and NO_AVX compilation, enhance installation error message (#30413)
4 years ago
yaoxuefeng 6e0da01c61
Heter ps new (#30198)
4 years ago
Shang Zhizhou 49e79cad39
fix jetson compile error (#30378)
4 years ago
Jiaqi Liu e395bcd1e0
add auc into 'all' list (#30310)
4 years ago
Chengmo 859431aadb
fix ps init(#30397)
4 years ago
123malin 2a98e9323a
test=develop, add distributed_infer (#30300)
4 years ago
Wilber 96784ed6c8
fix compile error on ARM (#30398)
4 years ago
Chen Weihang ae1f32091a
fix prune input bug (#30384)
4 years ago
QingshuChen cf786d22ec
fix bug that cann't find mkldnn(kunlun) (#30394)
4 years ago