Commit Graph

12202 Commits (adaec0073d02c0ea55bcabc4671ebfc8dbd3182c)

Author SHA1 Message Date
Shang Zhizhou 200ee33df8
fix unittest random error (#30808)
5 years ago
xiemoyuan db87087283
Optimize the encoder of Transformer. (#30439)
5 years ago
WangXi 31ed9c9eed
Fleet distributed strategy support pure fp16 (#30754)
5 years ago
Aurelius84 2c974cc316
【CustomOp】support setup.py to compile custom op (#30753)
5 years ago
Jiaqi Liu 65a9744cfd
fix paddle.static.acc and auc sample code bug, test=document_fix (#30715)
5 years ago
Wojciech Uss fc00240575
A fix for oneDNN matmul kernel. Fixes issue #30309 (#30723)
5 years ago
tianshuo78520a a12b6bb9cb
add readme in whl package (#30726)
5 years ago
WeiXin 3491acfb1e
Split unittest. (#30727)
5 years ago
liu zhengxi a87d78f1a9
update gather_tree doc (#30693)
5 years ago
liu zhengxi fef3654b4e
upgrade gather_tree to core.ops (#30697)
5 years ago
jakpiase f8da5536ed
REUPLOAD Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30719)
5 years ago
liym27 13ef444fa6
[Dy2Stat] Fix error message when the message has more than one lines. (#30714)
5 years ago
Tao Luo 824a79d383
Revert "Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30661)" (#30708)
5 years ago
jakpiase d834f4e6e8
Added vanilla LSTM and LSTM with peepholes oneDNN fp32 kernel (#30661)
5 years ago
Leo Chen 1a13626f5f
polish printing dtype (#30682)
5 years ago
WangXi a28a202603
fix test_gen_nccl_id_op failed (#30686)
5 years ago
123malin 164275704d
test=develop, fix nonzero astuple=true (#30647)
5 years ago
yingshengBD 0eea5d714f
post quantize support insert fake_quantize_dequantize node before the OPs that will be used in VIS's faceid models (#30659)
5 years ago
123malin 06a3e31148
test=develop, fix test_lookahead (#30677)
5 years ago
yukavio 8c5f158172
remove PrettyTable dependence from paddle.flops (#30675)
5 years ago
chentianyu03 fb7fbc7a5d
fix abs bug and add abs test case (#30637)
5 years ago
ShenLiang 9514b4aa5f
Fix scatter grad bug (#30604)
5 years ago
Qi Li 1f5841c2a0
[ROCM] update cmake and dockerfile, test=develop (#30598)
5 years ago
Zhen Wang 4a9de931a2
Fix the bug in fleet amp_init. (#30606)
5 years ago
cnn 7e9f336b58
update document of paddle.vision.dataset, test=document (#30414)
5 years ago
guofei 430f8449f1
Fix the error of save_quantized_model (#30583)
5 years ago
TTerror 10271ddfc4
support reduce_max op on kunlun (#30581)
5 years ago
WeiXin ca33821475
延长单测'test_static_save_load'超时 (#30599)
5 years ago
chentianyu03 358106fcb0
make abs op support complex types (#30375)
5 years ago
huangxu96 138620084c
Add fleet amp_init() (#30572)
5 years ago
wanghuancoder 27a5c0cff6
fix layers train eval bug (#30580)
5 years ago
lilong12 8126a41d73
fix the bug of all_reduce pipeline gradient multiple times (#30437)
5 years ago
Aurelius84 621bc4f771
[Dy2static]Fix paddle prefix in is_paddle_api (#30569)
5 years ago
tangwei12 c9e78a22c5
add trainers for pserver (#30523)
5 years ago
Aurelius84 5067e3a8d2
[Dy2Static]Enhance check of TracedLayers out vars (#30576)
5 years ago
liym27 ff25c5b36f
Fix bug: GetAttrValue should deal with attr with attrType vector<double> (#30536)
5 years ago
WangXi 572c466d19
[Prepare for MultiProcess xpu] unified gen nccl id, refine imperative reducer (#30455)
5 years ago
ykkk2333 549855ac20
add rmsprop_op_xpu test=kunlun (#30493)
5 years ago
Leo Chen 7043b8cfc6
support layer_norm fp16 in dygraph amp (#30430)
5 years ago
Zhang Ting 66c514ce83
[2.0 API] device guard (#30307)
5 years ago
WangXi 7a0a576e51
fix adamw lr_to_coeff is fixed when dygraph (#30526)
5 years ago
cc ce6777fcdf
Fix bug of supporting channelwise dygraph quantized model, test=develop (#30531)
5 years ago
WeiXin c0fb03a0dc
Supplement PR29988(https://github.com/PaddlePaddle/Paddle/pull/29988) (#30507)
5 years ago
hutuxian 9fec1618d2
Ascend Framework Part3: Ascend Parser (#30391)
5 years ago
hutuxian 40ede12631
Ascend Framework Part1: OP & Wrapper (#30281)
5 years ago
Zhang Ting 34bf8dfc40
avoid calling cast twice (#30527)
5 years ago
gongweibao bdae7ed326
Fix potential port conflicts. (#30508)
5 years ago
QingshuChen 8489d4f76f
optimize batch_norm & pool op for kunlun (#30490)
5 years ago
taixiurong 5e5c2827a3
fix range op crash in dygraph xpu place (#30469)
5 years ago
WeiXin 18ecd433f5
Avoid bug on 'MAC python3.5/6'. (#30485)
5 years ago
JZ-LIANG 16ba0abc79
Recompute Offload: fixed bug in memcpy (#30484)
5 years ago
lijianshe02 d8a9ba56ef
fix random seed in nll_loss unittest test=develop (#30468)
5 years ago
cc 5d8d463cf7
Collect weight threshold for lstm op in post_training_quantization (#28701)
5 years ago
guofei 11e78ebaa3
Modify the calculation logic of LambOptimizer (#29313)
5 years ago
LielinJiang 1d7bf1de2b
Update voc dataset url (#30450)
5 years ago
pangyoki 13d757362c
Add Inplace strategy (Output reuse Input Varbase) in dygraph (#30103)
5 years ago
WeiXin e5bb4edb2c
perfect 'var_list' of static.load/fluid.load (#30457)
5 years ago
123malin 05f06d9ae1
test=develop, fix fleet.metric (#30438)
5 years ago
taixiurong 6a3c8725b0
support transformer v2.0 (#30381)
5 years ago
Zhou Wei c94a4b9468
Separate AVX and NO_AVX compilation, enhance installation error message (#30413)
5 years ago
Jiaqi Liu e395bcd1e0
add auc into 'all' list (#30310)
5 years ago
Chengmo 859431aadb
fix ps init(#30397)
5 years ago
123malin 2a98e9323a
test=develop, add distributed_infer (#30300)
5 years ago
Chen Weihang ae1f32091a
fix prune input bug (#30384)
5 years ago
WeiXin 5ff4f1ad5e
move 'load_op_library','LayerHelper' to 'paddle/incubate' (#30339)
5 years ago
Huihuang Zheng cd5f11b822
Decrease Batch Size for Windows CI, test=develop (#30331)
5 years ago
cc 8e3a294045
skip quantizing ops in cpu inference (#30342)
5 years ago
Bai Yifan ad6fee2fa8
fix quantize error in speical naming model (#30354)
5 years ago
huangxu96 342d62de60
add amp example document (#30314)
5 years ago
Huihuang Zheng 017a534888
Decrease Mac Input Size Because of CI Short Memory (#30330)
5 years ago
Leo Chen 3d015f1cf5
Set expected place in child thread for dataloader to avoid costing cuda memory on other card (#30338)
5 years ago
QingshuChen 2c1bba02e4
optimize memcpy perf for kunlun (#30291)
5 years ago
cnn 10ae31579b
update error information (#30277)
5 years ago
huangxu96 ee623bff64
Implemented AddQuantDequantPass in imperative quantization. (#26692)
5 years ago
ShenLiang a60f17b89d
Support unused parameters in dynamic graph distributed (#30224)
5 years ago
JZ-LIANG 75936d838f
Recompute Offload (#30233)
5 years ago
lidanqing a238298659
Skip some conv2d_int8 tests in windows (#30128)
5 years ago
Wojciech Uss fc42faffc2
Wojtuss/upgrade one dnn 2.0 (#30295)
5 years ago
tangwei12 5e839e4da5
add sparse embedding & load vars for 2.0 & gloo bug fix (#30306)
5 years ago
YUNSHEN XIE da3ab010e0
disable test_pipeline (#30204)
5 years ago
tangwei12 25f80fd304
Fix/distributed proto (#29981)
5 years ago
Chengmo d479ae1725
【Paddle.Fleet】Support local save sparse param (#30175)
5 years ago
chajchaj 113810c557
fix bug of celoss when using ignore_index and reduction (#30180)
5 years ago
Double_V 231501fefc
fix elugradgrad test fail & error message opt (#30171)
5 years ago
Zhen Wang fb49ea388e
Fix the accuracy problem of allclose op when using float64 data type in static mode. (#29890)
5 years ago
furnace 77051cc9f0
add fp16 support for tril_triu op (#30186)
5 years ago
LielinJiang 86d81af5ef
reduce unittest time of test_datasets (#30275)
5 years ago
liym27 b4989fb744
Support vector<double> as type of op attribute and op set_value suppport vector<double> as value (#30126)
5 years ago
furnace c6296b2b0e
fix empty op unit test fail sometimes (#30225)
5 years ago
AshburnLee 924aac2216
Add tf32 switch for cuDNN (#29192)
5 years ago
chentianyu03 c7371b7b20
type promotion for grad (#30177)
5 years ago
YUNSHEN XIE 42a6442a08
disable ut test_tsm on windows (#30017)
5 years ago
Jiaqi Liu b7335b4db7
Alias from paddle.fluid.layers.auc to paddle.static.auc (#30206)
5 years ago
WeiXin edafb5465a
Fix bug for 'save mutiple method' (#30218)
5 years ago
gongweibao 8700a7bd90
Fix unittests bugs. (#30250)
5 years ago
Bai Yifan dd6f591991
fix test_pool3d_op timeout issue (#30248)
5 years ago
Huihuang Zheng c372a76303
Add Static Variable Clone (#30208)
5 years ago
XiaoguangHu 6bfdef727e
clean redundant API alias in 2.0 - part 2 (#30013)
5 years ago
LielinJiang e6a1e8757d
Delete incorrect warning message (#30196)
5 years ago
wangchaochaohu af80859dd6
reduce the occupied size of memory for the fused pattern of elementwise_add Op and activation Op(relu Op for example) (#29885)
5 years ago