Commit Graph

12390 Commits (5bf25d1e8b6eef2eea8aa24f5dbacea0b832aae2)

Author SHA1 Message Date
WeiXin 5ff4f1ad5e
move 'load_op_library','LayerHelper' to 'paddle/incubate' (#30339)
4 years ago
Huihuang Zheng cd5f11b822
Decrease Batch Size for Windows CI, test=develop (#30331)
4 years ago
cc 8e3a294045
skip quantizing ops in cpu inference (#30342)
4 years ago
Bai Yifan ad6fee2fa8
fix quantize error in speical naming model (#30354)
4 years ago
huangxu96 342d62de60
add amp example document (#30314)
4 years ago
Huihuang Zheng 017a534888
Decrease Mac Input Size Because of CI Short Memory (#30330)
4 years ago
Leo Chen 3d015f1cf5
Set expected place in child thread for dataloader to avoid costing cuda memory on other card (#30338)
4 years ago
QingshuChen 2c1bba02e4
optimize memcpy perf for kunlun (#30291)
4 years ago
cnn 10ae31579b
update error information (#30277)
4 years ago
huangxu96 ee623bff64
Implemented AddQuantDequantPass in imperative quantization. (#26692)
4 years ago
ShenLiang a60f17b89d
Support unused parameters in dynamic graph distributed (#30224)
4 years ago
JZ-LIANG 75936d838f
Recompute Offload (#30233)
4 years ago
houj04 dc12b5eedf
resolve #30141 (#30145)
4 years ago
lidanqing a238298659
Skip some conv2d_int8 tests in windows (#30128)
4 years ago
Wojciech Uss fc42faffc2
Wojtuss/upgrade one dnn 2.0 (#30295)
4 years ago
tangwei12 5e839e4da5
add sparse embedding & load vars for 2.0 & gloo bug fix (#30306)
4 years ago
YUNSHEN XIE da3ab010e0
disable test_pipeline (#30204)
4 years ago
tangwei12 25f80fd304
Fix/distributed proto (#29981)
4 years ago
Chengmo d479ae1725
【Paddle.Fleet】Support local save sparse param (#30175)
4 years ago
chajchaj 113810c557
fix bug of celoss when using ignore_index and reduction (#30180)
4 years ago
Double_V 231501fefc
fix elugradgrad test fail & error message opt (#30171)
4 years ago
Zhen Wang fb49ea388e
Fix the accuracy problem of allclose op when using float64 data type in static mode. (#29890)
4 years ago
furnace 77051cc9f0
add fp16 support for tril_triu op (#30186)
4 years ago
LielinJiang 86d81af5ef
reduce unittest time of test_datasets (#30275)
4 years ago
liym27 b4989fb744
Support vector<double> as type of op attribute and op set_value suppport vector<double> as value (#30126)
4 years ago
furnace c6296b2b0e
fix empty op unit test fail sometimes (#30225)
4 years ago
AshburnLee 924aac2216
Add tf32 switch for cuDNN (#29192)
4 years ago
chentianyu03 c7371b7b20
type promotion for grad (#30177)
4 years ago
YUNSHEN XIE 42a6442a08
disable ut test_tsm on windows (#30017)
4 years ago
Jiaqi Liu b7335b4db7
Alias from paddle.fluid.layers.auc to paddle.static.auc (#30206)
4 years ago
WeiXin edafb5465a
Fix bug for 'save mutiple method' (#30218)
4 years ago
gongweibao 8700a7bd90
Fix unittests bugs. (#30250)
4 years ago
Bai Yifan dd6f591991
fix test_pool3d_op timeout issue (#30248)
4 years ago
Huihuang Zheng c372a76303
Add Static Variable Clone (#30208)
4 years ago
XiaoguangHu 6bfdef727e
clean redundant API alias in 2.0 - part 2 (#30013)
4 years ago
LielinJiang e6a1e8757d
Delete incorrect warning message (#30196)
4 years ago
wangchaochaohu af80859dd6
reduce the occupied size of memory for the fused pattern of elementwise_add Op and activation Op(relu Op for example) (#29885)
4 years ago
pangyoki da16b33f2e
add View(reuse allocation) strategy on squeeze, unsqueeze, reshape, flatten op (#29913)
4 years ago
huangxu96 be5c2e6050
fix windows bug (#29993)
4 years ago
Chen Weihang 3016ba852e
remove distributed prepare context (#30219)
4 years ago
Zhen Wang 7f7dfccf20
Support pure fp16 training for AMP API. (#29544)
4 years ago
Leo Chen 8696335f86
Fix dtype of ungenerated grad var (#28511)
4 years ago
Aurelius84 03e072736e
Skip convert tensor shape while using Paddle.shape (#30223)
4 years ago
liym27 49411a20da
In creation.assgin, reuse implamention code of layers.tensor.assign to avoid maintain two code (#30227)
4 years ago
littletomatodonkey e03171b7c7
fix pad (#30222)
4 years ago
liym27 31ed9a5ed3
[Dy2Stat] Use Paddle2.0 api paddle.tensor.array_* (#30156)
4 years ago
liym27 ad55f609d5
[Dy2Stat] Don't convert to paddle.shape if var_x.shape is not negetive (#29965)
4 years ago
Leo Chen 1f97d61c68
Add callback after TensorCopy (#30123)
4 years ago
liym27 b2483d78a8
Fix test_slice: avoid unnecessary copying of TensorArray from subblock to parent block(#30168)
4 years ago
Chengmo 528e03fc08
【Paddle.Fleet】Fix tensor table (#30075)
4 years ago
guofei 1bdf924217
Quantization supports 2.0 APIs (#30036)
4 years ago
Chen Weihang d0fb06b27f
[Complex] Simplify prepared op impl to improve performance (#30153)
4 years ago
Chen Weihang e503470700
try multi times for sys.exit (#30188)
4 years ago
WangXi 619c62bb48
fix adamw apply gradient (#30130)
4 years ago
LutaoChu 1ff69f58b6
fix paddle.pow doc, test=document_fix (#30159)
4 years ago
wangchaochaohu 7dd551e08b
refine the paddle place support using str (#28769)
4 years ago
Chen Weihang 8020e34e7c
Simplify the options of spawn based on fleetrun (#30144)
4 years ago
tangwei12 4763e6bc4e
pre padding in dygraph (#30163)
4 years ago
123malin 198fbdfb60
Add Lookahead and ModelAverage Optimizer (#30004)
4 years ago
ceci3 6a19e41f1f
fix syncbn convert (#30158)
4 years ago
Leo Chen adac38c506
add dispenable input for core.ops.reshape2/expand/slice (#30072)
4 years ago
Zhou Wei 30888ca343
Polish and Optimize the print/repr information of Layer (#29998)
4 years ago
WeiXin f3a2392662
Extend the timeout for the (#30151)
4 years ago
Zhou Wei 9c99d37906
fix unittest failed on windows (#29837)
4 years ago
liym27 9922bd4125
Fix bug: In dynamic mode, if start or end is negetive, __getitem__ return wrong result(#30003)
4 years ago
gongweibao 4d2a4bb27a
fix logs info test=develop (#30071)
4 years ago
ceci3 a125d6331f
fix bn docs (#30096)
4 years ago
ceci3 334247791a
add attribute for batch_norm (#29950)
4 years ago
Jiaqi Liu 2e8425b693
Fix beam search bug (#29824)
4 years ago
WeiXin f43e1d8c57
Support storage of large parameters (#29988)
4 years ago
chentianyu03 666e665132
change the kron gradient when complex types (#29995)
4 years ago
WangXi ab04997846
[fleet] combine amp and gradient merge, test=develop (#30086)
4 years ago
wanghuancoder 88e6dc4ac5
optimize momentum to speedup dygraph, a little, test=develop (#30099)
4 years ago
Thunderbrook 0b8e1fadc5
add topo-aware in heter-ps (#30087)
4 years ago
gongweibao eea7090c26
fix selected_gpus test=develop (#30044)
4 years ago
cc 1fa863da40
Support dygraph quant model (#29927)
4 years ago
Chen Weihang 46c4695421
Set FLAGS_selected_gpus for spawn (#29962)
4 years ago
WangXi ee16006b5d
Optimization grad merge performance (#29784)
4 years ago
xiaoting 4d395203a2
Add alias for upsample (#29983)
4 years ago
lilong12 9e51e3833f
update, test=develop (#30047)
4 years ago
chentianyu03 e012930aa3
complex gradient matmul (#29966)
4 years ago
lilong12 b0bd93de00
Disable gloo by default (#29805)
4 years ago
ShenLiang b6fd262951
fix gather nd for untest (#30037)
4 years ago
Leo Chen a253a78a85
fix error message (#30020)
4 years ago
lilong12 2bc5121da8
add the paddle.distributed.split api (#29970)
4 years ago
cc c3c064a8fc
Add mkldnn nearest_interp and bilinear_interp op (#30016)
4 years ago
zhupengyang 65d4ff753b
hardsigmoid add attr slope and offset (#29999)
4 years ago
tangwei12 ed856d254e
fix ut (#29989)
4 years ago
cc 62f455e023
Support quantizing program_desc (#29526)
4 years ago
Chen Long af37285870
fix code bugs (#29932)
4 years ago
guofei 8212874f47
Fix test_imperative_skip_out (#29939)
4 years ago
LielinJiang ec2fad4d51
Fix rotation bug when use cv2 backend (#29933)
4 years ago
Chen Weihang a1d9a14e89
support grad accumulated across batch (#29942)
4 years ago
liuyuhui bb20dcfc1a
[Kunlun] bug fix of PR2: Support MultiDevicePass and BKCL in parallel executor (#29961)
4 years ago
wawltor 587b67ef62
fix the state_dict bug for the xpu (#29888)
4 years ago
QingshuChen f4be9d6a32
add bkcl.so in whl for kunlun (#29947)
4 years ago
XiaoguangHu 726c78f293
clean redundant API alias in 2.0 - part 1 (#29928)
4 years ago
liym27 14bd77f941
[Windows CI test] Enable unittest test_optimizer_in_control_flow and remove unnecessay code (#29851)
4 years ago
Wilber 332da133a1
Support mips arch (#29903)
4 years ago
littletomatodonkey 5c162fe66e
fix reg api ut fail (#29921)
4 years ago