12347 Commits (6a3c8725b01dedbc10f99f431ba5a4541e0e431e)

Author SHA1 Message Date
liym27 49411a20da
In creation.assgin, reuse implamention code of layers.tensor.assign to avoid maintain two code (#30227)
4 years ago
littletomatodonkey e03171b7c7
fix pad (#30222)
4 years ago
liym27 31ed9a5ed3
[Dy2Stat] Use Paddle2.0 api paddle.tensor.array_* (#30156)
4 years ago
liym27 ad55f609d5
[Dy2Stat] Don't convert to paddle.shape if var_x.shape is not negetive (#29965)
4 years ago
Leo Chen 1f97d61c68
Add callback after TensorCopy (#30123)
4 years ago
liym27 b2483d78a8
Fix test_slice: avoid unnecessary copying of TensorArray from subblock to parent block(#30168)
4 years ago
Chengmo 528e03fc08
【Paddle.Fleet】Fix tensor table (#30075)
4 years ago
guofei 1bdf924217
Quantization supports 2.0 APIs (#30036)
4 years ago
Chen Weihang d0fb06b27f
[Complex] Simplify prepared op impl to improve performance (#30153)
4 years ago
Chen Weihang e503470700
try multi times for sys.exit (#30188)
4 years ago
WangXi 619c62bb48
fix adamw apply gradient (#30130)
4 years ago
LutaoChu 1ff69f58b6
fix paddle.pow doc, test=document_fix (#30159)
4 years ago
wangchaochaohu 7dd551e08b
refine the paddle place support using str (#28769)
4 years ago
Chen Weihang 8020e34e7c
Simplify the options of spawn based on fleetrun (#30144)
4 years ago
tangwei12 4763e6bc4e
pre padding in dygraph (#30163)
4 years ago
123malin 198fbdfb60
Add Lookahead and ModelAverage Optimizer (#30004)
4 years ago
ceci3 6a19e41f1f
fix syncbn convert (#30158)
4 years ago
Leo Chen adac38c506
add dispenable input for core.ops.reshape2/expand/slice (#30072)
4 years ago
Zhou Wei 30888ca343
Polish and Optimize the print/repr information of Layer (#29998)
4 years ago
WeiXin f3a2392662
Extend the timeout for the (#30151)
4 years ago
Zhou Wei 9c99d37906
fix unittest failed on windows (#29837)
4 years ago
liym27 9922bd4125
Fix bug: In dynamic mode, if start or end is negetive, __getitem__ return wrong result(#30003)
4 years ago
gongweibao 4d2a4bb27a
fix logs info test=develop (#30071)
4 years ago
ceci3 a125d6331f
fix bn docs (#30096)
4 years ago
ceci3 334247791a
add attribute for batch_norm (#29950)
4 years ago
Jiaqi Liu 2e8425b693
Fix beam search bug (#29824)
4 years ago
WeiXin f43e1d8c57
Support storage of large parameters (#29988)
4 years ago
chentianyu03 666e665132
change the kron gradient when complex types (#29995)
4 years ago
WangXi ab04997846
[fleet] combine amp and gradient merge, test=develop (#30086)
4 years ago
wanghuancoder 88e6dc4ac5
optimize momentum to speedup dygraph, a little, test=develop (#30099)
4 years ago
Thunderbrook 0b8e1fadc5
add topo-aware in heter-ps (#30087)
4 years ago
gongweibao eea7090c26
fix selected_gpus test=develop (#30044)
4 years ago
cc 1fa863da40
Support dygraph quant model (#29927)
4 years ago
Chen Weihang 46c4695421
Set FLAGS_selected_gpus for spawn (#29962)
4 years ago
WangXi ee16006b5d
Optimization grad merge performance (#29784)
4 years ago
xiaoting 4d395203a2
Add alias for upsample (#29983)
4 years ago
lilong12 9e51e3833f
update, test=develop (#30047)
4 years ago
chentianyu03 e012930aa3
complex gradient matmul (#29966)
4 years ago
lilong12 b0bd93de00
Disable gloo by default (#29805)
4 years ago
ShenLiang b6fd262951
fix gather nd for untest (#30037)
4 years ago
Leo Chen a253a78a85
fix error message (#30020)
4 years ago
lilong12 2bc5121da8
add the paddle.distributed.split api (#29970)
4 years ago
cc c3c064a8fc
Add mkldnn nearest_interp and bilinear_interp op (#30016)
4 years ago
zhupengyang 65d4ff753b
hardsigmoid add attr slope and offset (#29999)
4 years ago
tangwei12 ed856d254e
fix ut (#29989)
4 years ago
cc 62f455e023
Support quantizing program_desc (#29526)
4 years ago
Chen Long af37285870
fix code bugs (#29932)
4 years ago
guofei 8212874f47
Fix test_imperative_skip_out (#29939)
4 years ago
LielinJiang ec2fad4d51
Fix rotation bug when use cv2 backend (#29933)
4 years ago
Chen Weihang a1d9a14e89
support grad accumulated across batch (#29942)
4 years ago
liuyuhui bb20dcfc1a
[Kunlun] bug fix of PR2: Support MultiDevicePass and BKCL in parallel executor (#29961)
4 years ago
wawltor 587b67ef62
fix the state_dict bug for the xpu (#29888)
4 years ago
QingshuChen f4be9d6a32
add bkcl.so in whl for kunlun (#29947)
4 years ago
XiaoguangHu 726c78f293
clean redundant API alias in 2.0 - part 1 (#29928)
4 years ago
liym27 14bd77f941
[Windows CI test] Enable unittest test_optimizer_in_control_flow and remove unnecessay code (#29851)
4 years ago
Wilber 332da133a1
Support mips arch (#29903)
4 years ago
littletomatodonkey 5c162fe66e
fix reg api ut fail (#29921)
4 years ago
Leo Chen a4b9daf97c
fix optimizer dtype (#29917)
4 years ago
liuyuhui 4427df37cf
[Kunlun] PR2: Support MultiDevicePass and BKCL in parallel executor (#29574)
4 years ago
LielinJiang 0b74428db8
Fix Conv2DTanspose bug when padding='same' (#29915)
4 years ago
LielinJiang 11de384c6d
Split callbacks unittest (#29914)
4 years ago
lilong12 01950ceb42
fix the bug in pipeline data parallelism (#29731)
4 years ago
YUNSHEN XIE 2a01756bf3
remove duplicate ut names (#29809)
4 years ago
Chen Weihang a6072055be
[Complex] Handle complex to real after type promotion (#29855)
4 years ago
Chen Weihang 1a304e6c06
[Complex] Add support for complex grad accumulated (#29889)
4 years ago
guofei 80eb77788f
Skip Windows Multi-GPU test of test_fetch_lod_tensor_array (#29508)
4 years ago
Leo Chen 6b258317cb
fix TransferInplaceBack (#29830)
4 years ago
QingshuChen 59b47f3b32
feat: support check_nan_inf for kunlun/xpu device (#29694)
4 years ago
wawltor 7498df2587
add the cumsum unit test for the develop (#29881)
4 years ago
wanghuancoder 26f9ab70f7
if PR have no .py files, do not use 'python coverage run', to speedup unit test (#29739)
4 years ago
Tao Luo 5d130d5670
Revert "fix conv2d int8 windows UT (#29528)" (#29869)
4 years ago
tangwei12 032414ca2a
[Feature] one ps (3/4) (#29604)
4 years ago
jakpiase edc06c6a1b
Added fc + activation fuse pass (currently only gelu, sigmoid and tanh are supported) (#29772)
4 years ago
Chen Weihang 0e0bb1b97d
replace exit method (#29862)
4 years ago
lidanqing 067d7f1d0d
fix conv2d int8 windows UT (#29528)
4 years ago
liym27 97e75ad0f5
[setitem] Support Tensor setitem in static mode (#29708)
4 years ago
YUNSHEN XIE 24ce051a84
remove duplicate ut reload (#29810)
4 years ago
Thunderbrook 09b6e71928
heter box (#29734)
4 years ago
LielinJiang 1092da82b2
Change the conditions of hapi printing logs (#29792)
4 years ago
ceci3 c4eb5d0378
fix unittest timeout (#29820)
4 years ago
chentianyu03 ddfc3d2c2f
change grad elementwise_mul for complex types (#29757)
4 years ago
chentianyu03 2a260d9b0e
change the grad of div when complex types (#29804)
4 years ago
syyxsxx e219b8ccef
fix api link for the any, all, isfinite
4 years ago
Guo Sheng 356efd36fa
Remove test_rnn_decode_api from disable list. (#29814)
4 years ago
TTerror 82aa01c373
add nearest_interp_v2 on kunlun (#29725)
4 years ago
yukavio 0f97ff0368
fix flops (#29818)
4 years ago
whs 82630408b4
Support double backward rsqrt (#29589)
4 years ago
cc 61820fd217
add the time threshold of quantization tests, test=develop (#29786)
4 years ago
xiaoting 55725cd2e1
fix for timeout, test=develop (#29788)
4 years ago
LielinJiang a94c3cbbf3
register cudnn conv double grad for depthwise conv (#29807)
4 years ago
ShenLiang 01e2874a0e
Support multi-stream communication for dynamic graph distributed (#29525)
4 years ago
huangxu96 a29006d128
Optimizer trans momentum (#29597)
4 years ago
liym27 0cc42e34c6
Migrate 4 APIs about array to paddle.tensor.* (#29565)
4 years ago
yukavio 96934b7430
fix flops (#29758)
4 years ago
liym27 41a7b07159
[Dy2Stat] Fix bug for loop: a variable is used and created in loop, but used before created (#29769)
4 years ago
LielinJiang e5af650b71
Add double grad for conv_transpose (#29706)
4 years ago
huangxu96 97e29411eb
fix a bug in multi_precision_fp16 unittest. (#29756)
4 years ago
Wojciech Uss 6ef8129dcc
upgrade oneDNN with GRU INT8 optimizations (#28420)
4 years ago
Huihuang Zheng dfffee8a5d
[Dy2stat] Enable jit.save to Save Without Running (#29579)
4 years ago
liym27 a0b60716f1
[Dy2Stat] Support grammar: for ele in var[idx] (#29541)
4 years ago