Commit Graph

12390 Commits (5bf25d1e8b6eef2eea8aa24f5dbacea0b832aae2)

Author SHA1
Message
Date
Leo Chen a4b9daf97c
fix optimizer dtype (#29917)
4 years ago
liuyuhui 4427df37cf
[Kunlun] PR2: Support MultiDevicePass and BKCL in parallel executor (#29574)
4 years ago
LielinJiang 0b74428db8
Fix Conv2DTanspose bug when padding='same' (#29915)
4 years ago
LielinJiang 11de384c6d
Split callbacks unittest (#29914)
4 years ago
lilong12 01950ceb42
fix the bug in pipeline data parallelism (#29731)
4 years ago
YUNSHEN XIE 2a01756bf3
remove duplicate ut names (#29809)
4 years ago
Chen Weihang a6072055be
[Complex] Handle complex to real after type promotion (#29855)
4 years ago
Chen Weihang 1a304e6c06
[Complex] Add support for complex grad accumulated (#29889)
4 years ago
guofei 80eb77788f
Skip Windows Multi-GPU test of test_fetch_lod_tensor_array (#29508)
4 years ago
Leo Chen 6b258317cb
fix TransferInplaceBack (#29830)
4 years ago
QingshuChen 59b47f3b32
feat: support check_nan_inf for kunlun/xpu device (#29694)
4 years ago
wawltor 7498df2587
add the cumsum unit test for the develop (#29881)
4 years ago
wanghuancoder 26f9ab70f7
if PR have no .py files, do not use 'python coverage run', to speedup unit test (#29739)
4 years ago
Tao Luo 5d130d5670
Revert "fix conv2d int8 windows UT (#29528)" (#29869)
4 years ago
tangwei12 032414ca2a
[Feature] one ps (3/4) (#29604)
4 years ago
jakpiase edc06c6a1b
Added fc + activation fuse pass (currently only gelu, sigmoid and tanh are supported) (#29772)
4 years ago
Chen Weihang 0e0bb1b97d
replace exit method (#29862)
4 years ago
lidanqing 067d7f1d0d
fix conv2d int8 windows UT (#29528)
4 years ago
liym27 97e75ad0f5
[setitem] Support Tensor setitem in static mode (#29708)
4 years ago
YUNSHEN XIE 24ce051a84
remove duplicate ut reload (#29810)
4 years ago
Thunderbrook 09b6e71928
heter box (#29734)
4 years ago
LielinJiang 1092da82b2
Change the conditions of hapi printing logs (#29792)
4 years ago
ceci3 c4eb5d0378
fix unittest timeout (#29820)
4 years ago
chentianyu03 ddfc3d2c2f
change grad elementwise_mul for complex types (#29757)
4 years ago
chentianyu03 2a260d9b0e
change the grad of div when complex types (#29804)
4 years ago
syyxsxx e219b8ccef
fix api link for the any, all, isfinite
4 years ago
Guo Sheng 356efd36fa
Remove test_rnn_decode_api from disable list. (#29814)
4 years ago
TTerror 82aa01c373
add nearest_interp_v2 on kunlun (#29725)
4 years ago
yukavio 0f97ff0368
fix flops (#29818)
4 years ago
whs 82630408b4
Support double backward rsqrt (#29589)
4 years ago
cc 61820fd217
add the time threshold of quantization tests, test=develop (#29786)
4 years ago
xiaoting 55725cd2e1
fix for timeout, test=develop (#29788)
4 years ago
LielinJiang a94c3cbbf3
register cudnn conv double grad for depthwise conv (#29807)
4 years ago
ShenLiang 01e2874a0e
Support multi-stream communication for dynamic graph distributed (#29525)
4 years ago
huangxu96 a29006d128
Optimizer trans momentum (#29597)
4 years ago
liym27 0cc42e34c6
Migrate 4 APIs about array to paddle.tensor.* (#29565)
4 years ago
yukavio 96934b7430
fix flops (#29758)
4 years ago
liym27 41a7b07159
[Dy2Stat] Fix bug for loop: a variable is used and created in loop, but used before created (#29769)
4 years ago
LielinJiang e5af650b71
Add double grad for conv_transpose (#29706)
4 years ago
huangxu96 97e29411eb
fix a bug in multi_precision_fp16 unittest. (#29756)
4 years ago
Wojciech Uss 6ef8129dcc
upgrade oneDNN with GRU INT8 optimizations (#28420)
4 years ago
Huihuang Zheng dfffee8a5d
[Dy2stat] Enable jit.save to Save Without Running (#29579)
4 years ago
liym27 a0b60716f1
[Dy2Stat] Support grammar: for ele in var[idx] (#29541)
4 years ago
chentianyu03 b59b6d7ae6
Complex op test (#29753)
4 years ago
liym27 096c048b45
Fix unitest test_slice (#29740)
4 years ago
Huihuang Zheng 2e788bd81e
Reduce batch size ot fix CPU memory, test=develop (#29736)
4 years ago
LielinJiang 10edfb6f21
Update en docs of to_tensor (#29718)
4 years ago
chentianyu03 71063b8137
add conj op for complex types (#29527)
4 years ago
WangXi 9cbcc6cadc
fleet sync build strategy, test=develop (#29732)
4 years ago
Chen Weihang 6cfa59de1b
[Complex] Add real & imag op and api for complex tensor (#29672)
4 years ago
LiuChiachi 572810eecb
Update EarlyStopping sample code (#29723)
4 years ago
TTerror af8ded773a
update activation op on kunlun (#29577)
4 years ago
ceci3 cc387159f3
add pad and concat double grad (#29549)
4 years ago
liuyuhui f13c3a9cd7
[Kunlun] PR1:Support one Kunlun card training in parallel executor (#29337)
4 years ago
huangxu96 b96dada4f0
add static.amp into setup.pu.in (#29621)
4 years ago
YUNSHEN XIE d0b789d27f
disable ut test_cumsum_op (#29613)
4 years ago
Jack Zhou 84bae27779
fix wmt14 doc, remove backward, add bidirect direction in rnn api (#29633)
4 years ago
YUNSHEN XIE 2926e74326
New UT should not exceed 15s (#29492)
4 years ago
Chen Weihang f02aece1f0
Add complex dtype op (add) test example (#29603)
4 years ago
AshburnLee efea540ca9
Add tf32 support for A100 tensor core acceleration for cuBLAS (#28732)
4 years ago
lijianshe02 7779768b53
add transpose double grad test=develop (#29600)
4 years ago
huangxu96 c05170d3d8
add alias for fluid.contrib.mixed_precision (#29562)
4 years ago
ShenLiang fb6697b424
Fix the dowanload bug in the case of multiple machines (#29551)
4 years ago
ShenLiang 1efef8baed
Fix bug of matmul_v2 for broadcast case (#29599)
4 years ago
qingqing01 8d549fc85d
Add clip double grad (#29590)
4 years ago
Tao Luo 81acc3278c
disable test_parallel_executor_profiler in cuda 10.1 (#29581)
4 years ago
wangchaochaohu ac4bae8ee9
elementwise_add_grad Op optimization (#29575)
4 years ago
huangxu96 2cb6f94888
add float16 into adaptive_avg_pool2d check list. (#29547)
4 years ago
yukavio ee1a7d020c
add some feature for paddle.flops (#29572)
4 years ago
WangXi 467c716963
gen nccl id use socket (#29431)
4 years ago
Bai Yifan d72604cd46
fix unittst unstable issue on ci machine (#29588)
4 years ago
QingshuChen 79a41a9ed6
support roi_align & affine_channel for kunlun (#29561)
4 years ago
liym27 0cad1152f4
[Dy2Stat] 1. Fix bug of for-range stmts. 2. Support that step value is negative in for-range stmts (#29519)
4 years ago
Huihuang Zheng 831e9135b9
Fix Windows Unittest (#29543)
4 years ago
GeminiCarrie 08f24a3108
Fix precision problem (#29567)
4 years ago
JZ-LIANG d33d468f02
[Sharding] add hybrid-dp feature (#29518)
4 years ago
Chen Weihang c1a26e2a05
fix train eval set error in static mode (#29540)
4 years ago
taixiurong 760d015c14
add xpu ops for training transformer in kunlun (#29539)
4 years ago
Leo Chen 0fdd365665
Add fast path for dropout when p == 0 (#29553)
4 years ago
Wojciech Uss 917a11495f
fix ininite scale values (#29386)
4 years ago
lijianshe02 bd29052e33
fix random seed in nll_loss unitest test=develop (#29538)
4 years ago
joanna.wozna.intel 0ce6d7fa77
Fix bf16 activations test for softmax and gelu (#29502)
4 years ago
huangxu96 4001979309
Add ReserveSpace in dygraph batch_norm. (#29221)
4 years ago
arlesniak b781953ef5
[oneDNN] Fix flags use test for #29080, assert condition more general (#29493)
4 years ago
Zhen Wang 5ac71b36fb
Remove tensor copy in the update_loss_scaling op. (#29426)
4 years ago
Zhou Wei e74e1a226c
support deepcopy for Layer/Tensor/Paramerbase (#29387)
4 years ago
joejiong 50d3117d30
Add random_split and Subset dataset (#29291)
4 years ago
joejiong 87e75a77c2
Add tangent operator (#29207)
4 years ago
Wei Shengyu dc8bb76c68
remove addcmul (#28937)
4 years ago
Zhong Hui f459dd9634
fix abs double grad unittest (#29478)
4 years ago
huangxu96 576d0d938b
add fp16 check into max and avg pool (#29479)
4 years ago
ShenLiang 2ef9e0e23c
Rebuild group automatically in dynamic graph distributed (#29255)
4 years ago
procr 3a0558339d
support mobilenet for kunlun (#29458)
4 years ago
Aurelius84 5d530c9319
fix amp support fleet (#29491)
4 years ago
ShenLiang 311b3b44fc
Fix the bug where embedding can‘t be processed correctly in reducer (#29485)
4 years ago
Pei Yang 2480bdef6c
change hard_swish from plugin to layer (#29177)
4 years ago
lilong12 b122d0bb76
Fix bug in gloo that gloo initialization hangs (#29447)
4 years ago
taixiurong ecca6585cd
1. fix elementwise ops'bug 2. fix softmax_with_cross_entropy_op 3. add biliner_interp_op (#29448)
4 years ago
LoveAn 03b42d9fa7
fix unittest on windows, test=develop (#29365)
4 years ago
ShenLiang 22e6b9e373
Fix the ut of matmulv2 for broadcast case (#29461)
4 years ago
TTerror a5fcc4b545
update reduce_sum op on xpu (#29367)
4 years ago
chentianyu03 acce962133
remove complex module direction (#29419)
4 years ago
Zhang Ting 6296f4ed09
revert cast eigen kernel (#29427)
4 years ago
Leo Chen a040c055a5
fix layer_norm accuracy (#29434)
4 years ago
Shang Zhizhou 225a9c4ed8
Fix unittest (#29412)
4 years ago
Pei Yang f860de4af7
support clip op trt converter (#29411)
4 years ago
Bai Yifan 87bb726258
Add deform_conv2d,DeformConv2D (#29364)
4 years ago
chentianyu03 64e4e17f0c
remove complexvariable (#29390)
4 years ago
chajchaj 79e6086743
change shape of output in cross_entropy, test=develop (#29220)
4 years ago
liuyuhui 2ee7a6b08c
[paddle v2.0.0rc1: API fixs] assign/conv2d/conv2d_transpose/cast/ParamAttr (#29171)
4 years ago
Guo Sheng 8fc7f1b66a
Fix api docs in RNN, Transformer, layer_norm, WeightNormParamAttr (#29235)
4 years ago
Chen Long c940f842ca
remove rarfile from requirements (#29319)
4 years ago
yongqiangma 7c508d8668
update unbind norm add CUDAPlace api doc information (#29322)
4 years ago
chentianyu03 879e913b6d
Make transpose, trace, kron, reshape, sum op support complex type (#29321)
4 years ago
Chen Long 66fd1c00a0
fix some docs test=develop;test=document_fix (#29374)
4 years ago
liym27 5f84d0b375
Fix bug: delete wrong check_type of paddle.concat and support LoDTensorArray (#29306)
4 years ago
Feiyu Chan f7cdcefa65
fix multiple documentation errors, test=document_fix (#29210)
4 years ago
卖鱼的哲学 074065e5de
fix expand/uniform_random && concat/transpose to new api on xpu (#29280)
4 years ago
ShenLiang 4064354a01
support dp run single card (#29358)
4 years ago
gongweibao 8989053443
Fix bug of test_fleet_launch_async.sh (#29332)
4 years ago
Huihuang Zheng 8f7627907c
[Dy2stat] Reduce Exception Type for Better Error Message (#29268)
4 years ago
liym27 61a8f2874f
[Dy2Stat] Fix bug: Do not use gast.Subscript to replace gast.Name in when transforming for_enumerate_loop (#29310)
4 years ago
liym27 b10ecd9d3a
[inplace] Add ShareHolderWith for class Variable and SharePlaceholderWith in VarBase.detach() to share the same Tensor/SelectedRows (#29267)
4 years ago
Chen Weihang 9ad800ebb2
Support type promote for basic math ops (quantum required) (#29265)
4 years ago
LielinJiang f31e5adab5
fix typo in ProgBarLogger (#29329)
4 years ago
tangwei12 8358791607
fix gpu outofrange (#29238)
4 years ago
YUNSHEN XIE 28164b266f
disable test_rnn_decode_api and test_complex_matmul on windows (#29252)
4 years ago
Leo Chen b58cfff89d
use has_grad instead of train_mode (#29309)
4 years ago
Aurelius84 67c700b479
[Dy2Stat] Add cache for Executor and Context in run_program_op (#28421)
4 years ago
ShenLiang d6753e1e6d
fix matmulv2 for windows (#29327)
4 years ago
gongweibao 96de8b008f
cleanup enum test=develop (#29294)
4 years ago
liym27 b9a8ebd50f
[Dy2stat] Add a decorator paddle.jit.not_to_static to support that not to convert a function in Dynamic-to-Static. (#29253)
4 years ago
ShenLiang 2d6aa1a5bb
fix warning of fleet (#29317)
4 years ago
ShenLiang 2cd0bf5764
Fix doc of fleet api (#29282)
4 years ago
ShenLiang c00af94435
fix matmulv2 for windows (#29302)
4 years ago
Steffy-zxf 41f17aeb8b
fix DATA_HOME path in win (#29222)
4 years ago
Jack Zhou cf43322139
fix nll_loss doc;test=document_fix; (#29247)
4 years ago
LielinJiang b9f1f4343b
Move temporal_shift to paddle.nn.functional (#29261)
4 years ago
Chen Weihang a2e9d95a4a
change test_imperative_signal_handler_to_exclusive (#29283)
4 years ago
Zhen Wang be3777a50a
Add pure fp16 training with master weights. (#27712)
4 years ago
chentianyu03 976961de6d
fix random failed of complex matmul (#29285)
4 years ago
furnace 7584bb5096
Layer norm fp16 (#29169)
4 years ago
mls1999725 a37963b890
Update APIs in text/datasets and dataloader (#29219)
4 years ago
mls1999725 493568b070
Update Codes of Cifar and VOC2012 (#29204)
4 years ago
mls1999725 0aedd463ee
Update get_worker_info API (#29190)
4 years ago
mls1999725 6a9a62c3ef
Update conv3d API (#29205)
4 years ago
Huihuang Zheng aec05d811c
[Dy2stat] Fix PaddleGan Deoldify Model Dy2stat Problems (#29226)
4 years ago
Leo Chen 116305ea4b
Improve performance of elementwise_add grad op (#29187)
4 years ago
卖鱼的哲学 07c67d5a8b
add deformable_conv op on xpu (#29234)
4 years ago
Chen Weihang 1de32f823d
Hot fix complle failed in gcc4.8 caused by complex impl (#29254)
4 years ago
yukavio a71ea00922
add unit test (#29228)
4 years ago
ShenLiang 46b73e6cd9
Change the api of DataParallel and Fleet (#29224)
4 years ago
Leo Chen 73e51a17e7
add stop_gradient property and remove reduce redundant information (#29185)
4 years ago
QingshuChen 64f29fbb70
update kunlun conv2d/softmax/elementwise implemetation (#29229)
4 years ago
Jiawei Wang b11ab12787
Fix doc (adadelta, sgd, momentum) (#29212)
4 years ago
lijianshe02 76312deb30
fix nll_loss test random fail bug test=develop (#29236)
4 years ago
LielinJiang 8a2dd34a1e
fix depthwise conv (#29227)
4 years ago
huangxu96 dbdeecd665
Modify doc mistakes of grad API. (#29176)
4 years ago
Jiawei Wang a5d13d593c
Momentum Velocity init in Momentum.__init__() (#29223)
4 years ago
Leo Chen 4556ad76b4
Upgrade string literals to raw string [part 2](#29217)
4 years ago
wanghuancoder 2b2cd1864a
revert python file coverage, delete coverage run --include, test=develop (#29230)
4 years ago
chentianyu03 8f45d14263
add complex64 and complex128 type; add +-*/@ and slice opreator for c… (#29199)
4 years ago
123malin cc9c619679
test=develop, fix doc (#29200)
4 years ago
Zhou Wei c0a991c874
accumulate gradient for leaf tensor with previous graph and expose leaf tensor concept (#28429)
4 years ago
huangjun12 b6a26749dc
fix doc of alpha_dropout/dropout/dropout2d/dropout3d/npair_loss (#29136)
4 years ago
LielinJiang d8eef4e4a4
Remove dependence of scipy (#29121)
4 years ago
yaoxuefeng a069e1ca91
fix docs (#29097)
4 years ago
Chen Weihang 786e69e9c7
diable test_yolov3 in musl (#29216)
4 years ago
hong19860320 f23665e5d5
Refine the doc and unit test for Sigmoid and stanh (#29198)
4 years ago
123malin b5c6342336
Update ps gpu (#29209)
4 years ago
liym27 865a45984f
Check whether there is any inplace operation affecting gradient calculation. (#27901)
4 years ago
lilong12 08fb079dbc
Fix the doc for shard_index api (#29183)
4 years ago
qingqing01 058f1b2284
Enhance paddle.metric.Accuracy (#29125)
4 years ago
joejiong dc070ecfb0
Remove cast from paddle.pow api (#29134)
4 years ago
WangXi 0c2a51d240
optimizer amp, all use fp16 communication, overlap last comm and compute (#28957)
4 years ago
Chen Weihang 0b032faeee
Polish unittests details and execution conditions to adapt to MUSL (#29044)
4 years ago
123malin 92817f8005
test=develop, rm pathlib (#28658)
4 years ago
Wojciech Uss 4fd4095d1b
Add quantization of multi_gru op and tests (#28615)
4 years ago
Thunderbrook 4adddcc89a
add set_trainer_num api in dataset (#29133)
4 years ago
liym27 e03440812a
fix code: if y is True -> if y (#29184)
4 years ago
danleifeng 7e7b4b9e5d
remove sampled_softmax_with_cross_entropy alias;test=develop (#29180)
4 years ago
WeiXin 1476e1f998
save model after jit.load (#28748)
4 years ago
wanghuancoder 0239f79695
Generate code coverage reports only for incremental files (#28508)
4 years ago
zhang wenhui 8388abe66b
Fix api 1128 (#29174)
4 years ago
LielinJiang f92fdfb8ef
Add ReduceLROnPlateau (#29113)
4 years ago
Huihuang Zheng 27b4218333
[Dy2stat] Disable PaddleInference IR Optimization in test_mnist for CUDA11 (#29105)
4 years ago
liym27 01bdea7c31
[Dy2Stat] Don't conver the function from third library logging (#29161)
4 years ago
liym27 a7433cc379
[Dy2Stat] Fix bug: the return statement should be transformed to an equivalent Paddle/Python if statement, which depends on if conditions of the return stmt. (#29165)
4 years ago
Huihuang Zheng 4a0a870177
[dy2stat] Set shape for linspace to Fix dy2stat for GridGenerator Model (#29173)
4 years ago
Aurelius84 cb680c8013
[Dy2Stat]Refine code of test_lac unittest (#29087)
4 years ago
ShenLiang e2d01eb650
Support dynamic graph distributed (#28997)
4 years ago
lilong12 7e5e9934fe
update expand as op to use the shape of the target tensor instead of the target tensor itself. (#29020)
4 years ago
Kaipeng Deng f4c894a693
alias yolo_loss & yolo_box to paddle.vision. (#28520)
4 years ago
Shibo Tao 4ceedec33d
enhance doc. add kwargs for backward compatibility. test=develop (#29143)
4 years ago
LutaoChu 28280647eb
add paddle.subtract, optimize paddle.maximum and paddle.minimum
4 years ago
徐铭远 3c2a46bd7b
fix doc of erf,rank,mm,cross_entropy,pixel_shuffle,kron... (#29126)
4 years ago
Chen Long d576d6ddeb
fix some docs test=develop;test=document_fix (#29159)
4 years ago
yukavio 5da3d514eb
solve pretty table dependent in flops api (#29132)
4 years ago
pangyoki 6df685ab64
fix nce, multinomial, Categorical, Normal, Uniform en doc (#28541)
4 years ago
LielinJiang 9f53f3d09e
Enhance logger callback for benchmark (#29106)
4 years ago