Commit Graph

12204 Commits (1cbb282d7774539a809d32f45bb9b443f56485a7)

Author SHA1 Message Date
yukavio 96934b7430
fix flops (#29758)
4 years ago
liym27 41a7b07159
[Dy2Stat] Fix bug for loop: a variable is used and created in loop, but used before created (#29769)
4 years ago
LielinJiang e5af650b71
Add double grad for conv_transpose (#29706)
4 years ago
huangxu96 97e29411eb
fix a bug in multi_precision_fp16 unittest. (#29756)
4 years ago
Wojciech Uss 6ef8129dcc
upgrade oneDNN with GRU INT8 optimizations (#28420)
4 years ago
Huihuang Zheng dfffee8a5d
[Dy2stat] Enable jit.save to Save Without Running (#29579)
4 years ago
liym27 a0b60716f1
[Dy2Stat] Support grammar: for ele in var[idx] (#29541)
4 years ago
chentianyu03 b59b6d7ae6
Complex op test (#29753)
4 years ago
liym27 096c048b45
Fix unitest test_slice (#29740)
4 years ago
Huihuang Zheng 2e788bd81e
Reduce batch size ot fix CPU memory, test=develop (#29736)
4 years ago
LielinJiang 10edfb6f21
Update en docs of to_tensor (#29718)
4 years ago
chentianyu03 71063b8137
add conj op for complex types (#29527)
4 years ago
WangXi 9cbcc6cadc
fleet sync build strategy, test=develop (#29732)
4 years ago
Chen Weihang 6cfa59de1b
[Complex] Add real & imag op and api for complex tensor (#29672)
4 years ago
LiuChiachi 572810eecb
Update EarlyStopping sample code (#29723)
4 years ago
TTerror af8ded773a
update activation op on kunlun (#29577)
4 years ago
ceci3 cc387159f3
add pad and concat double grad (#29549)
4 years ago
liuyuhui f13c3a9cd7
[Kunlun] PR1:Support one Kunlun card training in parallel executor (#29337)
4 years ago
huangxu96 b96dada4f0
add static.amp into setup.pu.in (#29621)
4 years ago
YUNSHEN XIE d0b789d27f
disable ut test_cumsum_op (#29613)
4 years ago
Jack Zhou 84bae27779
fix wmt14 doc, remove backward, add bidirect direction in rnn api (#29633)
4 years ago
YUNSHEN XIE 2926e74326
New UT should not exceed 15s (#29492)
4 years ago
Chen Weihang f02aece1f0
Add complex dtype op (add) test example (#29603)
4 years ago
AshburnLee efea540ca9
Add tf32 support for A100 tensor core acceleration for cuBLAS (#28732)
4 years ago
lijianshe02 7779768b53
add transpose double grad test=develop (#29600)
4 years ago
huangxu96 c05170d3d8
add alias for fluid.contrib.mixed_precision (#29562)
4 years ago
ShenLiang fb6697b424
Fix the dowanload bug in the case of multiple machines (#29551)
4 years ago
ShenLiang 1efef8baed
Fix bug of matmul_v2 for broadcast case (#29599)
4 years ago
qingqing01 8d549fc85d
Add clip double grad (#29590)
4 years ago
Tao Luo 81acc3278c
disable test_parallel_executor_profiler in cuda 10.1 (#29581)
4 years ago
wangchaochaohu ac4bae8ee9
elementwise_add_grad Op optimization (#29575)
4 years ago
huangxu96 2cb6f94888
add float16 into adaptive_avg_pool2d check list. (#29547)
4 years ago
yukavio ee1a7d020c
add some feature for paddle.flops (#29572)
4 years ago
WangXi 467c716963
gen nccl id use socket (#29431)
4 years ago
Bai Yifan d72604cd46
fix unittst unstable issue on ci machine (#29588)
4 years ago
QingshuChen 79a41a9ed6
support roi_align & affine_channel for kunlun (#29561)
4 years ago
liym27 0cad1152f4
[Dy2Stat] 1. Fix bug of for-range stmts. 2. Support that step value is negative in for-range stmts (#29519)
4 years ago
Huihuang Zheng 831e9135b9
Fix Windows Unittest (#29543)
4 years ago
GeminiCarrie 08f24a3108
Fix precision problem (#29567)
4 years ago
JZ-LIANG d33d468f02
[Sharding] add hybrid-dp feature (#29518)
4 years ago
Chen Weihang c1a26e2a05
fix train eval set error in static mode (#29540)
4 years ago
taixiurong 760d015c14
add xpu ops for training transformer in kunlun (#29539)
4 years ago
Leo Chen 0fdd365665
Add fast path for dropout when p == 0 (#29553)
4 years ago
Wojciech Uss 917a11495f
fix ininite scale values (#29386)
4 years ago
lijianshe02 bd29052e33
fix random seed in nll_loss unitest test=develop (#29538)
4 years ago
joanna.wozna.intel 0ce6d7fa77
Fix bf16 activations test for softmax and gelu (#29502)
4 years ago
huangxu96 4001979309
Add ReserveSpace in dygraph batch_norm. (#29221)
4 years ago
arlesniak b781953ef5
[oneDNN] Fix flags use test for #29080, assert condition more general (#29493)
4 years ago
Zhen Wang 5ac71b36fb
Remove tensor copy in the update_loss_scaling op. (#29426)
4 years ago
Zhou Wei e74e1a226c
support deepcopy for Layer/Tensor/Paramerbase (#29387)
4 years ago