Commit Graph

17967 Commits (73e51a17e744bd2a04bfc1fe948e928870454e20)

Author SHA1 Message Date
QingshuChen 64f29fbb70
update kunlun conv2d/softmax/elementwise implemetation (#29229)
5 years ago
chentianyu03 8f45d14263
add complex64 and complex128 type; add +-*/@ and slice opreator for c… (#29199)
5 years ago
Zhou Wei c0a991c874
accumulate gradient for leaf tensor with previous graph and expose leaf tensor concept (#28429)
5 years ago
Wilber 74c43ac638
fix lite unit test. (#29233)
5 years ago
Adam Osewski 4096ff94dc
Small optimizations for conv2d kernel subroutines. (#29188)
5 years ago
joanna.wozna.intel 5c61eeef61
Enable all image classification models (#29155)
5 years ago
Wilber 4fec182d24
[Lite-Subgraph] Fix compile error for lite subgraph. (#29146)
5 years ago
123malin b5c6342336
Update ps gpu (#29209)
5 years ago
liym27 865a45984f
Check whether there is any inplace operation affecting gradient calculation. (#27901)
5 years ago
chen zhiyu 4056c4f11c
Add unittest in musl build (#29099)
5 years ago
123malin 03d4665f44
prefetch optimize (#29095)
5 years ago
WangXi 0c2a51d240
optimizer amp, all use fp16 communication, overlap last comm and compute (#28957)
5 years ago
Chen Weihang 0b032faeee
Polish unittests details and execution conditions to adapt to MUSL (#29044)
5 years ago
123malin 92817f8005
test=develop, rm pathlib (#28658)
5 years ago
Wojciech Uss 4fd4095d1b
Add quantization of multi_gru op and tests (#28615)
5 years ago
Jack Zhou bc6033f86b
fix gru gcc7.4 bug for the gru compile
5 years ago
wanghuancoder 0239f79695
Generate code coverage reports only for incremental files (#28508)
5 years ago
wangchaochaohu b818429ae7
optimize cumsum OP (#29193)
5 years ago
ShenLiang e2d01eb650
Support dynamic graph distributed (#28997)
5 years ago
lilong12 7e5e9934fe
update expand as op to use the shape of the target tensor instead of the target tensor itself. (#29020)
5 years ago
pangyoki 7c8ac064c8
Delete prettytable in condabuild (#29145)
5 years ago
Zhou Wei e668cb07fb
fix CUDA 11 error on windows (#29101)
5 years ago
Jack Zhou 085260f3de
Add eigen gru and fix the dropout bug in the rnn
5 years ago
yaoxuefeng 545df287fc
add user_define_dump (#28596)
5 years ago
Aurelius84 71815637cc
Move gym into unittest/requirements.txt (#29149)
5 years ago
arlesniak bc902044a4
Fixes mkldnn dygraph learning rate scheduler crashes (#28988)
5 years ago
Shang Zhizhou b9e76a0103
detect tensorRT plugin fp16 in runtime (#27933)
5 years ago
Leo Chen fd3fcb051a
fix typo of flag name (#29154)
5 years ago
Noel da71173bc9
Fix ops doc for some ops
5 years ago
Leo Chen 770395cb93
Split train_mode and has_grad for tracer (#29064)
5 years ago
Aurelius84 7ae3cb554a
Polish CUDA Information stdout (#29109)
5 years ago
chalsliu 7a15e64034
Support precision test for new ut
5 years ago
WangXi 173c22aec2
optimize fast graph executor (#28962)
5 years ago
Shang Zhizhou 562ded1041
fix unittest trt_dynamic_shape_transformer_prune_test error (#29122)
5 years ago
Shibo Tao db41258501
add API serialize_program, serialize_persistables, save_to_file, deserialize_program, deserialize_persistables, load_from_file. (#29034)
5 years ago
joanna.wozna.intel b0d1ac161e
Add bf16 pool2d and unify bf16 unit tests (#29039)
5 years ago
joanna.wozna.intel fddea67445
Fix cpu_bfloat16_pass (#28730)
5 years ago
Qi Li 2fd16cf6fc
fix win ci failure, test=develop (#29089)
5 years ago
Chen Weihang fea0e294ee
Hide the C++ stack by default and add hints (#29042)
5 years ago
Chen Weihang b1274ac3d6
set show cpp stack by default, test=document_fix (#29102)
5 years ago
joejiong 582c0a0468
add uint8 for reshape op (#28996)
5 years ago
Zhou Wei 8ca0a8a859
fix tensor detach to zero copy (#27921)
5 years ago
Aurelius84 8af0d85ea4
fix unittest failed on windows GPU (#29072)
5 years ago
taixiurong a5aa4dc7a9
add xpu elementwise ops (#29031)
5 years ago
joejiong b04c78ef5e
Update pow (#29000)
5 years ago
wawltor b2c8a00745
remove eigen threadpool for the speed up
5 years ago
Wojciech Uss 7b5a8e46de
Add multi_gru_fuse_pass and tests (#28601)
5 years ago
LoveAn c91bb084f4
Add op benchmark ci pipeline in Paddle repo (#28692)
5 years ago
Zhou Wei 5e26a15484
Open GPU unitest on windows (#29003)
5 years ago
Leo Chen 3815d7aa40
Upgrade string literals to raw string (#28989)
5 years ago