Commit Graph

17987 Commits (ebf689197d61af28110fa6b45e91527c47f68076)

Author SHA1 Message Date
Shang Zhizhou ebf689197d
fix tensorrt output shape error (#29308)
4 years ago
Aurelius84 67c700b479
[Dy2Stat] Add cache for Executor and Context in run_program_op (#28421)
4 years ago
ShenLiang 696dc4bb13
fix the warning of reducer (#29323)
4 years ago
wangchaochaohu c4be80f402
polish the code of cumsum and remove some unused code (#29303)
4 years ago
ShenLiang c00af94435
fix matmulv2 for windows (#29302)
4 years ago
wanghuancoder 3765da98c7
add coverage incremental switch, test=develop (#29290)
4 years ago
Wilber d68af02c04
fix analysis_config bug. (#29304)
4 years ago
ShenLiang 0fb18bc214
enforce the matmul_v2 error message (#29297)
4 years ago
Zhen Wang 9b59a589b1
Remove some useless log. (#29300)
4 years ago
Leo Chen 13a22a3752
fix shape of tile_grad op (#29289)
4 years ago
Zhen Wang be3777a50a
Add pure fp16 training with master weights. (#27712)
4 years ago
Wojciech Uss 6673fb0565
change import math.h to cmath (#29260)
4 years ago
furnace 7584bb5096
Layer norm fp16 (#29169)
4 years ago
Shang Zhizhou c59b4f28a2
fix cmake error when WITH_GPU=ON and WITH_TENSORRT=ON && WITH_MKL=OFF (#29275)
4 years ago
Shang Zhizhou fc80d2e09c
add compile option WITH_TENSORRT (#29208)
4 years ago
Leo Chen 116305ea4b
Improve performance of elementwise_add grad op (#29187)
4 years ago
卖鱼的哲学 07c67d5a8b
add deformable_conv op on xpu (#29234)
4 years ago
Chen Weihang 1de32f823d
Hot fix complle failed in gcc4.8 caused by complex impl (#29254)
4 years ago
GeminiCarrie 642abe2a48
Fix a bug when running on an operating system without "bash." (#29131)
4 years ago
ShenLiang 46b73e6cd9
Change the api of DataParallel and Fleet (#29224)
4 years ago
QingshuChen 64f29fbb70
update kunlun conv2d/softmax/elementwise implemetation (#29229)
4 years ago
chentianyu03 8f45d14263
add complex64 and complex128 type; add +-*/@ and slice opreator for c… (#29199)
4 years ago
Zhou Wei c0a991c874
accumulate gradient for leaf tensor with previous graph and expose leaf tensor concept (#28429)
4 years ago
Wilber 74c43ac638
fix lite unit test. (#29233)
4 years ago
Adam Osewski 4096ff94dc
Small optimizations for conv2d kernel subroutines. (#29188)
4 years ago
joanna.wozna.intel 5c61eeef61
Enable all image classification models (#29155)
4 years ago
Wilber 4fec182d24
[Lite-Subgraph] Fix compile error for lite subgraph. (#29146)
4 years ago
123malin b5c6342336
Update ps gpu (#29209)
4 years ago
liym27 865a45984f
Check whether there is any inplace operation affecting gradient calculation. (#27901)
4 years ago
chen zhiyu 4056c4f11c
Add unittest in musl build (#29099)
4 years ago
123malin 03d4665f44
prefetch optimize (#29095)
4 years ago
WangXi 0c2a51d240
optimizer amp, all use fp16 communication, overlap last comm and compute (#28957)
4 years ago
Chen Weihang 0b032faeee
Polish unittests details and execution conditions to adapt to MUSL (#29044)
4 years ago
123malin 92817f8005
test=develop, rm pathlib (#28658)
4 years ago
Wojciech Uss 4fd4095d1b
Add quantization of multi_gru op and tests (#28615)
4 years ago
Jack Zhou bc6033f86b
fix gru gcc7.4 bug for the gru compile
4 years ago
wanghuancoder 0239f79695
Generate code coverage reports only for incremental files (#28508)
4 years ago
wangchaochaohu b818429ae7
optimize cumsum OP (#29193)
4 years ago
ShenLiang e2d01eb650
Support dynamic graph distributed (#28997)
4 years ago
lilong12 7e5e9934fe
update expand as op to use the shape of the target tensor instead of the target tensor itself. (#29020)
4 years ago
pangyoki 7c8ac064c8
Delete prettytable in condabuild (#29145)
4 years ago
Zhou Wei e668cb07fb
fix CUDA 11 error on windows (#29101)
4 years ago
Jack Zhou 085260f3de
Add eigen gru and fix the dropout bug in the rnn
4 years ago
yaoxuefeng 545df287fc
add user_define_dump (#28596)
4 years ago
Aurelius84 71815637cc
Move gym into unittest/requirements.txt (#29149)
4 years ago
arlesniak bc902044a4
Fixes mkldnn dygraph learning rate scheduler crashes (#28988)
4 years ago
Shang Zhizhou b9e76a0103
detect tensorRT plugin fp16 in runtime (#27933)
4 years ago
Leo Chen fd3fcb051a
fix typo of flag name (#29154)
4 years ago
Noel da71173bc9
Fix ops doc for some ops
4 years ago
Leo Chen 770395cb93
Split train_mode and has_grad for tracer (#29064)
4 years ago