Commit Graph

17565 Commits (fd7ab4e63c5960c0ba6a9c0ce0d00478cbc78c7f)

Author SHA1 Message Date
Zhou Wei f6be5989fd
Reduce the parallel compile count (#27187)
4 years ago
lilong12 c5f957ae38
add double grad for tile op and expand_v2 op (#27114)
4 years ago
lilong12 58a88ba9af
add double grad for expand (#27183)
4 years ago
Qi Li 7c7fbd3218
fix error msg of fused_embedding_fc_lstm_op, test=develop (#27231)
4 years ago
Qi Li 78446ecdba
[UT] fix run type of ut test cases of test_train_recognize_digits and test_api_impl, test=develop (#27218)
4 years ago
Jacek Czaja e005861598
[oneDNN]Introducing oneDNN 1.6 (#27137)
4 years ago
ShenLiang 5bd84b22c4
revert divide (#27202)
4 years ago
wawltor fde5cfe881
fix the CudaPinMemory bug for the equal op (#27176)
4 years ago
zhupengyang cc3306f7c8
restruct logsumexp to speed up compiling (#27191)
4 years ago
Steffy-zxf 50e60e8779
update error info for selected_rows_functor
4 years ago
Wilber edd962b1d0
Add 2.0 inference api doc. (#27125)
4 years ago
JZ-LIANG 5d039f4086
modified the implement of Lars optimizer (#26733)
4 years ago
wangchaochaohu c71d79b1d2
[cuda11 support] change the CMakeLists to support the cuda11 (#27124)
4 years ago
Qinghe JING 43b0445b29
Add double grad in reduce sum (#27115)
4 years ago
kinghuin ed292695c5
optimize the error message for math dir
4 years ago
yongqiangma 4558d395e9
fix Norm op error (#26771)
4 years ago
LielinJiang 4d7d661249
Fix kl and summary bug (#27132)
4 years ago
WeiXin 13804ed80c
Error msg/polish tensor error msg (#26976)
4 years ago
whs eb01976037
[2.0 API]Add checker in grid_sample_grad op (#27126)
4 years ago
wangguanzhong a28ae86e11
Enhance ops to support LoD as input for dygraph detection models. (#25316)
4 years ago
Zhou Wei 753a0748ee
Temporarily turn off WITH_INFERENCE_API_TEST (#27170)
4 years ago
YUNSHEN XIE d4710163eb
add timeout unittests retry (#27152)
4 years ago
LielinJiang 8df5b4d608
Add correlation api to contrib (#27015)
4 years ago
LoveAn cbcd5e407a
Fix problem that target name already exists when there isn't model data cache, test=develop (#27142)
4 years ago
kinghuin 1b102dd552
optimize the error message for unpooling.cc
4 years ago
Pei Yang 5fb8c92054
fix multihead matmul shared params (#27121)
4 years ago
xiaoting 58f3ef982a
fix typo for interp_v2,test=develop (#26843)
4 years ago
LoveAn ed2f57cc42
Restore file changes caused by pre-commit (#27105)
4 years ago
YUNSHEN XIE 9fd5eae81d
add failed unittests retry on mac system (#26813)
4 years ago
YUNSHEN XIE 92bf0d47e3
add failed unittests retry on win system (#26823)
4 years ago
wangchaochaohu 5af81f833c
fix gpu kernel for numel Op (#27085)
4 years ago
Wilber 632125415c
Refine python inference api (#26958)
4 years ago
YUNSHEN XIE b150f2b3a6
disable test_trt_dynamic_shape_ernie_ser_deser,test=document_fix (#27059)
4 years ago
zhupengyang 19ca6d9dd2
add .part to speed up compile (#27044)
4 years ago
LoveAn fab8bbf25b
Modify data download function and support unittests of inference APIs on windows (#26988)
4 years ago
GaoWei8 4ff16eb201
Add padding cudnn interface (#26370)
4 years ago
wawltor 8857e3911f
add the dynamic dtype check for the argmin/argma
4 years ago
wangchaochaohu 041f4ab842
refine linspace Op for dtype setting(#27071)
5 years ago
yaoxuefeng 9aa39584fe
fix cuda generator hard-coded offset step (#27027)
5 years ago
Jacek Czaja f6653c71e9
[oneDNN] Fix to conv2d grad with groups (#27006)
5 years ago
Chengmo a72752263b
support heter-xpu-ps (#27018)
5 years ago
whs 2660ea379d
Fix cuda kernel of affine grid (#27003)
5 years ago
Zhou Wei 4204ceaed9
kill op_function_generator.exe (#27005)
5 years ago
Zhou Wei 5a48952a54
remove rmdir build (#26965)
5 years ago
zhangchunle 5866cde758
mac tests failed (#26928)
5 years ago
ShenLiang ff3dc8ac73
fix the remainder (#26995)
5 years ago
yaoxuefeng 7f3e6ca596
add cuda generator (#26786)
5 years ago
iducn 35ae10272e
add shell of CPU's version info (#26937)
5 years ago
Feiyu Chan c8cc094576
add template specialization for bfloat16 for gcc 4.8 compatability (#26985)
5 years ago
wangchaochaohu 3eacced950
[cuda11 support] add support for cublas load of same function name (parameter diff) (#26963)
5 years ago
Chen Weihang 209273e605
Support load state dict form `inference model` format save result (#26718)
5 years ago
joanna.wozna.intel 95e1434bb2
Add bfloat16 data type (#25402)
5 years ago
Yang Zhang 29b844ad5e
Fix clip op attr (#26924)
5 years ago
LoveAn 26c698e2c9
Fix catch exit code failed caused by (#26934)
5 years ago
Shang Zhizhou 61fc7a3e45
Pass version check (#26887)
5 years ago
Zhou Wei f772540d80
add time when test failed (#26935)
5 years ago
huangjun12 e480168fae
fix dropout bug in backward when input is 1d tensor (#26837)
5 years ago
YUNSHEN XIE d8984a6b90
limit timeout value setting on linux (#26923)
5 years ago
Zhou Wei 1771d9f880
fix cache judge more safe (#26910)
5 years ago
joanna.wozna.intel 0627a319b0
Restore "Add mkldnn bfloat16 option to C-API " (#26882)
5 years ago
Jacek Czaja 5e874cc333
- Cosmetic fixes to align with PADDLE_ENFORCE guidelines (#26891)
5 years ago
wanghuancoder 2d2c31a63a
Add FetchAsyncOpHandle, and use it in FastThreadedExecutor (#26643)
5 years ago
Thunderbrook 5205748481
fix eigen in push sparse; fix hadoop command (#26872)
5 years ago
Zhaolong Xing 932bbe955b
fix pool trt plugin bug (#26463)
5 years ago
wawltor 0a29fc85d6
fix the argmin,argmax op for the paddlepaddle 2.0
5 years ago
LoveAn d067e66d39
Show more possible problems with build_and_check in file paddle_build.sh (#26846)
5 years ago
Chengmo d0962abd20
supplement bug fix of parameter server (#26217)
5 years ago
zlsh80826 ad6e3dd69c
[Paddle-TRT] Stack op plugin (#25605)
5 years ago
Leo Chen 60ffc22026
Refine bernoulli and unsqueeze op (#26842)
5 years ago
YUNSHEN XIE 1e50b2a635
fix retry error with blank (#26835)
5 years ago
石晓伟 ced6e87eee
Revert "Add mkldnn bfloat16 option to C-API (#26676)" (#26854)
5 years ago
tangwei12 ebc5f99789
add embedding 2.0 (#26649)
5 years ago
Zhou Wei d85410109d
Count the time and packet size for Windows monitor (#26678)
5 years ago
hong19860320 40378edfa8
Add the AddCheckpoint macro to softplus op (#26809)
5 years ago
GaoWei8 11fb8a1c10
Refine cudnn softmax (#25757)
5 years ago
arlesniak 885c61f086
Add use of global flag 'use_mkldnn' to layer_helper (#26497)
5 years ago
swtkiwi f44420c874
test=develop (#26710)
5 years ago
Pei Yang 78a530c219
[Paddle-TRT] TRT dynamic shape support PaddleSlim quant models (#26536)
5 years ago
wawltor 7ee70a47b8
update the doc for the some ops
5 years ago
yaoxuefeng a47d92d868
fleet add save with whitelist test=develop (#23376)
5 years ago
zhupengyang 0f1ad9b06c
leaky_relu and hardshrink add checkpoint for behavior changed (#26802)
5 years ago
Chengmo 7f2aa2db3c
【paddle.fleet】Support Heter Parameter Server (#25998)
5 years ago
zlsh80826 ac63c7cdef
fix a skip_layernorm bug, test=develop (#26800)
5 years ago
Jiawei Wang a1b99fae07
Adadelta Optimizer (#26590)
5 years ago
LielinJiang 346689c6f1
Register conv_transpose Op version for compatible Op upgrades (#26745)
5 years ago
Adam 8bcb1f29d9
Add conv+affine_channel fuse pass to MKLDNN pass strategy and fix it (#26779)
5 years ago
Wilber 68e0560c2f
refine paddle inference api (#26774)
5 years ago
iducn 64df9b99a9
add shell of GPU version (#26589)
5 years ago
Wojciech Uss 7afb1df11e
Decouple weights and bias from fc primitive in MKLDNN cache (#26708)
5 years ago
Zhen Wang f32ae272ec
Remove `sorted_sum_gradient_` form BasicEngine and PartialGradTask. (#26766)
5 years ago
Leo Chen 844583c8fd
Refine paddle.manual_seed (#26496)
5 years ago
Zhou Wei 2d88b9ffe7
turn on WITH_INFERENCE_API_TEST (#26746)
5 years ago
Pei Yang e3f8e5cf5c
trt int8 support conv2d_transpose (#26636)
5 years ago
ShenLiang 29494d703d
fix remainder, floor_div (#26732)
5 years ago
zhangchunle 623a4c2e56
fix ci coverage build error (#26761)
5 years ago
lilong12 5f524efe56
modify error report message, test=develop (#26743)
5 years ago
wangchaochaohu 4561fc37e2
Add check point for gather Op (#26696)
5 years ago
joanna.wozna.intel eb097d64f6
Fix int8 performace drop cpu_quantize_placement_pass (#26715)
5 years ago
joanna.wozna.intel 02083bda40
Add mkldnn bfloat16 option to C-API (#26676)
5 years ago
LutaoChu 1ec30cb160
register cumsum Op version for compatible Op upgrades (#26734)
5 years ago