Commit Graph

17782 Commits (bc4606922578d2efab4d75425e15b8c380973b59)

Author SHA1 Message Date
Chengmo d014e29fc6
fix error message (#27318)
4 years ago
Leo Chen 35074963e3
Refine error msg in paddle/fluid/framework/details [part 2] (#27429)
4 years ago
Chengmo 0e101c4f6f
Fix test dist fleet heter ctr (#27513)
4 years ago
Zhong Hui a85592bcbf
fix cpplint error for the autmic max/min
4 years ago
joanna.wozna.intel b0ee1405f7
Add conv2d bfloat16 support (#27325)
4 years ago
Leo Chen a5b3263782
Refine error msg in paddle/fluid/imperative (#27521)
4 years ago
chalsliu 09f1953296
Revert "Disable ut quickly."
4 years ago
Thunderbrook 6f69a4cb05
add xpu in heter mode (#27000)
4 years ago
ceci3 8daccc9ea7
Fix batch norm double grad compute (#27549)
4 years ago
ShenLiang 6fc74bbaf6
add fp16 for matmul (#27523)
4 years ago
Zhong Hui fab4e6d08f
add abs support double grad
4 years ago
GaoWei8 36ed83d270
Refine PADDLE_ENFORCE (#27360)
4 years ago
liym27 effd51b6be
Fix error message in operator/utils.h (#27532)
4 years ago
Leo Chen 6bb02e8e3c
increase retry time (#27553)
4 years ago
Shang Zhizhou 77a36f8997
[buf fix]:fix some unittests error (#27540)
4 years ago
Zhong Hui 597345d17b
fix cuda atomic for ARCH<350 for the automic_max
4 years ago
WangXi e550fc02ae
fleet2.0 add fp16 grad compression (#27480)
4 years ago
cc c5c13473c6
Add compatibility check for four mkldnn pass (#27364)
4 years ago
mapingshuo c83ade6d6b
add AsDuplicable for sync_comm op(#27515)
4 years ago
Zhou Wei d20349b548
add unittest count ,install check on windows (#27492)
4 years ago
Wilber 3d5522146e
register seq_concat_fc_fuse pass. (#27479)
4 years ago
Wilber df7fabeedc
Fix memory leak for mkldnn. (#27493)
4 years ago
ruri b7319ef518
fix err msg in pixel shuffle op (#27503)
4 years ago
Kaipeng Deng d7f422c984
fix error message in conv/conv_transpose. test=develop (#27464)
4 years ago
Wilber ec4155d7d0
windows lib size crop from 5.4G to 3.9G (#27477)
4 years ago
ruri e1fb77d123
[2.0RC]refine error message in shuffle channel OP (#27505)
4 years ago
Aurelius84 f91c37e665
Refine error message of MatchMatrix and PyramidHash (#27484)
4 years ago
Shibo Tao 8f7bb52bd2
fix tensorrt 6 build error. test=develop (#27511)
4 years ago
wanghuancoder df43905f12
use iwyu clean include (#27267)
4 years ago
chalsliu 29f1560d8f
Disable ut quickly.
4 years ago
wangchaochaohu dc713116e0
refine the error message for bath size like OP (#27446)
4 years ago
Zhong Hui 4a9d21de49
Add GPU Kernels of Segment Ops, support, sum, max, min, mean
4 years ago
YUNSHEN XIE 66951ab2ea
modified timeout value for 4 ut (#27462)
4 years ago
Shang Zhizhou c17f9cf25f
[bug fix]:Memory increases after adapting the cudnn version to cudnn8 (#27436)
4 years ago
Zhou Wei 1e1ae5c54d
Make the Bind Method of Tensor more automatic (#27270)
4 years ago
LutaoChu 5508c78744
Fix bug: The calculation result of Diag_v2 Op under large size input is wrong (#27447)
4 years ago
tangwei12 bc5f0246a8
large scale kv speedup (#26510)
4 years ago
Qi Li d7b7dcd10e
fix cmake dependencies of test_recognize_digits, test=develop (#27475)
4 years ago
Zhou Wei 292b24aa6d
fix bug MD of compile, And add MD/STATIC/OPENBLAS inference lib check on windows (#27051)
4 years ago
Chen Weihang 41b5955538
Polish no onwer ops error message (#27448)
4 years ago
Zhang Ting 906e7f921e
add fuse_bn_act op (#27230)
4 years ago
Wilber 5034d181f3
update for 2.0 inference api. (#27473)
4 years ago
Chen Weihang 765064476b
Polish some lost invalid error message (#27445)
4 years ago
wangchaochaohu 76fb95fe76
avoid data transform for linspace OP (#27444)
4 years ago
123malin a04524759e
Enhance Op's Error Message (#27455)
4 years ago
wangchaochaohu 0a862fd356
refine the precious of linspace Op using half way (#27452)
4 years ago
Pei Yang fda54c0212
errmsg refine of trt plugin (#27309)
4 years ago
tianshuo78520a 9f3a9be76a
update python 2.7.15 (#27435)
4 years ago
石晓伟 dd4c2d86a5
enhance error messages, test=develop (#27423)
4 years ago
Zhou Wei b7371fa55d
judge whether remove build dir to accelerate compile,test=develop (#27334)
4 years ago
Zhong Hui f4c750d721
Add the cpu version of segment sum mean max min op
4 years ago
Wilber afe94903c3
Rename fluid_inference to paddle_inference. (#27422)
4 years ago
Pei Yang 8182337096
clear pass logs (#27434)
4 years ago
furnace 13a4c74efd
add mv op(c++, python, unit test) (#27024)
4 years ago
LutaoChu f11a53ee76
Optimize argsort Op performance on GPU
4 years ago
ceci3 1d3b27cae8
add double grad compute for batch norm (#27296)
4 years ago
Shang Zhizhou d93661942e
fix bug sequececonv_eltadd_relu_fuse_pass (#27404)
4 years ago
Leo Chen aba759ba16
[Feature] Enhance inplace addto strategy for gradient accumulation in static graph (#27112)
4 years ago
LutaoChu 669efb98de
Fix bug: shapes of Topk outputs are wrong when the parameter k is Tensor
4 years ago
Wilber 39546aa2f3
Add pass compatible and unit test. (#27377)
4 years ago
huangxu96 02606d45ef
Quant op dev (#25932)
4 years ago
Leo Chen bbc84e0fe0
Refine error msg in paddle/fluid/framework/details [part 1] (#25631)
4 years ago
MRXLT f936adbd2d
fix adam (#27343)
4 years ago
tangwei12 99626502f7
【paddle.fleet】gloo and util (#27213)
4 years ago
Pei Yang a5ef246cac
Optimize emb_eltwise_layernorm_plugin and support fp16 (#27128)
4 years ago
yaoxuefeng d726fd5e86
enhance dataset err msg (#27363)
4 years ago
guofei 9fdcfe8981
Support python3.8 (#26850)
4 years ago
Pei Yang fd7ab4e63c
register pass compatibility (#27357)
4 years ago
haozech 7e6dfcf9b2
Add 3 pass version check (#27283)
4 years ago
GaoWei8 1a7559718e
fix cudnn dyload (#27308)
4 years ago
wawltor b6a4349dd4
fix the error message for the math dir
4 years ago
HappyAngel 01659a6961
Polish operators error message in average_accumlate OP (#27268)
4 years ago
Shang Zhizhou 3c11717988
add op version checker to ir passes (#27329)
4 years ago
furnace 515efe4240
add empty_like op (python, and unit test), use c++ implementation of empty op, (#27287)
4 years ago
Yi Liu e9a0fbfff2
OP报错信息优化 (#27301)
4 years ago
tianshuo78520a da583edf6d
Fix paddle build install requirements (#27378)
4 years ago
Jack Zhou 63203c4abc
enhance reduce op which can reduce tensor with arbitrary rank
4 years ago
YUNSHEN XIE fd496fa9a3
modified the ut name with test_dist (#27315)
4 years ago
lilong12 9f9d15e285
fix the bug of non-exit, test=develop (#27350)
4 years ago
chalsliu d773c6c94e
Support precision test
4 years ago
ShenLiang 9ee77b1f41
Fix elementwise_floordiv op (#27352)
4 years ago
Zhou Wei ebc6d54446
fix cache file judge (#27369)
4 years ago
ShenLiang 54b81fa32c
add adaptivelsgd in meta_optimizer (#27289)
4 years ago
Jack Zhou 6e29c2da05
Error description optimize for the math dir
4 years ago
Zhou Wei f992f8d7ef
fix judge cache file of inference api more accurate (#27175)
4 years ago
Jacek Czaja 4582f697b6
- Fix to concat oneDNN overwritting data (#27273)
4 years ago
ShenLiang c296618c94
fix error message in broadcast/allreduce/gather (#27302)
4 years ago
Chen Weihang 4f9d6529fe
Polish framework error message part 7 (#27266)
4 years ago
wawltor 4e8582fe5a
update the error message check for the some ops
4 years ago
wawltor d003573f90
add the error message check for the some operator
4 years ago
Wilber dae62556cb
Enhance infer error info message (#26731)
4 years ago
Leo Chen 4c8ea492cd
use shared dev_ctx (#27313)
4 years ago
wanghuancoder 696a39e2f3
use clcache in windows (#27279)
4 years ago
Shang Zhizhou 47fdc60ecc
Optimize slice trt plugin (#26970)
4 years ago
Wilber f827665ae6
[Pass Compatible] Bind python compatible. (#27262)
4 years ago
石晓伟 bd77a4258d
error messages of inference/tests, test=develop (#27259)
4 years ago
Chen Weihang dafb0e3bb7
Polish framework error message part 6 (#27257)
4 years ago
Shang Zhizhou e6e2e53782
Optimize error report (#27254)
4 years ago
GaoWei8 ee1ed42c99
change sequence length attribute to input (#27193)
4 years ago
Pei Yang 3ae3b86489
fix trt_dynamic_shape_ernie_deserialize_test (#27290)
4 years ago
joanna.wozna.intel 1483ea2304
Add bfloat16 passes (#26999)
4 years ago
lilong12 bf461fa524
Improving error report message for sequence_expand op (#27245)
4 years ago
Zhong Hui bbad3414e8
Enhance the error messages for files in operators/math
4 years ago
Chen Weihang 79149c8ee6
polish framework error message part 8 (#27269)
4 years ago
Pei Yang aae41c6fca
refine error message related to paddle-TRT (#27256)
4 years ago
Zhen Wang d708b21074
Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240)
4 years ago
ShenLiang 2b6a5793fe
remove auto mode from localsgd optimizer (#27237)
4 years ago
Adam cc3f4b813a
Add int8 GRU kernel (#27220)
4 years ago
石晓伟 255e0cf978
error messages of inference/capi, test=develop (#27258)
4 years ago
Jack Zhou 9437ce36c4
Error description optimize for math dir
4 years ago
Zhang Ting 5c1bafbbc6
use eval to improve performance, test=develop (#25459)
4 years ago
lidanqing 5c4eed66fd
Fix GRU mkldnn kernel fail on look_table_v2 (#27198)
4 years ago
LoveAn 7745ad55ed
Add details to the summary for show more error informations (#27165)
4 years ago
Chen Weihang 33ff833af2
fix loaded no params layer run error (#27241)
4 years ago
Wilber f1ab288201
enhance inference error info. (#27251)
4 years ago
Wilber 1b84c0bf43
Lite subgraph refine predictor (#27167)
4 years ago
furnace 2e59769612
add empty op (c++, python, unit test) (#26659)
4 years ago
Zhou Wei f6be5989fd
Reduce the parallel compile count (#27187)
4 years ago
lilong12 c5f957ae38
add double grad for tile op and expand_v2 op (#27114)
4 years ago
lilong12 58a88ba9af
add double grad for expand (#27183)
4 years ago
Qi Li 7c7fbd3218
fix error msg of fused_embedding_fc_lstm_op, test=develop (#27231)
4 years ago
Qi Li 78446ecdba
[UT] fix run type of ut test cases of test_train_recognize_digits and test_api_impl, test=develop (#27218)
4 years ago
Jacek Czaja e005861598
[oneDNN]Introducing oneDNN 1.6 (#27137)
4 years ago
ShenLiang 5bd84b22c4
revert divide (#27202)
4 years ago
wawltor fde5cfe881
fix the CudaPinMemory bug for the equal op (#27176)
4 years ago
zhupengyang cc3306f7c8
restruct logsumexp to speed up compiling (#27191)
4 years ago
Steffy-zxf 50e60e8779
update error info for selected_rows_functor
4 years ago
Wilber edd962b1d0
Add 2.0 inference api doc. (#27125)
4 years ago
JZ-LIANG 5d039f4086
modified the implement of Lars optimizer (#26733)
4 years ago
wangchaochaohu c71d79b1d2
[cuda11 support] change the CMakeLists to support the cuda11 (#27124)
4 years ago
Qinghe JING 43b0445b29
Add double grad in reduce sum (#27115)
4 years ago
kinghuin ed292695c5
optimize the error message for math dir
4 years ago
yongqiangma 4558d395e9
fix Norm op error (#26771)
4 years ago
LielinJiang 4d7d661249
Fix kl and summary bug (#27132)
4 years ago
WeiXin 13804ed80c
Error msg/polish tensor error msg (#26976)
4 years ago
whs eb01976037
[2.0 API]Add checker in grid_sample_grad op (#27126)
4 years ago
wangguanzhong a28ae86e11
Enhance ops to support LoD as input for dygraph detection models. (#25316)
4 years ago
Zhou Wei 753a0748ee
Temporarily turn off WITH_INFERENCE_API_TEST (#27170)
4 years ago
YUNSHEN XIE d4710163eb
add timeout unittests retry (#27152)
4 years ago
LielinJiang 8df5b4d608
Add correlation api to contrib (#27015)
4 years ago
LoveAn cbcd5e407a
Fix problem that target name already exists when there isn't model data cache, test=develop (#27142)
4 years ago
kinghuin 1b102dd552
optimize the error message for unpooling.cc
4 years ago
Pei Yang 5fb8c92054
fix multihead matmul shared params (#27121)
4 years ago
xiaoting 58f3ef982a
fix typo for interp_v2,test=develop (#26843)
4 years ago
LoveAn ed2f57cc42
Restore file changes caused by pre-commit (#27105)
4 years ago
YUNSHEN XIE 9fd5eae81d
add failed unittests retry on mac system (#26813)
4 years ago
YUNSHEN XIE 92bf0d47e3
add failed unittests retry on win system (#26823)
4 years ago
wangchaochaohu 5af81f833c
fix gpu kernel for numel Op (#27085)
4 years ago
Wilber 632125415c
Refine python inference api (#26958)
4 years ago
YUNSHEN XIE b150f2b3a6
disable test_trt_dynamic_shape_ernie_ser_deser,test=document_fix (#27059)
4 years ago
zhupengyang 19ca6d9dd2
add .part to speed up compile (#27044)
4 years ago
LoveAn fab8bbf25b
Modify data download function and support unittests of inference APIs on windows (#26988)
4 years ago
GaoWei8 4ff16eb201
Add padding cudnn interface (#26370)
4 years ago
wawltor 8857e3911f
add the dynamic dtype check for the argmin/argma
4 years ago
wangchaochaohu 041f4ab842
refine linspace Op for dtype setting(#27071)
5 years ago
yaoxuefeng 9aa39584fe
fix cuda generator hard-coded offset step (#27027)
5 years ago
Jacek Czaja f6653c71e9
[oneDNN] Fix to conv2d grad with groups (#27006)
5 years ago
Chengmo a72752263b
support heter-xpu-ps (#27018)
5 years ago
whs 2660ea379d
Fix cuda kernel of affine grid (#27003)
5 years ago
Zhou Wei 4204ceaed9
kill op_function_generator.exe (#27005)
5 years ago
Zhou Wei 5a48952a54
remove rmdir build (#26965)
5 years ago
zhangchunle 5866cde758
mac tests failed (#26928)
5 years ago
ShenLiang ff3dc8ac73
fix the remainder (#26995)
5 years ago
yaoxuefeng 7f3e6ca596
add cuda generator (#26786)
5 years ago
iducn 35ae10272e
add shell of CPU's version info (#26937)
5 years ago
Feiyu Chan c8cc094576
add template specialization for bfloat16 for gcc 4.8 compatability (#26985)
5 years ago
wangchaochaohu 3eacced950
[cuda11 support] add support for cublas load of same function name (parameter diff) (#26963)
5 years ago
Chen Weihang 209273e605
Support load state dict form `inference model` format save result (#26718)
5 years ago
joanna.wozna.intel 95e1434bb2
Add bfloat16 data type (#25402)
5 years ago
Yang Zhang 29b844ad5e
Fix clip op attr (#26924)
5 years ago
LoveAn 26c698e2c9
Fix catch exit code failed caused by (#26934)
5 years ago
Shang Zhizhou 61fc7a3e45
Pass version check (#26887)
5 years ago
Zhou Wei f772540d80
add time when test failed (#26935)
5 years ago
huangjun12 e480168fae
fix dropout bug in backward when input is 1d tensor (#26837)
5 years ago
YUNSHEN XIE d8984a6b90
limit timeout value setting on linux (#26923)
5 years ago
Zhou Wei 1771d9f880
fix cache judge more safe (#26910)
5 years ago
joanna.wozna.intel 0627a319b0
Restore "Add mkldnn bfloat16 option to C-API " (#26882)
5 years ago
Jacek Czaja 5e874cc333
- Cosmetic fixes to align with PADDLE_ENFORCE guidelines (#26891)
5 years ago
wanghuancoder 2d2c31a63a
Add FetchAsyncOpHandle, and use it in FastThreadedExecutor (#26643)
5 years ago
Thunderbrook 5205748481
fix eigen in push sparse; fix hadoop command (#26872)
5 years ago
Zhaolong Xing 932bbe955b
fix pool trt plugin bug (#26463)
5 years ago
wawltor 0a29fc85d6
fix the argmin,argmax op for the paddlepaddle 2.0
5 years ago
LoveAn d067e66d39
Show more possible problems with build_and_check in file paddle_build.sh (#26846)
5 years ago
Chengmo d0962abd20
supplement bug fix of parameter server (#26217)
5 years ago
zlsh80826 ad6e3dd69c
[Paddle-TRT] Stack op plugin (#25605)
5 years ago
Leo Chen 60ffc22026
Refine bernoulli and unsqueeze op (#26842)
5 years ago
YUNSHEN XIE 1e50b2a635
fix retry error with blank (#26835)
5 years ago
石晓伟 ced6e87eee
Revert "Add mkldnn bfloat16 option to C-API (#26676)" (#26854)
5 years ago
tangwei12 ebc5f99789
add embedding 2.0 (#26649)
5 years ago
Zhou Wei d85410109d
Count the time and packet size for Windows monitor (#26678)
5 years ago
hong19860320 40378edfa8
Add the AddCheckpoint macro to softplus op (#26809)
5 years ago
GaoWei8 11fb8a1c10
Refine cudnn softmax (#25757)
5 years ago
arlesniak 885c61f086
Add use of global flag 'use_mkldnn' to layer_helper (#26497)
5 years ago
swtkiwi f44420c874
test=develop (#26710)
5 years ago
Pei Yang 78a530c219
[Paddle-TRT] TRT dynamic shape support PaddleSlim quant models (#26536)
5 years ago
wawltor 7ee70a47b8
update the doc for the some ops
5 years ago
yaoxuefeng a47d92d868
fleet add save with whitelist test=develop (#23376)
5 years ago
zhupengyang 0f1ad9b06c
leaky_relu and hardshrink add checkpoint for behavior changed (#26802)
5 years ago
Chengmo 7f2aa2db3c
【paddle.fleet】Support Heter Parameter Server (#25998)
5 years ago
zlsh80826 ac63c7cdef
fix a skip_layernorm bug, test=develop (#26800)
5 years ago