Commit Graph

17929 Commits (47af5c3c9dbc58d7a8c771bfef2f2d4b177574bd)

Author SHA1 Message Date
石晓伟 21a63f6f90
enhance the op_version_registry, test=develop (#28347)
5 years ago
YUNSHEN XIE c1c3e21726
retry will not be executed when the number of failed ut is greater than 20 (#28374)
5 years ago
Shang Zhizhou ea851796e5
TensorRT中ernie模型推理性能优化,支持变长输入 (#28367)
5 years ago
Jacek Czaja 84cc61b2cd
[oneDNN] sum op refactor (#28318)
5 years ago
Wilber 6f0f45f69c
copy_to_cpu support uint8 (#28372)
5 years ago
Wilber 09fd2b2aab
Paddle support compile on sw (#27858)
5 years ago
chen zhiyu 953302d9eb
add musl docker build script (#28027)
5 years ago
Leo Chen 6115c14fca
Pool2d cuda kernel supports fp16 (#28316)
5 years ago
Zhou Wei f41104efa3
fix compile out of memory temporary (#28346)
5 years ago
Guo Sheng 9a600df373
Add rnn_op (#28197)
5 years ago
wangchaochaohu 0f4b6247c8
refine the gpu config for performance optimization (#28291)
5 years ago
Huihuang Zheng acc11c2a62
Retry CUDA Initialization to Fix Random Failure, test=develop (#28323)
5 years ago
wangguanzhong 5262b02585
add generate_proposals_v2 op (#28214)
5 years ago
石晓伟 d9b5f1261c
update the version of pybind, test=develop (#28284)
5 years ago
Leo Chen 18c86fb2fb
hide some logs of p2p (#28307)
5 years ago
lidanqing 8cd1c102d9
Enable GRU infer model running CAPI (#28313)
5 years ago
wangguanzhong 1c385e26f9
add op_function_generator for box_coder (#28303)
5 years ago
iducn f763cb81a6
Modify the shell script according to the specification (#28302)
5 years ago
joanna.wozna.intel 571a63e7ec
Add bf16 transpose2, reshape2, concat ops (#28195)
5 years ago
Guanghua Yu e8f2614da5
Enhance multiclass_nms op to support LoD for dygraph mode (#28276)
5 years ago
石晓伟 842a4e5abd
fix analyzer_capi_tester, test=develop (#28289)
5 years ago
Leo Chen 8953038400
Fix transpose in conv cudnn kernel when addto enabled (#28295)
5 years ago
Tao Luo e1e666a05f
fix conv mkldnn build error (#28288)
5 years ago
Jacek Czaja 0b678d401b
- sum (#28233)
5 years ago
Jacek Czaja c11d9b3035
[oneDNN ] conv2d fwd&bwd optimization (#27871)
5 years ago
Zhou Wei 8f87c7eac4
fix judge bug of errorlevel on cmd (#28271)
5 years ago
wangxinxin08 41d26a8287
update matrix nms op to api 2.0 (#28265)
5 years ago
Leo Chen 7fcb32ddf3
fill_constant op supports NINF (#28270)
5 years ago
wangchaochaohu 6905608cea
refine yolo box Op for performace optimization (#28155)
5 years ago
wangchaochaohu cdadc8f019
refine temporal_shift_op for performance optimization using gpu kernel config (#28114)
5 years ago
Zhang Ting fdc06f2158
add Fuse bn add act pass (#28196)
5 years ago
Chen Weihang 813b2ade34
Enrich the python error types of paddle & polish format (#28124)
5 years ago
Adam Osewski 7db747d9e8
oneDNN BatchNorm + Act fusion pass. (#27912)
5 years ago
Zhou Wei fb7f85291b
fix print tensor place,add cpu/cuda/pin_memory API for Tensor (#28200)
5 years ago
tianshuo78520a 11089cacdb
Fix xpu notest (#28204)
5 years ago
mapingshuo 81244fbfab
add sharding strategy in fleet(#27900)
5 years ago
Chen Weihang 2babd6ff67
Add compile limit for PADDLE_ENFORCE without error message (#28221)
5 years ago
lidanqing 4ea2330759
use FLAGS_use_mkldnn to prevent unnecessary attrs copy (#28146)
5 years ago
tianshuo78520a d835118dbd
Hide log message (#28220)
5 years ago
Double_V 2db77be423
fix wrong data type, test=develop (#28203)
5 years ago
Feiyu Chan efe6e2840c
fix strided_slice_op's GetExpectedKernelType (#28192)
5 years ago
Zhou Wei 271ee58f5c
Enhance build detection (#28123)
5 years ago
Leo Chen 1f3be85914
Fix bug of fetch_async_op_handle when fetching the feed variable (#28194)
5 years ago
WangXi e450823b8b
Fix nccl op test failed, test=develop (#28172)
5 years ago
tianshuo78520a c226b2e45a
update dockerfile (#27589)
5 years ago
Wilber f935ca8a50
[lite-xpu-subgraph] Fix xpu compile and test xpu ci. (#27932)
5 years ago
Zhou Wei 68c473e3e0
fix Automatic GPU detection failed on windows (#28148)
5 years ago
danleifeng f29fb396df
dygraph nccl init support host domain name (#28107)
5 years ago
wangguanzhong 5cd97a1cb0
support multiclass nms for multi-batch, test=develop (#28154)
5 years ago
Pei Yang 602d2ce5c9
change avg pooling from trt plugin to trt layer (#28032)
5 years ago
Double_V 5289b72acc
fix Wmaybe-uninitialized warning in pooling.cc, test=develop (#28126)
5 years ago
Zhou Wei 5d7000215a
fix dynamic_loader more safe and error message on windows (#28117)
5 years ago
tianshuo78520a d87d286707
Add build paddle inference (#28131)
5 years ago
wangguanzhong d1e1f17482
fix generate_proposal_labels in cascade-rcnn series model, test=develop (#27892)
5 years ago
Leo Chen a911c19eb0
fill_constant op supports NaN and Inf (#28109)
5 years ago
zhupengyang 6dd64b0a30
randperm run error in multi-gpus (#27942)
5 years ago
Double_V d43f75e4cc
add rois_num for roi_align xpu OP (#28077)
5 years ago
xiaoting e3d02c9574
rm max_input in conv2d for kunlun, test=kunlun (#28062)
5 years ago
joanna.wozna.intel a21b57109c
Add AVX512 instruction check for C-API (#28087)
5 years ago
wangchaochaohu 463c72c2d9
refine gpu kernel config for Paddle (#28085)
5 years ago
yinhaofeng 2cb1ecb99e
lookup_table_v2_op_xpu report errors;test=kunlun (#28064)
5 years ago
yinhaofeng 6f0c3d1f06
xpu adam op (#28031)
5 years ago
TeslaZhao a5c95cd588
Add xpu transpose2 op.test=kunlun (#28086)
5 years ago
Chengmo 5f04875c30
Fix xpu error message (#28061)
5 years ago
LutaoChu c8d32c8c10
Fix diag OP bug on Windows Python3.8
5 years ago
Pei Yang a0b2f93689
reduce trt warning message (#28011)
5 years ago
huangxu96 d466893820
Allclose op (#27891)
5 years ago
pangyoki 975bd8873b
Fix error message of multinomial op (#27946)
5 years ago
Kaipeng Deng b6eff4427c
update yolo_box support h != w. test=develop (#27327)
5 years ago
Double_V c1eed1fa24
error message opt for XPU, test=kunlun (#27972)
5 years ago
pangyoki 4c5b779a99
Add truncated_gaussian_random XPU kernel (#27861)
5 years ago
pangyoki 5b8e500135
Add gaussian_random XPU kernels (#27853)
5 years ago
pangyoki 74ce039743
Add uniform_random XPU kernel (#27846)
5 years ago
xiaoting abf4d52a74
Polish kunlun error (#27974)
5 years ago
liuyuhui 3e9568653b
add cast/concat/assign xpu op (#27911)
5 years ago
Guo Sheng fa9d3fa5bf
Incorporate cudnn_lstm into LSTM api (#27217)
5 years ago
chentianyu03 05fd49e974
change paddle.fluid.layers.reduce_sum to paddle.sum in sample codes (#27998)
5 years ago
Guanghua Yu f94d053705
error message optimization in mean_xpu,softmax_with_cross_entropy_op_xpu,test=kunlun (#27967)
5 years ago
Jack Zhou d330cf66cc
Fix xpu enforce (#27978)
5 years ago
lidanqing 7cb4a8b8f2
[oneDNN] Conv dilation support (#27914)
5 years ago
mapingshuo 64c2634995
fix kunlun kernel of reshape op (#27988)
5 years ago
tangwei12 202bfab1be
Feature/large scale kv save base/delta (#27470)
5 years ago
123malin aa3b4ed717
【paddle.fleet】geo send sparse optimize (#27719)
5 years ago
Zhou Wei 2ac6c6c3af
fix bug of tensor copy of CUDAPinnedPlace (#27966)
5 years ago
joanna.wozna.intel 840c521b77
Fix problem with flags fp32 and int8 (#27954)
5 years ago
mapingshuo 5ccaaab8aa
reshape support bool, test=develop (#27944)
5 years ago
Qinghe JING 4a4f773658
Add reduce sum and reduce mean xpu op (#27939)
5 years ago
Zhou Wei bf412f4665
add tensor clone (#27953)
5 years ago
Feiyu Chan 2e845182d9
support channel last in BatchNorm*d
5 years ago
guofei 6bbb6e7f45
Implement the function of OutScaleForTraining/OutScaleForInference in dygraph (#26601)
5 years ago
YUNSHEN XIE fea09fe534
disable ut quickly (#27793)
5 years ago
chentianyu03 d05058d268
Remove and reorganize the alias of APIs (#27717)
5 years ago
Leo Chen 9a2a4b5f65
Support setting xpu place in dygraph mode (#27909)
5 years ago
Thunderbrook 3ee6ad6ec5
solve bug in pull_dense_worker (#27918)
5 years ago
MRXLT 263a9e97fd
Fix adam (#27778)
5 years ago
Double_V b0edda4d99
kunlun add op (#27890)
5 years ago
Jack Zhou c791df09cf
Add elementwise XPU OP kernel for KUNLUN core, including (but still cannot process common broadcast
5 years ago
wangchaochaohu c5fcc96d5b
xpu support for fill_constant Op (#27675)
5 years ago
tianshuo78520a a820871669
Change PR-CI-Kunlun Test Number (#27923)
5 years ago
Chengmo 328cb289ed
【paddle.fleet】fix sparse load (#27680)
5 years ago
tangwei12 cf70d5b350
fix paddle error informations (#27889)
5 years ago
wawltor 95aa53425d
update the code for the topk message optimize
5 years ago
Chen Weihang 4ba977c720
Polish some error message in opeators (#27876)
5 years ago
123malin a4f850748a
【paddle.fleet】bug fix for parameter_recv (#27838)
5 years ago
QingshuChen 2712d07644
support kunlun matmul_v2 (#27910)
5 years ago
zhang wenhui 5a83496c8d
Multi task (#26002)
5 years ago
zhang wenhui 7a58431c0a
fix norm api doc, test=develop (#27652)
5 years ago
yinhaofeng 3eb106da6d
Lookup table v2 xpu (#27888)
5 years ago
Zhang Ting d5cc144c60
tune backward filter algorithm for float16 (#27529)
5 years ago
wanghuancoder 41aad9bfcd
revert 4 files, from clear include by iwyu, test=develop (#27895)
5 years ago
hutuxian 3f2a6ab65d
fix error msg (#27887)
5 years ago
xiaoting ae01801f0a
Add dropout and log_loss for kunlun (#27790)
5 years ago
Guanghua Yu 70c8c31371
support mean,softmax_with_cross_entropy on Baidu Kunlun (#27792)
5 years ago
Chengmo 1607e87cb9
add xpu sgd & momentum (#27728)
5 years ago
Leo Chen 049696bf67
Refine the format of printing tensor (#27673)
5 years ago
hong19860320 c90d35564b
Add batch_norm and layer_norm XPU kernels (#27818)
5 years ago
joanna.wozna.intel ddcd1b5381
Add bfloat16 resnet50 test (#27755)
5 years ago
xiaoting 6da7a7458b
add conv for xpu, test=kunlun (#27809)
5 years ago
Thunderbrook 04be37c57f
add xpu slice op (#27349)
5 years ago
Thunderbrook 8c25dfaacc
op error info (#27856)
5 years ago
Wilber 345574a6ed
Demo CMakeLists add openmp flag. (#27848)
5 years ago
ShenLiang 6d63cd2b93
add gather_op xpu, test=kunlun (#27822)
5 years ago
Feiyu Chan 1d95a0fbc3
fix error message for nce_op (#27863)
5 years ago
gongweibao 4237fefeb4
Add shellcheck tools and modify copyright hook (#27722)
5 years ago
Chengmo c5f2802d56
【paddle.fleet】Update fleetrun & ps-heter (#27472)
5 years ago
Shang Zhizhou bbc837ee72
add info log for trt input dynamic shape check (#27796)
5 years ago
guofei 2e1bca99ca
Refine the gradient calculation errors caused by renaming in while_grad (#27814)
5 years ago
wanghuancoder 8fa4c09889
add load_op_xpu for Baidu Kunlun (#27817)
5 years ago
Wilber 9005c5a260
Lite subgraph support arm cpu. (#27827)
5 years ago
Jacek Czaja 55e63763ec
[oneDNN] adaptive pool support (#27747)
5 years ago
chen zhiyu 6335e6a0a6
add musl option (#27798)
5 years ago
yongqiangma e8a5aefbbd
update CUDAPlace doc. test=document_fix (#27711)
5 years ago
Zhang Ting 16999ae49d
use IndexList to improve performance of instance_norm op (#25132)
5 years ago
GaoWei8 36bb056ed6
Add flattern weight of lstm (#27192)
5 years ago
Guanghua Yu 7779790c61
error message optimization in softmax_with_cross_entropy_op (#27772)
5 years ago
zhupengyang 659d04df2c
hsigmoid -> hsigmoid_loss/HSigmoidLoss; refine docs (#27745)
5 years ago
TeslaZhao 070ac9590c
Add double grad in Squeeze and Unsqueeze (#27810)
5 years ago
Jack Zhou d4359b0f39
add the kunlun kernel for the paddle 2.0
5 years ago
mapingshuo 840d54de9b
add XPU support for shape op and reshape op (#27804)
5 years ago
cc 8fabb1c32f
Add test attribute in channelwise_quant op, test=develop (#27742)
5 years ago
wangxinxin08 ad99e638fd
add double grad op for matmul (#27776)
5 years ago
zhupengyang 0025e0d87b
refine APIs: brelu, hardsigmoid, hardswish, maxout (#27658)
5 years ago
zhupengyang 5098891fdf
add softmax xpu kernel (#27700)
5 years ago
Double_V f6ad2375be
fix pool3d bug, test=develop (#27718)
5 years ago
石晓伟 0d27591642
save operator version infomation to program desc, test=develop (#27668)
5 years ago
Qi Li b8d2a021f0
fix ut error of test_recognize_digits, test=develop (#27791)
5 years ago
Jacek Czaja 631c1f3018
- Fix to 27398 (#27770)
5 years ago
Feiyu Chan 0a7bab4e34
fix error mesage for negative_positive_pair_op and nce_op (#27779)
5 years ago
zhupengyang 395cb561aa
refine logsumexp error message and docs (#27713)
5 years ago
smallv0221 057e28bc8f
API(lstm_unit, lstmp, sequence_mask, sequence_enumerate, sequence_conv) error message enhancement (#27572)
5 years ago