Commit Graph

10487 Commits (5305b2749a4c5f913b0fa8b5ffe6ba616b621bab)

Author SHA1 Message Date
Zhong Hui f4c750d721
Add the cpu version of segment sum mean max min op
5 years ago
Wilber afe94903c3
Rename fluid_inference to paddle_inference. (#27422)
5 years ago
Pei Yang 8182337096
clear pass logs (#27434)
5 years ago
furnace 13a4c74efd
add mv op(c++, python, unit test) (#27024)
5 years ago
LutaoChu f11a53ee76
Optimize argsort Op performance on GPU
5 years ago
ceci3 1d3b27cae8
add double grad compute for batch norm (#27296)
5 years ago
Shang Zhizhou d93661942e
fix bug sequececonv_eltadd_relu_fuse_pass (#27404)
5 years ago
Leo Chen aba759ba16
[Feature] Enhance inplace addto strategy for gradient accumulation in static graph (#27112)
5 years ago
LutaoChu 669efb98de
Fix bug: shapes of Topk outputs are wrong when the parameter k is Tensor
5 years ago
Wilber 39546aa2f3
Add pass compatible and unit test. (#27377)
5 years ago
huangxu96 02606d45ef
Quant op dev (#25932)
5 years ago
Leo Chen bbc84e0fe0
Refine error msg in paddle/fluid/framework/details [part 1] (#25631)
5 years ago
MRXLT f936adbd2d
fix adam (#27343)
5 years ago
tangwei12 99626502f7
【paddle.fleet】gloo and util (#27213)
5 years ago
Pei Yang a5ef246cac
Optimize emb_eltwise_layernorm_plugin and support fp16 (#27128)
5 years ago
yaoxuefeng d726fd5e86
enhance dataset err msg (#27363)
5 years ago
Pei Yang fd7ab4e63c
register pass compatibility (#27357)
5 years ago
haozech 7e6dfcf9b2
Add 3 pass version check (#27283)
5 years ago
GaoWei8 1a7559718e
fix cudnn dyload (#27308)
5 years ago
wawltor b6a4349dd4
fix the error message for the math dir
5 years ago
HappyAngel 01659a6961
Polish operators error message in average_accumlate OP (#27268)
5 years ago
Shang Zhizhou 3c11717988
add op version checker to ir passes (#27329)
5 years ago
furnace 515efe4240
add empty_like op (python, and unit test), use c++ implementation of empty op, (#27287)
5 years ago
Yi Liu e9a0fbfff2
OP报错信息优化 (#27301)
5 years ago
Jack Zhou 63203c4abc
enhance reduce op which can reduce tensor with arbitrary rank
5 years ago
lilong12 9f9d15e285
fix the bug of non-exit, test=develop (#27350)
5 years ago
ShenLiang 9ee77b1f41
Fix elementwise_floordiv op (#27352)
5 years ago
Zhou Wei ebc6d54446
fix cache file judge (#27369)
5 years ago
ShenLiang 54b81fa32c
add adaptivelsgd in meta_optimizer (#27289)
5 years ago
Jack Zhou 6e29c2da05
Error description optimize for the math dir
5 years ago
Zhou Wei f992f8d7ef
fix judge cache file of inference api more accurate (#27175)
5 years ago
Jacek Czaja 4582f697b6
- Fix to concat oneDNN overwritting data (#27273)
5 years ago
ShenLiang c296618c94
fix error message in broadcast/allreduce/gather (#27302)
5 years ago
Chen Weihang 4f9d6529fe
Polish framework error message part 7 (#27266)
5 years ago
wawltor 4e8582fe5a
update the error message check for the some ops
5 years ago
wawltor d003573f90
add the error message check for the some operator
5 years ago
Wilber dae62556cb
Enhance infer error info message (#26731)
5 years ago
Leo Chen 4c8ea492cd
use shared dev_ctx (#27313)
5 years ago
Shang Zhizhou 47fdc60ecc
Optimize slice trt plugin (#26970)
5 years ago
Wilber f827665ae6
[Pass Compatible] Bind python compatible. (#27262)
5 years ago
石晓伟 bd77a4258d
error messages of inference/tests, test=develop (#27259)
5 years ago
Chen Weihang dafb0e3bb7
Polish framework error message part 6 (#27257)
5 years ago
Shang Zhizhou e6e2e53782
Optimize error report (#27254)
5 years ago
GaoWei8 ee1ed42c99
change sequence length attribute to input (#27193)
5 years ago
Pei Yang 3ae3b86489
fix trt_dynamic_shape_ernie_deserialize_test (#27290)
5 years ago
joanna.wozna.intel 1483ea2304
Add bfloat16 passes (#26999)
5 years ago
lilong12 bf461fa524
Improving error report message for sequence_expand op (#27245)
5 years ago
Zhong Hui bbad3414e8
Enhance the error messages for files in operators/math
5 years ago
Chen Weihang 79149c8ee6
polish framework error message part 8 (#27269)
5 years ago
Pei Yang aae41c6fca
refine error message related to paddle-TRT (#27256)
5 years ago
Zhen Wang d708b21074
Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240)
5 years ago
ShenLiang 2b6a5793fe
remove auto mode from localsgd optimizer (#27237)
5 years ago
Adam cc3f4b813a
Add int8 GRU kernel (#27220)
5 years ago
石晓伟 255e0cf978
error messages of inference/capi, test=develop (#27258)
5 years ago
Jack Zhou 9437ce36c4
Error description optimize for math dir
5 years ago
Zhang Ting 5c1bafbbc6
use eval to improve performance, test=develop (#25459)
5 years ago
lidanqing 5c4eed66fd
Fix GRU mkldnn kernel fail on look_table_v2 (#27198)
5 years ago
Chen Weihang 33ff833af2
fix loaded no params layer run error (#27241)
5 years ago
Wilber f1ab288201
enhance inference error info. (#27251)
5 years ago
Wilber 1b84c0bf43
Lite subgraph refine predictor (#27167)
5 years ago
furnace 2e59769612
add empty op (c++, python, unit test) (#26659)
5 years ago
lilong12 c5f957ae38
add double grad for tile op and expand_v2 op (#27114)
5 years ago
lilong12 58a88ba9af
add double grad for expand (#27183)
5 years ago
Qi Li 7c7fbd3218
fix error msg of fused_embedding_fc_lstm_op, test=develop (#27231)
5 years ago
Qi Li 78446ecdba
[UT] fix run type of ut test cases of test_train_recognize_digits and test_api_impl, test=develop (#27218)
5 years ago
Jacek Czaja e005861598
[oneDNN]Introducing oneDNN 1.6 (#27137)
5 years ago
ShenLiang 5bd84b22c4
revert divide (#27202)
5 years ago
wawltor fde5cfe881
fix the CudaPinMemory bug for the equal op (#27176)
5 years ago
zhupengyang cc3306f7c8
restruct logsumexp to speed up compiling (#27191)
5 years ago
Steffy-zxf 50e60e8779
update error info for selected_rows_functor
5 years ago
Wilber edd962b1d0
Add 2.0 inference api doc. (#27125)
5 years ago
JZ-LIANG 5d039f4086
modified the implement of Lars optimizer (#26733)
5 years ago
wangchaochaohu c71d79b1d2
[cuda11 support] change the CMakeLists to support the cuda11 (#27124)
5 years ago
Qinghe JING 43b0445b29
Add double grad in reduce sum (#27115)
5 years ago
kinghuin ed292695c5
optimize the error message for math dir
5 years ago
yongqiangma 4558d395e9
fix Norm op error (#26771)
5 years ago
LielinJiang 4d7d661249
Fix kl and summary bug (#27132)
5 years ago
WeiXin 13804ed80c
Error msg/polish tensor error msg (#26976)
5 years ago
whs eb01976037
[2.0 API]Add checker in grid_sample_grad op (#27126)
5 years ago
wangguanzhong a28ae86e11
Enhance ops to support LoD as input for dygraph detection models. (#25316)
5 years ago
LielinJiang 8df5b4d608
Add correlation api to contrib (#27015)
5 years ago
LoveAn cbcd5e407a
Fix problem that target name already exists when there isn't model data cache, test=develop (#27142)
5 years ago
kinghuin 1b102dd552
optimize the error message for unpooling.cc
5 years ago
Pei Yang 5fb8c92054
fix multihead matmul shared params (#27121)
5 years ago
xiaoting 58f3ef982a
fix typo for interp_v2,test=develop (#26843)
5 years ago
wangchaochaohu 5af81f833c
fix gpu kernel for numel Op (#27085)
5 years ago
Wilber 632125415c
Refine python inference api (#26958)
5 years ago
YUNSHEN XIE b150f2b3a6
disable test_trt_dynamic_shape_ernie_ser_deser,test=document_fix (#27059)
5 years ago
zhupengyang 19ca6d9dd2
add .part to speed up compile (#27044)
5 years ago
LoveAn fab8bbf25b
Modify data download function and support unittests of inference APIs on windows (#26988)
5 years ago
GaoWei8 4ff16eb201
Add padding cudnn interface (#26370)
5 years ago
wawltor 8857e3911f
add the dynamic dtype check for the argmin/argma
5 years ago
wangchaochaohu 041f4ab842
refine linspace Op for dtype setting(#27071)
5 years ago
yaoxuefeng 9aa39584fe
fix cuda generator hard-coded offset step (#27027)
5 years ago
Jacek Czaja f6653c71e9
[oneDNN] Fix to conv2d grad with groups (#27006)
5 years ago
Chengmo a72752263b
support heter-xpu-ps (#27018)
5 years ago
whs 2660ea379d
Fix cuda kernel of affine grid (#27003)
5 years ago
ShenLiang ff3dc8ac73
fix the remainder (#26995)
5 years ago
yaoxuefeng 7f3e6ca596
add cuda generator (#26786)
5 years ago
Feiyu Chan c8cc094576
add template specialization for bfloat16 for gcc 4.8 compatability (#26985)
5 years ago
wangchaochaohu 3eacced950
[cuda11 support] add support for cublas load of same function name (parameter diff) (#26963)
5 years ago
joanna.wozna.intel 95e1434bb2
Add bfloat16 data type (#25402)
5 years ago
Yang Zhang 29b844ad5e
Fix clip op attr (#26924)
5 years ago
Shang Zhizhou 61fc7a3e45
Pass version check (#26887)
5 years ago
huangjun12 e480168fae
fix dropout bug in backward when input is 1d tensor (#26837)
5 years ago
YUNSHEN XIE d8984a6b90
limit timeout value setting on linux (#26923)
5 years ago
Zhou Wei 1771d9f880
fix cache judge more safe (#26910)
5 years ago
joanna.wozna.intel 0627a319b0
Restore "Add mkldnn bfloat16 option to C-API " (#26882)
5 years ago
Jacek Czaja 5e874cc333
- Cosmetic fixes to align with PADDLE_ENFORCE guidelines (#26891)
5 years ago
wanghuancoder 2d2c31a63a
Add FetchAsyncOpHandle, and use it in FastThreadedExecutor (#26643)
5 years ago
Thunderbrook 5205748481
fix eigen in push sparse; fix hadoop command (#26872)
5 years ago
Zhaolong Xing 932bbe955b
fix pool trt plugin bug (#26463)
5 years ago
wawltor 0a29fc85d6
fix the argmin,argmax op for the paddlepaddle 2.0
5 years ago
Chengmo d0962abd20
supplement bug fix of parameter server (#26217)
5 years ago
zlsh80826 ad6e3dd69c
[Paddle-TRT] Stack op plugin (#25605)
5 years ago
Leo Chen 60ffc22026
Refine bernoulli and unsqueeze op (#26842)
5 years ago
石晓伟 ced6e87eee
Revert "Add mkldnn bfloat16 option to C-API (#26676)" (#26854)
5 years ago
tangwei12 ebc5f99789
add embedding 2.0 (#26649)
5 years ago
hong19860320 40378edfa8
Add the AddCheckpoint macro to softplus op (#26809)
5 years ago
GaoWei8 11fb8a1c10
Refine cudnn softmax (#25757)
5 years ago
arlesniak 885c61f086
Add use of global flag 'use_mkldnn' to layer_helper (#26497)
5 years ago
Pei Yang 78a530c219
[Paddle-TRT] TRT dynamic shape support PaddleSlim quant models (#26536)
5 years ago
wawltor 7ee70a47b8
update the doc for the some ops
5 years ago
yaoxuefeng a47d92d868
fleet add save with whitelist test=develop (#23376)
5 years ago
zhupengyang 0f1ad9b06c
leaky_relu and hardshrink add checkpoint for behavior changed (#26802)
5 years ago
Chengmo 7f2aa2db3c
【paddle.fleet】Support Heter Parameter Server (#25998)
5 years ago
zlsh80826 ac63c7cdef
fix a skip_layernorm bug, test=develop (#26800)
5 years ago
Jiawei Wang a1b99fae07
Adadelta Optimizer (#26590)
5 years ago
LielinJiang 346689c6f1
Register conv_transpose Op version for compatible Op upgrades (#26745)
5 years ago
Adam 8bcb1f29d9
Add conv+affine_channel fuse pass to MKLDNN pass strategy and fix it (#26779)
5 years ago
Wilber 68e0560c2f
refine paddle inference api (#26774)
5 years ago
Wojciech Uss 7afb1df11e
Decouple weights and bias from fc primitive in MKLDNN cache (#26708)
5 years ago
Zhen Wang f32ae272ec
Remove `sorted_sum_gradient_` form BasicEngine and PartialGradTask. (#26766)
5 years ago
Leo Chen 844583c8fd
Refine paddle.manual_seed (#26496)
5 years ago
Pei Yang e3f8e5cf5c
trt int8 support conv2d_transpose (#26636)
5 years ago
ShenLiang 29494d703d
fix remainder, floor_div (#26732)
5 years ago
zhangchunle 623a4c2e56
fix ci coverage build error (#26761)
5 years ago
lilong12 5f524efe56
modify error report message, test=develop (#26743)
5 years ago
wangchaochaohu 4561fc37e2
Add check point for gather Op (#26696)
5 years ago
joanna.wozna.intel eb097d64f6
Fix int8 performace drop cpu_quantize_placement_pass (#26715)
5 years ago
joanna.wozna.intel 02083bda40
Add mkldnn bfloat16 option to C-API (#26676)
5 years ago
LutaoChu 1ec30cb160
register cumsum Op version for compatible Op upgrades (#26734)
5 years ago
Jack Zhou c282db3a93
add broadcast feature for elementwise logical op
5 years ago
Yang Zhang 63eef7632e
Fix clip input check (#26683)
5 years ago
Zhen Wang f9066e6a6f
Update the demo code and the doc of varbase.backward. (#26506)
5 years ago
Wilber 1c898b66d6
add bug fix enum. (#26736)
5 years ago
Zhou Wei 8071d23073
fix bug that can't print int8_t (#26712)
5 years ago
joejiong f311d3c1cf
Fix pow api type error with python side method, merge elementwise_pow and pow. (#26163)
5 years ago
yongqiangma e4cc6a28b0
Norm op support 2-axis (#26492)
5 years ago
chalsliu dc56c89822
Add the option to execute unit tests only at night (#26669)
5 years ago
xiaoting 89d7d86684
add intepolte_v2 (#26520)
5 years ago
Adam Osewski c2c689582e
Update Paddle-Lite commit hash. (#26413)
5 years ago
Zhang Ting 97cebfa4d3
add dtype for unique (#26655)
5 years ago
lilong12 1c68138327
[api 2.0] add collective op for cpu using gloo and paddle.distributed.* apis (#26552)
5 years ago
joanna.wozna.intel 559e43eee4
Small change in conv2d and quantize pass (#26671)
5 years ago
Bai Yifan 8986a82131
fix adaptive gpu grad bug, add doc refine (#26660)
5 years ago
wawltor 286eca2d9e
update the code for the topk v2
5 years ago
whs f82384113b
Fix atomicAdd in grid sample op and affine grid op (#26647)
5 years ago
Wilber 32ba8602c6
Enhance py_func error info message. (#26557)
5 years ago
chalsliu cb3f131f1c
Set timeout properity for a few unitests
5 years ago
石晓伟 32ceacf317
update op_version_registry, test=develop (#26644)
5 years ago
Dong Daxiang 08d736ad78
【paddle.fleet】add cudnn related strategies to DistributedStrategy (#26598)
5 years ago
Zhang Ting 0a895bc0df
improve unique op (#26537)
5 years ago
whs a004dfde3d
Use atomicAdd defined in paddle fromework (#26631)
5 years ago
LoveAn 02fc1fef8b
Fix the cmake-function named inference_download_and_uncompress on Windows (#26512)
5 years ago
YUNSHEN XIE a8b5741fb4
add a few unittests for setting timeout properity (#26630)
5 years ago
wanghuancoder c1f5df5269
optimized transformation form tensor to numpy (#26447)
5 years ago
zhupengyang c80fcf901e
reduce_mean error if keepdim=True and reduce_all=True (#26614)
5 years ago
whs a065a24232
【2.0 API】Enhance affine grid operator (#26385)
5 years ago
Qi Li 6f69fbc8ea
fix elu grad whne alpha less then zero, test=develop (#26543)
5 years ago
whs 786373ba29
Use atomicAdd defined in paddle framework (#26628)
5 years ago
ruri 1f82c0cd62
[Api2.0] add pixel shuffle (#26071)
5 years ago
wanghuancoder 422a162019
api2.0 paddle.nn.Bilinear and paddle.nn.functional.bilinear (#26399)
5 years ago
wanghuancoder 6e823cfec3
add op_function_generator.exe retry in windows, test=develop (#26591)
5 years ago
石晓伟 fa08a834be
update op_version_registry, test=develop (#26592)
5 years ago
whs 79539cf198
【2.0 API】Add CUDA kernel and enhance options for grid_sample (#26576)
5 years ago
Guanghua Yu 8645591d66
support fp64 in huber_loss cuda kernel (#26583)
5 years ago
yaoxuefeng efee426742
support generator seed in related kernals test=develop (#26495)
5 years ago
Zhong Hui bf4a4636f1
change to use bce_loss op, add shape check for bce_loss
5 years ago
ShenLiang 0e81626081
add div, floor_div, remainder (#26562)
5 years ago
石晓伟 656e60b18f
new class: op_version_registry, test=develop (#26542)
5 years ago
qingqing01 24566e951c
Support empty bbox in bipartite math op (#26488)
5 years ago
Jack Zhou 199b0c7c1b
Add isfinite v2 op (#26344)
5 years ago
wangchaochaohu ebf9b2125e
add paddle.gather for API2.0 (#26455)
5 years ago
wangchaochaohu 9219b79104
gather_nd Op for API 2.0 refine (#26540)
5 years ago
zhupengyang 9b14117cac
logsumexp: impl kernel, refine docs (#26307)
5 years ago
Wojciech Uss 5c2b9258a6
Fix (de/re)quantize cache keys (#26549)
5 years ago
wawltor 6b28456ed0
add the argmax, argmin for the api2.0
5 years ago
LielinJiang d26ae9ad87
Update conv_transpose api (#26427)
5 years ago
lilong12 faa9b97b78
fix cscatter, test=develop (#26554)
5 years ago
WangXi 45711dade7
【API】rename div to divide, add floor_divide, remainder (#26434)
5 years ago
LutaoChu 4e0c6d91aa
add paddle.tensor.linalg.diag API, diag_v2 OP and CUDA kernel
5 years ago
zhupengyang f8863e0603
leaky_relu and LeakyReLU: alpha->negative_slope (#26216)
5 years ago
ShenLiang c609066074
Add Matmul op (#26411)
5 years ago
Leo Chen aa2a9b5d89
add bernoulli op (#26511)
5 years ago
Adam f3909020de
Add mechanism for blocking oneDNN cache clearing (#26502)
5 years ago
ShenLiang b6eb37f5b3
add error message for cholesky (#26444)
5 years ago
QingshuChen 138ecf24aa
support Baidu Kunlun AI Accelerator (#25959)
5 years ago
yaoxuefeng 4f259354d2
mod cvm test=develop (#25146)
5 years ago
wangchaochaohu e167e87974
【API2.0】add masked_select Op for API2.0 (#26374)
5 years ago
Pei Yang 379222c3f1
add output scale and trt op teller support for hard_swish and hard_sigmoid (#26499)
5 years ago
zhupengyang 6e5670b8bd
mean: not support int32, int64; add check for axis (#26401)
5 years ago
zhupengyang 4ad504e7c7
hardshrink: support threshold < 0 (#26403)
5 years ago
lilong12 e92f770c42
Add collective ops (reduce) (#26340)
5 years ago
wangchaochaohu bdb805505e
【API2.0】add numel API for paddle test=develop (#26311)
5 years ago
wangchaochaohu 2073ffc04d
Enhance the data type of linspace API (#26247)
5 years ago
hong19860320 40d193ed17
Add the ReLU6, Tanhshrink, SELU, Softplus, Softshrink and Softsign for the api 2.0 (#26376)
5 years ago
Chen Weihang 9108282883
Polish framework error message part 5 (#26204)
5 years ago
Zhaolong Xing f00f982a02
add cub impl for arg max, min (#25941)
5 years ago
Zhang Ting 6914a12f82
rename the inputs of allclose (#26360)
5 years ago
littletomatodonkey bcf03273f6
add pad func (#26106)
5 years ago
Chengmo eeeef957c7
Fix ps gpu (#26218)
5 years ago
Zhong Hui 6cbeafb6c0
add zero norm, inf norm support for p_norm op (#26364)
5 years ago
Zhaolong Xing b7a86e92a8
fix dy shape bug in trt7.1 (#26273)
5 years ago
ceci3 56890dc729
Add SyncBatchNorm (#26032)
5 years ago
GaoWei8 1fbee267d4
remove scope in cudnn lstm (#25188)
5 years ago
Pei Yang b757466b0d
fix trt dynamic ernie serialization unit test (#26228)
5 years ago
Wilber 3ec0bcbbb8
[Bug] Fix prune for save_inference_model about transformer (#25347)
5 years ago
cc 3f816bc8b4
[Quantization] Conv2d_transpose and mul support channnelwise quantization (#25639)
5 years ago
lilong12 638bbb6153
Improve expand as (#26290)
5 years ago
Thunderbrook a83e0f264c
fix heter proto (#26093)
5 years ago
Leo Chen 049ac56c08
Print user-friendly error message in core.ops [part 2] (#26377)
5 years ago
zhupengyang 586a6dd358
log_softmax and LogSoftmax: impl kernel and refind docs (#26088)
5 years ago
yaoxuefeng 23261ff44b
add cpu random Generator (#26013)
5 years ago
Sylwester Fraczek 69742bd9a4
Enable mkldnn layout conversion (#25778)
5 years ago
Leo Chen 672578a797
Print user-friendly error message in core.ops (#26261)
5 years ago
Jack Zhou 6d22f5c73e
Add PADDLE_ENFORCE in nll loss cuda kernel (#26294)
5 years ago
wangchaochaohu 0b81d76310
[API2.0] add op for cudnn version query test=develop (#26180)
5 years ago
lilong12 241b44db14
[API 2.0] adaptive expand op to use shape instead of expand_times (#26206)
5 years ago
wangchaochaohu bb11cbc250
[API2.0] add Device api (set_device and get_device)(#26103)
5 years ago
Zhou Wei 6de463d3d1
expose and unify the Tensor concepts to the user (#25978)
5 years ago
lilong12 fbd4d3cc97
[API 2.0] add paddle.tile op (#26245)
5 years ago
Zhou Wei 20147ace3f
fix_copy_if_different (#25868)
5 years ago
Wilber c84aa9c61f
update diff val. (#26242)
5 years ago
Yang Zhang a2d3e5c03b
Fix `paddle.abs` docstring (#25942)
5 years ago
Yang Zhang 22165934bc
Fix `paddle.acos` docstring (#25958)
5 years ago
Yang Zhang a5b5b00e02
Fix `paddle.asin` docstring (#25967)
5 years ago
Yang Zhang c758765769
Fix `paddle.atan` docstring (#25968)
5 years ago
Yang Zhang c4e480efc5
Fix `paddle.cos` docstring (#25969)
5 years ago
wawltor 2d6cc0b125
support the tuple for attribute of axis in min, max for api2.0
5 years ago
Dong Daxiang 50a5bcfc9d
【paddle.fleet】paddle.fleet -> paddle.distributed.fleet. (#26186)
5 years ago
Leo Chen ffe52b4452
[OpDevOptimize] Add common infershape functions (#26096)
5 years ago
Leo Chen 2d95280e1f
Feature/Enable Auto-Mixed-Precision in dynamic graph (#24903)
5 years ago
Chen Weihang 838e36e9ed
Fix loaded variable suffix repeat error (#26169)
5 years ago
Jack Zhou dea41da715
add nll loss API for the paddlepaddle api2.0
5 years ago
Wilber fb72b192e7
[DOC] Fix dead link (#26154)
5 years ago
wawltor 9c17b3c9f8
Add the max, min, maximum, minimum api for the API 2.0
5 years ago
JZ-LIANG 54003b873e
【paddle.fleet】add lamb to fleet meta optimizer (#26025)
5 years ago
Yiqun Liu 1be6bf45ae
Add assign to fusion_group and enhance inplace execution in fusion_group. (#26121)
5 years ago
lidanqing 65b97d6215
GRU model xnli dataset C++ tester (#25534)
5 years ago
Zhen Wang a86e8c0eef
add more error info for these ops without double grad ops. (#25987)
5 years ago
MRXLT 6559229b7e
fix encryption infer (#25979)
5 years ago
lilong12 8caee2ad51
【paddle.fleet】add the support for multi-node training for pipeline (#25907)
5 years ago
LutaoChu bf2db646de
fix cumsum op for API 2.0, optimize performance
5 years ago
Adam 1893cd6bb8
Add oneDNN relu6 op (#26037)
5 years ago
Zhaolong Xing 50f149a48e
fix cudnn workspace size problem during inference. (#26021)
5 years ago
tangwei12 c14ec8782b
【paddle.fleet】Feature/fleet ps api 2.0 (#25857)
5 years ago
Chen Weihang 3c8daa9b89
Add pin memory control for BufferedReader (#26026)
5 years ago
Chen Weihang ad4a0466a5
Add cuda pinned place branch in slice op GetExpectedKernelType (#26027)
5 years ago
Feiyu Chan e853ece0a2
update document template for unary elementwise layers (#25896)
5 years ago
joanna.wozna.intel 734cf1c3e9
Change use_quantizer attribute name and data type (#25838)
5 years ago
Leo Chen 5258d53d65
refine unsqueeze, test=develop (#25470)
5 years ago
tangwei12 3755564ae1
Fix/large scale fix (#25999)
5 years ago
Leo Chen 751305ecf0
Add flags to control call stack of error message (#25997)
5 years ago
Thunderbrook fd2947babf
fix compile error with mkl (#26030)
5 years ago
Leo Chen 0a47387bd8
Use static local variable instead of global variable for safty (#26018)
5 years ago
Pei Yang beb0ca5fab
Fix TRT plugin registry without TRT lib (#25982)
5 years ago
123malin 2191a08317
【paddle.fleet】fleet_util move to paddle.fleet (#25805)
5 years ago
yaoxuefeng 224620071b
add new flatten op test=develop (#25393)
5 years ago
Adam 68c6160e63
Add oneDNN fusion_gru kernel (#25594)
5 years ago
Thunderbrook 0cb60c700d
add heter ps mode (#25682)
5 years ago
Zhong Hui dca56f47f5
fix invalid read of pnorm gradient function
5 years ago
WangXi 2c9d0f3cb9
【paddle.fleet】Add dgc to fleet meta optimizer (#25738)
5 years ago
Zhaolong Xing 358bc06c72
[CUDNN8 support] : support CUDNN8 (#25664)
5 years ago
Zhaolong Xing 5970871a64
add eltwise clip cuda impl. (#25689)
5 years ago
Zhen Wang 82374dc12f
Add some error messages for the op without double grads. (#25951)
5 years ago
Pei Yang b717895f64
Fix registering trt plugin (#25744)
5 years ago
wawltor a697e94693
Update the code of the compare ops for the broadcast function
5 years ago
Chen Weihang 9b5a65b819
refine init signal handler meg dumper (#25911)
5 years ago
wangchaochaohu ff717d5158
Add support for tuple of concat Op test=develop (#25800)
5 years ago
tangwei12 253fd407e8
Fix/distibuted heart beat (#25902)
5 years ago
WangXi a6c87fd091
Add amp to fleet meta optimizer, test=develop (#25770)
5 years ago
Pei Yang 9e9a569dae
add trt int8 support for elementwise_mul and scale (#25676)
5 years ago
xujiaqi01 d11c140e28
fix dump, fix cvm check (#25400)
5 years ago
JZ-LIANG 8ebffc78c9
add lars to fleet meta optimizer (#25884)
5 years ago
Dong Daxiang 8d2896f1fe
【paddle.fleet】Fleet run graph in Executor and add two more strategies (#25844)
5 years ago
Zhang Ting 6486fe8a94
improve GPU performance of transpose, test=develop (#25862)
5 years ago
Zhang Ting 2d24f56a7a
avoid data transfer, test=develop (#25810)
5 years ago
ShenLiang bca303165a
fix inverse bug (#25641)
5 years ago
Chen Weihang 48b9a56f1c
Polish framework error message - part 4 (#25807)
5 years ago
Aurelius84 e52dae6ef6
Using input.place() in GetExpectedKernel in slice_op (#25595)
5 years ago
wawltor 595a719795
Update the api for the compare_ops
5 years ago
wangchaochaohu 32b9577b2a
refine the split op for API 2.0 test=develop (#25320)
5 years ago
lilong12 ce506930c3
Fix the bug that Input(Offsets) and attr(offsets) cannot be set at the same time. (#24975)
5 years ago
tangwei12 2d9dbd31ad
Fix/mkl dnn (#25835)
5 years ago
Zhaolong Xing bcddefef39
[Fix Ut]: fix inference ut which exist bug on windows. (#25814)
5 years ago
lilong12 5f30e57cdd
fix test_pipeline, test=develop (#25808)
5 years ago
Chen Weihang d47304e6d9
Refine paddle error stack format (#25790)
5 years ago
tangwei12 caa90a6510
Integrated Trainer of Parameter Server (API add `fluid.contrib.layers.sparse_embedding` only) (#22957)
5 years ago
hong c2a21ca9c9
Fix dygraph grad bugs (#25781)
5 years ago