Commit Graph

10423 Commits (e9651068f04757a19bc6f704aa82b7eefc876a8c)

Author SHA1 Message Date
LoveAn 02fc1fef8b
Fix the cmake-function named inference_download_and_uncompress on Windows (#26512)
5 years ago
YUNSHEN XIE a8b5741fb4
add a few unittests for setting timeout properity (#26630)
5 years ago
wanghuancoder c1f5df5269
optimized transformation form tensor to numpy (#26447)
5 years ago
zhupengyang c80fcf901e
reduce_mean error if keepdim=True and reduce_all=True (#26614)
5 years ago
whs a065a24232
【2.0 API】Enhance affine grid operator (#26385)
5 years ago
Qi Li 6f69fbc8ea
fix elu grad whne alpha less then zero, test=develop (#26543)
5 years ago
whs 786373ba29
Use atomicAdd defined in paddle framework (#26628)
5 years ago
ruri 1f82c0cd62
[Api2.0] add pixel shuffle (#26071)
5 years ago
wanghuancoder 422a162019
api2.0 paddle.nn.Bilinear and paddle.nn.functional.bilinear (#26399)
5 years ago
wanghuancoder 6e823cfec3
add op_function_generator.exe retry in windows, test=develop (#26591)
5 years ago
石晓伟 fa08a834be
update op_version_registry, test=develop (#26592)
5 years ago
whs 79539cf198
【2.0 API】Add CUDA kernel and enhance options for grid_sample (#26576)
5 years ago
Guanghua Yu 8645591d66
support fp64 in huber_loss cuda kernel (#26583)
5 years ago
yaoxuefeng efee426742
support generator seed in related kernals test=develop (#26495)
5 years ago
Zhong Hui bf4a4636f1
change to use bce_loss op, add shape check for bce_loss
5 years ago
ShenLiang 0e81626081
add div, floor_div, remainder (#26562)
5 years ago
石晓伟 656e60b18f
new class: op_version_registry, test=develop (#26542)
5 years ago
qingqing01 24566e951c
Support empty bbox in bipartite math op (#26488)
5 years ago
Jack Zhou 199b0c7c1b
Add isfinite v2 op (#26344)
5 years ago
wangchaochaohu ebf9b2125e
add paddle.gather for API2.0 (#26455)
5 years ago
wangchaochaohu 9219b79104
gather_nd Op for API 2.0 refine (#26540)
5 years ago
zhupengyang 9b14117cac
logsumexp: impl kernel, refine docs (#26307)
5 years ago
Wojciech Uss 5c2b9258a6
Fix (de/re)quantize cache keys (#26549)
5 years ago
wawltor 6b28456ed0
add the argmax, argmin for the api2.0
5 years ago
LielinJiang d26ae9ad87
Update conv_transpose api (#26427)
5 years ago
lilong12 faa9b97b78
fix cscatter, test=develop (#26554)
5 years ago
WangXi 45711dade7
【API】rename div to divide, add floor_divide, remainder (#26434)
5 years ago
LutaoChu 4e0c6d91aa
add paddle.tensor.linalg.diag API, diag_v2 OP and CUDA kernel
5 years ago
zhupengyang f8863e0603
leaky_relu and LeakyReLU: alpha->negative_slope (#26216)
5 years ago
ShenLiang c609066074
Add Matmul op (#26411)
5 years ago
Leo Chen aa2a9b5d89
add bernoulli op (#26511)
5 years ago
Adam f3909020de
Add mechanism for blocking oneDNN cache clearing (#26502)
5 years ago
ShenLiang b6eb37f5b3
add error message for cholesky (#26444)
5 years ago
QingshuChen 138ecf24aa
support Baidu Kunlun AI Accelerator (#25959)
5 years ago
yaoxuefeng 4f259354d2
mod cvm test=develop (#25146)
5 years ago
wangchaochaohu e167e87974
【API2.0】add masked_select Op for API2.0 (#26374)
5 years ago
Pei Yang 379222c3f1
add output scale and trt op teller support for hard_swish and hard_sigmoid (#26499)
5 years ago
zhupengyang 6e5670b8bd
mean: not support int32, int64; add check for axis (#26401)
5 years ago
zhupengyang 4ad504e7c7
hardshrink: support threshold < 0 (#26403)
5 years ago
lilong12 e92f770c42
Add collective ops (reduce) (#26340)
5 years ago
wangchaochaohu bdb805505e
【API2.0】add numel API for paddle test=develop (#26311)
5 years ago
wangchaochaohu 2073ffc04d
Enhance the data type of linspace API (#26247)
5 years ago
hong19860320 40d193ed17
Add the ReLU6, Tanhshrink, SELU, Softplus, Softshrink and Softsign for the api 2.0 (#26376)
5 years ago
Chen Weihang 9108282883
Polish framework error message part 5 (#26204)
5 years ago
Zhaolong Xing f00f982a02
add cub impl for arg max, min (#25941)
5 years ago
Zhang Ting 6914a12f82
rename the inputs of allclose (#26360)
5 years ago
littletomatodonkey bcf03273f6
add pad func (#26106)
5 years ago
Chengmo eeeef957c7
Fix ps gpu (#26218)
5 years ago
Zhong Hui 6cbeafb6c0
add zero norm, inf norm support for p_norm op (#26364)
5 years ago
Zhaolong Xing b7a86e92a8
fix dy shape bug in trt7.1 (#26273)
5 years ago
ceci3 56890dc729
Add SyncBatchNorm (#26032)
5 years ago
GaoWei8 1fbee267d4
remove scope in cudnn lstm (#25188)
5 years ago
Pei Yang b757466b0d
fix trt dynamic ernie serialization unit test (#26228)
5 years ago
Wilber 3ec0bcbbb8
[Bug] Fix prune for save_inference_model about transformer (#25347)
5 years ago
cc 3f816bc8b4
[Quantization] Conv2d_transpose and mul support channnelwise quantization (#25639)
5 years ago
lilong12 638bbb6153
Improve expand as (#26290)
5 years ago
Thunderbrook a83e0f264c
fix heter proto (#26093)
5 years ago
Leo Chen 049ac56c08
Print user-friendly error message in core.ops [part 2] (#26377)
5 years ago
zhupengyang 586a6dd358
log_softmax and LogSoftmax: impl kernel and refind docs (#26088)
5 years ago
yaoxuefeng 23261ff44b
add cpu random Generator (#26013)
5 years ago
Sylwester Fraczek 69742bd9a4
Enable mkldnn layout conversion (#25778)
5 years ago
Leo Chen 672578a797
Print user-friendly error message in core.ops (#26261)
5 years ago
Jack Zhou 6d22f5c73e
Add PADDLE_ENFORCE in nll loss cuda kernel (#26294)
5 years ago
wangchaochaohu 0b81d76310
[API2.0] add op for cudnn version query test=develop (#26180)
5 years ago
lilong12 241b44db14
[API 2.0] adaptive expand op to use shape instead of expand_times (#26206)
5 years ago
wangchaochaohu bb11cbc250
[API2.0] add Device api (set_device and get_device)(#26103)
5 years ago
Zhou Wei 6de463d3d1
expose and unify the Tensor concepts to the user (#25978)
5 years ago
lilong12 fbd4d3cc97
[API 2.0] add paddle.tile op (#26245)
5 years ago
Zhou Wei 20147ace3f
fix_copy_if_different (#25868)
5 years ago
Wilber c84aa9c61f
update diff val. (#26242)
5 years ago
Yang Zhang a2d3e5c03b
Fix `paddle.abs` docstring (#25942)
5 years ago
Yang Zhang 22165934bc
Fix `paddle.acos` docstring (#25958)
5 years ago
Yang Zhang a5b5b00e02
Fix `paddle.asin` docstring (#25967)
5 years ago
Yang Zhang c758765769
Fix `paddle.atan` docstring (#25968)
5 years ago
Yang Zhang c4e480efc5
Fix `paddle.cos` docstring (#25969)
5 years ago
wawltor 2d6cc0b125
support the tuple for attribute of axis in min, max for api2.0
5 years ago
Dong Daxiang 50a5bcfc9d
【paddle.fleet】paddle.fleet -> paddle.distributed.fleet. (#26186)
5 years ago
Leo Chen ffe52b4452
[OpDevOptimize] Add common infershape functions (#26096)
5 years ago
Leo Chen 2d95280e1f
Feature/Enable Auto-Mixed-Precision in dynamic graph (#24903)
5 years ago
Chen Weihang 838e36e9ed
Fix loaded variable suffix repeat error (#26169)
5 years ago
Jack Zhou dea41da715
add nll loss API for the paddlepaddle api2.0
5 years ago
Wilber fb72b192e7
[DOC] Fix dead link (#26154)
5 years ago
wawltor 9c17b3c9f8
Add the max, min, maximum, minimum api for the API 2.0
5 years ago
JZ-LIANG 54003b873e
【paddle.fleet】add lamb to fleet meta optimizer (#26025)
5 years ago
Yiqun Liu 1be6bf45ae
Add assign to fusion_group and enhance inplace execution in fusion_group. (#26121)
5 years ago
lidanqing 65b97d6215
GRU model xnli dataset C++ tester (#25534)
5 years ago
Zhen Wang a86e8c0eef
add more error info for these ops without double grad ops. (#25987)
5 years ago
MRXLT 6559229b7e
fix encryption infer (#25979)
5 years ago
lilong12 8caee2ad51
【paddle.fleet】add the support for multi-node training for pipeline (#25907)
5 years ago
LutaoChu bf2db646de
fix cumsum op for API 2.0, optimize performance
5 years ago
Adam 1893cd6bb8
Add oneDNN relu6 op (#26037)
5 years ago
Zhaolong Xing 50f149a48e
fix cudnn workspace size problem during inference. (#26021)
5 years ago
tangwei12 c14ec8782b
【paddle.fleet】Feature/fleet ps api 2.0 (#25857)
5 years ago
Chen Weihang 3c8daa9b89
Add pin memory control for BufferedReader (#26026)
5 years ago
Chen Weihang ad4a0466a5
Add cuda pinned place branch in slice op GetExpectedKernelType (#26027)
5 years ago
Feiyu Chan e853ece0a2
update document template for unary elementwise layers (#25896)
5 years ago
joanna.wozna.intel 734cf1c3e9
Change use_quantizer attribute name and data type (#25838)
5 years ago
Leo Chen 5258d53d65
refine unsqueeze, test=develop (#25470)
5 years ago
tangwei12 3755564ae1
Fix/large scale fix (#25999)
5 years ago
Leo Chen 751305ecf0
Add flags to control call stack of error message (#25997)
5 years ago
Thunderbrook fd2947babf
fix compile error with mkl (#26030)
5 years ago
Leo Chen 0a47387bd8
Use static local variable instead of global variable for safty (#26018)
5 years ago
Pei Yang beb0ca5fab
Fix TRT plugin registry without TRT lib (#25982)
5 years ago
123malin 2191a08317
【paddle.fleet】fleet_util move to paddle.fleet (#25805)
5 years ago
yaoxuefeng 224620071b
add new flatten op test=develop (#25393)
5 years ago
Adam 68c6160e63
Add oneDNN fusion_gru kernel (#25594)
5 years ago
Thunderbrook 0cb60c700d
add heter ps mode (#25682)
5 years ago
Zhong Hui dca56f47f5
fix invalid read of pnorm gradient function
5 years ago
WangXi 2c9d0f3cb9
【paddle.fleet】Add dgc to fleet meta optimizer (#25738)
5 years ago
Zhaolong Xing 358bc06c72
[CUDNN8 support] : support CUDNN8 (#25664)
5 years ago
Zhaolong Xing 5970871a64
add eltwise clip cuda impl. (#25689)
5 years ago
Zhen Wang 82374dc12f
Add some error messages for the op without double grads. (#25951)
5 years ago
Pei Yang b717895f64
Fix registering trt plugin (#25744)
5 years ago
wawltor a697e94693
Update the code of the compare ops for the broadcast function
5 years ago
Chen Weihang 9b5a65b819
refine init signal handler meg dumper (#25911)
5 years ago
wangchaochaohu ff717d5158
Add support for tuple of concat Op test=develop (#25800)
5 years ago
tangwei12 253fd407e8
Fix/distibuted heart beat (#25902)
5 years ago
WangXi a6c87fd091
Add amp to fleet meta optimizer, test=develop (#25770)
5 years ago
Pei Yang 9e9a569dae
add trt int8 support for elementwise_mul and scale (#25676)
5 years ago
xujiaqi01 d11c140e28
fix dump, fix cvm check (#25400)
5 years ago
JZ-LIANG 8ebffc78c9
add lars to fleet meta optimizer (#25884)
5 years ago
Dong Daxiang 8d2896f1fe
【paddle.fleet】Fleet run graph in Executor and add two more strategies (#25844)
5 years ago
Zhang Ting 6486fe8a94
improve GPU performance of transpose, test=develop (#25862)
5 years ago
Zhang Ting 2d24f56a7a
avoid data transfer, test=develop (#25810)
5 years ago
ShenLiang bca303165a
fix inverse bug (#25641)
5 years ago
Chen Weihang 48b9a56f1c
Polish framework error message - part 4 (#25807)
5 years ago
Aurelius84 e52dae6ef6
Using input.place() in GetExpectedKernel in slice_op (#25595)
5 years ago
wawltor 595a719795
Update the api for the compare_ops
5 years ago
wangchaochaohu 32b9577b2a
refine the split op for API 2.0 test=develop (#25320)
5 years ago
lilong12 ce506930c3
Fix the bug that Input(Offsets) and attr(offsets) cannot be set at the same time. (#24975)
5 years ago
tangwei12 2d9dbd31ad
Fix/mkl dnn (#25835)
5 years ago
Zhaolong Xing bcddefef39
[Fix Ut]: fix inference ut which exist bug on windows. (#25814)
5 years ago
lilong12 5f30e57cdd
fix test_pipeline, test=develop (#25808)
5 years ago
Chen Weihang d47304e6d9
Refine paddle error stack format (#25790)
5 years ago
tangwei12 caa90a6510
Integrated Trainer of Parameter Server (API add `fluid.contrib.layers.sparse_embedding` only) (#22957)
5 years ago
hong c2a21ca9c9
Fix dygraph grad bugs (#25781)
5 years ago
cc 42189be67b
[Quant] Remove the output for moving_average_abs_max_scale op (#25697)
5 years ago
Dong Daxiang a96d54ac19
Generate final strategy (#25782)
5 years ago
Chen Weihang 2469b578f5
Unified paddle error format when catch system signal (#25765)
5 years ago
Zhou Wei b484a59c39
fix copy file random fail on windows (#25731)
5 years ago
Chen Weihang 23d1228c4d
remove ProgramTranslator.save_inference_model (#25740)
5 years ago
Chen Weihang 1b3081b1b4
Simplify BufferedReader to improve DataLoader performance (#25648)
5 years ago
Pei Yang 55b6205ddf
add set_mkldnn_cache_capacity python api(#25524)
5 years ago
Zhou Wei e0a9115e28
fix random compile failure due to missing file (#25661)
5 years ago
Pei Yang eef98b7f86
add macro check for using TRT api dynamicRangeIsSet() (#25694)
5 years ago
Pei Yang f82baed866
fix trt instance norm plugin on gcc8. test=develop (#25730)
5 years ago
Dong Daxiang 920d998f1e
add more settings for distributed strategy (#25685)
5 years ago
Sylwester Fraczek 1aaa26f102
add dnnl sigmoid (logistic) activation (#25745)
5 years ago
Chen Weihang c34c80d302
Polish framework error message part3 (#25701)
5 years ago
arlesniak e52df3b125
Added DNNL cache management for DyGraph (#25624)
5 years ago
wangchaochaohu 1e4ab728fb
refine the concat Op for API 2.0 test=develop (#25307)
5 years ago
Zhen Wang cea5086853
Fix the double grad bug for the star gan. (#25655)
5 years ago
Chen Weihang 364cc53618
Polish paddle fluid framework error message - part2 (#25667)
5 years ago
Adam 98899b73d2
Fix FC + GRU fuse pass (#25687)
5 years ago
wanghuancoder 1917b38099
fix some errmsg report,in framework/ir/, about 21 files (#25525)
5 years ago
Leo Chen 4ec1251a1e
Refine squeeze, test=develop (#25281)
5 years ago
joanna.wozna.intel e5bbffa84c
Add NOMINMAX define due to windows.h max/min macro conflict (#25637)
5 years ago
cnn 70cee22fde
New features, add sinh and cosh op, test=develop (#25495)
5 years ago
Zhang Ting a1350744eb
register fp16 kernel, test=develop (#25630)
5 years ago
mapingshuo 5453a912fe
add fp64 support in sequence_pool, test=develop (#25662)
5 years ago
Leo Chen 417b243968
fix best_fit_allocator_test on windows, test=develop (#25650)
5 years ago
GaoWei8 6e86fd3750
fix concat dimension (#25606)
5 years ago
donproc 95fa383df2
optimize embedding cuda kernel lookup_table_v2,test=develop (#25587)
5 years ago
石晓伟 7206417259
supports xpu runtime, test=develop (#25554)
5 years ago
Chen Weihang dfb3ae1b9b
Polish some error message in framework holder - Part 1 (#25509)
5 years ago
Zhang Ting 30d1ff3bb4
call cublasGemmStridedBatchedEx when using fp16, test=develop (#25553)
5 years ago
Zhaolong Xing 9df18b08f3
Disable windows static library generation (#25593)
5 years ago
Aurelius84 ca1185d06b
[Dy2Stat] Fix scope in run_program_op (#25579)
5 years ago
Chen Weihang a6abd92dfd
Polish install error hint message (#25531)
5 years ago
wanghuancoder 9b46fe0440
fix some errmsg report,in framework/ir/, about 5 files (#25539)
5 years ago
Dong Daxiang e657d7062d
fleet base initial implementation and the API (#25442)
5 years ago
Jacek Czaja 7dbc441eab
[oneDNN] cache cosmetics improvement (#25576)
5 years ago
hong e362095e45
fix softmax with cross entropy out of bound; test=develop (#25549)
5 years ago
Huihuang Zheng d8fe517bf8
Add Support for SelectedRows for Transpose OP and Fix a Bug That SelectedRows Cannot be Supported in SimNet (#25536)
5 years ago
Wilber 848aca7ae8
[CI] [Lite-Subgraph] CI add lite subgraph check. (#25346)
5 years ago
wanghuancoder e65c5b8e83
fix some errmsg report, in framework/ir/ (#25471)
5 years ago
Shibo Tao 71c71e684c
fix logical_* ops' doc (#25479)
5 years ago
Aurelius84 4717bdbcfb
Fix hang in seq_topk_avg_pooling op (#25522)
5 years ago
LielinJiang 7129f544f0
Add bilateral_slice op (#25401)
5 years ago
GaoWei8 c10dcff12d
refine PADDLE_ENFORCE (#25456)
5 years ago
wanghuancoder 6c0982b942
fix some errmsg report, in framework/ir/mkldnn (#25467)
5 years ago
wanghuancoder fce6466217
fix some errmsg report, in framework/ir/ subdir(memory,optimizer,multi_device) (#25460)
5 years ago
Zhang Ting ca725c82f2
improve fp16 performance of slice_grad, test=develop (#25523)
5 years ago
yaoxuefeng 5d3766ff3d
modify flip test=develop (#25312)
5 years ago
Chen Weihang 41d2247275
[Dy2static] Refactor ProgramTranslator save_inference_model API (#24989)
5 years ago
Pei Yang 43f9f180e5
Add api to clear intermediate tensors in AnalysisPredictor (#25069)
5 years ago
yaoxuefeng aaa7cbd56f
modify trace api test=develop (#25397)
5 years ago
Huihuang Zheng f9ac5fb992
[Dy2stat] Fix Memory Optimization in run_program_op and Add SimNet as Unit Test (#25383)
5 years ago
yaoxuefeng c42d662e2a
modify roll test=develop (#25321)
5 years ago
Zhen Wang 548cdbc544
Quantization-aware training for dygraph (#24634)
5 years ago
Chen Weihang 0b54d54fd8
Fix index overflow bug of the CUDA kernel loop increment (#25435)
5 years ago
zlsh80826 e528392de9
[Paddle-TRT] SkipLayernorm vectorized memory optimization (#25117)
5 years ago
Chen Weihang 4061aa6488
Polish ParallelExecutor exception process logic (#25449)
5 years ago
Jeng Bai-Cheng fc93266b0a
Improve qkv transpose performance (#23919)
5 years ago
zhupengyang 5b573c58e2
randperm API: remove out, devive, stop_gradient; add name (#25410)
5 years ago
Chen Weihang 7be285a66f
remove useless property, test=develop (#25461)
5 years ago
Jacek Czaja a5d1592f6c
Added missing oneDNN format (#25450)
5 years ago
Chen Weihang 172d4ecb6c
remove WITH_DSO compile option (#25444)
5 years ago
Zhen Wang bb45af02ac
add the c++ part of Imperative QAT. test=develop (#25446)
5 years ago
Jacek Czaja 050a9bf79d
[oneDNN] LRN cleanup (#25416)
5 years ago
GaoWei8 1974aadcf0
fix concat shape error (#25414)
5 years ago
tangwei12 4b3778a3ee
Revert/barrier for sync (#25417)
5 years ago
ceci3 52be62c5ae
fix instance norm in dy (#24717)
5 years ago
lilong12 e39aa70ec7
add the support for pipeline (#24560)
5 years ago
hong 70d7d07fea
catch bad alloc exception (#25140)
5 years ago
gongweibao 80f1c50738
Fix typo in interface. (#24779)
5 years ago
Zhaolong Xing 7b7e605189
[Fix BUGs]: fix multhead matmul pass's instable bug (#25123)
5 years ago
zhupengyang eb3173e2b6
rand API: remove out, device, stop_gradient; add name (#25246)
5 years ago
GaoWei8 ea7e532598
Refine PADDLE_ENFORCE (#25369)
5 years ago
zhupengyang 6de75082cb
fix test_hsigmoid windows ci (#25311)
5 years ago
Dong Daxiang d5e40d1ba9
Paddle fleet distributed strategy (#25379)
5 years ago
WuHaobo f593c3fb2f
fix the formula of floor OP and ceil OP (#25292)
5 years ago
Wojciech Uss d0a921ba98
Quant2 updates and fixes (#25313)
5 years ago
Zhang Ting bc7610583b
use eval() to improve CPU performance (#25243)
5 years ago
lilong12 3d96601b82
modify pipeline optimizer to only support the mode of sync pipeline training (#25065)
5 years ago
Kaipeng Deng 74468bf428
add mish op. (#24565)
5 years ago
Chen Weihang f07b25d8e5
fix DataLoader.generrator using error, test=develop (#25355)
5 years ago
GaoWei8 fb70682f00
fix PADDLE_ENFORCE (#25297)
5 years ago
Yang Zhang 6d6efafeeb
Add `matrix_nms_op` (#24400)
5 years ago
Chen Weihang 5a959f6e6e
Refactor dynamic dso search functions (#25214)
5 years ago
Jacek Czaja 17c751bec6
[oneDNN] Fix to #25078 (#25256)
5 years ago
MRXLT 3b8f0a64c2
Encryption infer (#25119)
5 years ago
Wilber 4474fc1033
fix compile on windows. test=develop (#25310)
5 years ago
Aurelius84 bc2bd3c1ed
modify into eager_tmp of Base Class test=develop (#25323)
5 years ago
Chengmo e85fcaa712
Fix fluid.embedding in Distributed Training (#25174)
5 years ago
Aurelius84 494cb36d09
Modify tmp var name prefix in dygraph (#25280)
5 years ago
Wilber 0371cf6f94
fix compile for lite subgraph. test=develop (#25285)
5 years ago
Yiqun Liu c00f827843
Avoid data transforming ShapeTensor from CPU to GPU in fill_constant op. (#25267)
5 years ago
Wojciech Uss 23a4f54b73
rename qat into quant (#24948)
5 years ago
123malin f1a9593d69
test=develop, bug fix for index_select and roll op (#25251)
5 years ago
FDInSky c2e072587c
test=develop fix generate_proposals's error (#25227)
5 years ago
Sylwester Fraczek 36abeff44f
adding elementwiseadd quantization (#25178)
5 years ago
Wojciech Uss 56fa3880e3
rename qat into quant in filenames only (#25194)
5 years ago
Wilber 4c964abdf7
support build on arm. test=develop (#25212)
5 years ago
Wilber f78e161ea3
remove paddle_use_kernel and paddle_use_op. test=develop (#25189)
5 years ago
liym27 1458cc0c68
Fix bug: Don't check dims if contain_unknown_dim of cross_entropy_grad_op in compile time (#25221)
5 years ago
liu zhengxi 68e93d8a17
Fix beam_search InferShape (#25169)
5 years ago
Chen Weihang 353ea9e8ad
Add default cudnn lib path (#25175)
5 years ago
Leo Chen ff5be2fb77
Refine error message in memory folder (#25095)
5 years ago
Adam bd0b38e671
Refactor of conv fp32 oneDNN operator (#25137)
5 years ago
Pei Yang b2f5a149e7
[Paddle-TRT] Better Paddle-TensorRT support for PaddleSlim quant models (#25097)
5 years ago
Tao Luo 2996315fc9
fix profiler_test on win32 (#25073)
5 years ago
Shibo Tao 19c4db1b56
don't re-generate header file if content doesn't change (#25130)
5 years ago
iducn f282599229
disable unitest for gcc8(#25134)
5 years ago
tianshuo78520a 1eb9ee242b
delete buddy_allocator_test_data to make repo clean (#25046)
5 years ago
Chen Weihang b23801a262
polish tensor set error messag, test=develop (#25113)
5 years ago
Jacek Czaja a7944904d3
[oneDNN]elementwise_add and elementwise_mul int8 support (#24984)
5 years ago
Zhaolong Xing 843581154f
fix emb eltwise layernorm (#24873)
5 years ago
石晓伟 9ab3cf039c
remove useless test_dot, test=develop (#24957)
5 years ago
石晓伟 6783441e70
fix repeat definitions in liengine.cc, test=develop (#25020)
5 years ago
Leo Chen fa657b3dbb
fix bug of prelu when rank not equal 4, test=develop (#25067)
5 years ago
zlsh80826 479c8834f7
[Paddle-TRT] Fixes #24731, opt for SoftmaxKernelWithEltadd kernel, test=develop (#24834)
5 years ago
hutuxian 5822862d8a
Monitor Framework (#24079)
5 years ago
Leo Chen 028de857d4
fix dtype error of compare op, test=develop (#25059)
5 years ago
Jeng Bai-Cheng bef4afa6de
bugfix for unique_ptr of IOptimizationProfile (#23917)
5 years ago
zlsh80826 49e4ee27e1
[Paddle-TRT] slice kernel optimization (#24783)
5 years ago
tianshuo78520a 770c11a117
fix make device_context error (#25045)
5 years ago
tangwei12 be6a315fbd
Fix/sync barrier (#25016)
5 years ago
ceci3 8db66fc3f6
fix cos_sim, test=develop (#25017)
5 years ago
Leo Chen 25a4dac4c2
Use allow list instead of white list (#25002)
5 years ago
Zhang Ting 621b638550
improve performance of instance_norm, test=develop (#25005)
5 years ago
hutuxian 1c224e26af
support CMatchAuc (#24990)
5 years ago
Leo Chen bfa46c38d5
bn supports reverse_space, test=develop (#24988)
5 years ago
wangchaochaohu 613303dbf6
refine the slice Op to improve the performance of xlnet for fp16 training (#24967)
5 years ago
silingtong123 37bdb5269f
test=develop, add log message in the function UpdateDllFlag (#24937)
5 years ago
Chen Weihang d152d7231e
clear old var in scope, test=develop (#24976)
5 years ago
Sylwester Fraczek 53d563a0fe
Reshape transpose matmul coverage (#24970)
5 years ago
wawltor 0eb1b0bc01
Add support the 5d, 6d tensor support for the reduce ops
5 years ago
liuwei1031 8603b5fb72
fix randomly hang issue of PaddleDetection training task on windows (#24977)
5 years ago
silingtong123 640196c446
test=develop, remove the tensorrt dll file from windows package (#24922)
5 years ago
wangchaochaohu feba131893
fix the sgement fault error of profiler in seqseq model test=develop (#24952)
5 years ago
Sylwester Fraczek a7ee634b45
fix WARNING: ThreadSanitizer: heap-use-after-free (#24929)
5 years ago
mapingshuo 24e24987f0
fixes the place info in the Print op (#24934)
5 years ago
Aurelius84 6be0ee159e
Support LoDTensorArray in reverse_op (#24797)
5 years ago
Leo Chen 6190023ac9
Refine error message in pybind folder (#24886)
5 years ago
Zhou Wei 4058e736ff
temporarily disable these unittests failed on windows (#24942)
5 years ago
Leo Chen a7cb97a1a5
Fix/isfinite on windows (#24927)
5 years ago
silingtong123 ef9b36873d
test=develop, remove the gflags/gflags.h form paddle_api.h (#24921)
5 years ago
whs 4c01d6d53e
Enhance checking in some operator. (#24473)
5 years ago
Chen Weihang 4a702ef361
Support SelelctedRows allreduce in multi-cards imperative mode (#24690)
5 years ago
Pei Yang 14b8540551
add default ctor for AnalysisConfig python api. test=develop (#24924)
5 years ago
silingtong123 fc4435174b
test=develop, fix the bug of tensorrt package can't compile on windows (#24860)
5 years ago
lilong12 29de0d97a5
add the support to specify device index for device_guard (#24555)
5 years ago
lilong12 6e10022781
add queue_generator_op, dequeue_op, enqueue_op and ut (#24481)
5 years ago
hutuxian b8f17a049d
fix problem in dump and add log (#24891)
5 years ago
石晓伟 76cdbb84b0
ignore warnings of external libraries, test=develop (#24193)
5 years ago
Leo Chen 1e818158f5
Feature/add amp_checkout_finite_and_scale op (#24875)
5 years ago
leesusu a6beb96dd0
FTRL with sparse update, test=develop (#22092)
5 years ago
Chen Weihang 6aae034f8d
add dep for fs.cc, test=develop, test=document_fix (#24881)
5 years ago
Jacek Czaja 40a5f3fd86
[oneDNN] Clearing mkldnn cache in naiveexecutor destructor (#24756)
5 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
5 years ago
Chen Weihang a4f6003404
append try-catch to opbase run, test=develop (#24870)
5 years ago
Michał Gallus 23a85f030c
Remove old mkldnn_elementwise_mul test (#24855)
5 years ago
123malin 9d2bd0ac38
downpour_worker: add try_catch mechanism and print all program parameters (#24700)
5 years ago
Michał Gallus b2ba830eeb
Fix bug in ShareBufferWith causing eltwise_add inplace ut to fail (#24853)
5 years ago
Zhang Ting 7d0cbfd045
fix negative framework overhead in Profiling Report (#24850)
5 years ago
Yanghello aa47356b74
Add crypto python (#24836)
5 years ago
Leo Chen b67ded04f2
Support gradient accumulation of fp16 in imperative mode (#24823)
5 years ago
Wilber 1e190a9e02
[Inference] [unittest] Inference unit tests rely on dynamic libraries (2) (#24859)
5 years ago
Chen Weihang 0aed095188
The third time to simplify the C ++ error stack (#24831)
5 years ago
Qi Li 704cad6a66
Add histc op (#24562)
5 years ago
Qi Li a125697e2c
fix ut test_recognize_digits (#24794)
5 years ago
Yi Liu 12bffdc086
Enhance error message of checkpoint_notify_op, fake_init_op gen_nccl_id_op and listen_and_serv_op (#24554)
5 years ago
Wojciech Uss 78d4f0cc91
add option to exclude ops by id from quantization (#24689)
5 years ago
Yanghello 62b4ff7dd2
Aes_cipher_test and cipher_utils_test failed fixed (#24816)
5 years ago
Adam b490e41c1d
Add isCached() mechanism for BatchNorm and LRN oneDNN operators (#24798)
5 years ago
Wilber f8e370ac7f
[Inference] [unittest] Inference unit tests rely on dynamic libraries (#24743)
5 years ago
silingtong123 126d3d693b
support C++ inference shared library on windows (#24672)
5 years ago
Zhen Wang 23d253e1be
Fix out of range error for outs map. test=develop (#24774)
5 years ago
Zhou Wei d1047d0a69
add WITH_GPU for cudaerror download (#24056)
5 years ago
Aurelius84 a7e21cbed3
Move input_size check into RunTime phrase of gru_unit_op and refine error message (#24776)
5 years ago
Zhou Wei f66594a558
fix bug that diag API can't use on Windows(#24762)
5 years ago
Leo Chen c0911fdd32
rename inplace/no_need_buffer inferer, part4, test=develop (#24781)
5 years ago
Chen Weihang be82de4c79
polish two error message, test=develop (#24778)
5 years ago
Yanghello 5a7a517cde
Add crypto api (#24694)
5 years ago
Leo Chen b0e7439fbc
rename inplace/no_need_buffer inferer, part2, test=develop (#24733)
5 years ago
Leo Chen a6fbba65ff
rename inplace/no_need_buffer inferer, part3, test=develop (#24734)
5 years ago
wangchaochaohu 355caee18b
fix conv_transpose Op fp16 error test=develop (#24695)
5 years ago
Chen Weihang 19e5f7879c
Append error op hint for GradOpMaker (#24750)
5 years ago
Pei Yang 181b1f5a30
adjust minimum trt version for hard_sigmoid converter to 5130. test=develop (#24746)
5 years ago
wangchaochaohu 79caed6667
fix the print error of PE record_event and framework overhead in profiler test=develop (#24744)
5 years ago
Adam 56a714a19b
Add isCached() machinism to oneDNN pooling primitive (#24724)
5 years ago
Aurelius84 6db0e2b1a9
[Dy2stat] Support len syntax (#24638)
5 years ago
ShenLiang 950892044f
fix conflict, test=develop (#24238)
5 years ago
lidanqing c3c61d34c1
Update PADDLE_ENFORCE in DNNL related ops (#24333)
5 years ago
Wilber dbe2497768
fix mkldnn concat bug. test=develop (#24722)
5 years ago
Leo Chen de8b4f42cf
rename inplace/no_need_buffer inferer, part 1, test=develop (#24711)
5 years ago
hutuxian e6b87b3193
Support AucRunner in PaddleBox (#22884)
5 years ago
Chen Weihang 7fa9f16c17
Polish reader folder error message (#24698)
5 years ago
wangchaochaohu dbfe5333c5
Add pe profiler Event (#24611)
5 years ago
Liufang Sang 55b664a131
remove pow to speed up in dequantize_log op (#24607)
5 years ago
Jacek Czaja ca68b13f85
[oneDNN] Fix to elementwise_add grad (#24639)
5 years ago
Adam 586b587519
Add isCached() check in Softmax handler (#24637)
5 years ago
Leo Chen 9c9e635c00
support tensor to varbase, test=develop (#24660)
5 years ago
zlsh80826 fdbe114b12
[Paddle-TRT] use float constant instead of double test=develop (#24544)
5 years ago
Leo Chen 14dd6388c5
fix bug of varbase.__getitem__, test=develop (#24642)
5 years ago
jinyuKING 6ca44cbaa3
API/OP(sequence_first_step, sequence_last_step, sequence_mask, beam_search, beam_search_decode) error message enhancement (#24590)
5 years ago
Wilber ba2f8f0ce4
fix embedding_eltwise_layernorm_fuse_pass. test=develop (#24592)
5 years ago
Huihuang Zheng f831f61216
Make buddy_allocator_test Exclusive (#24636)
5 years ago
hutuxian 0ec3a42e97
Random Dump (#24477)
5 years ago
Leo Chen 1d03469685
use vector instead of pointer, test=develop (#24620)
5 years ago
Yiqun Liu 6b464f969a
Add an operator node in unittest to make the fusing result unique. (#24617)
5 years ago
ShenLiang 837dd47a96
Add lod in gather/scatter (#24613)
5 years ago
Jacek Czaja 3292f0ef58
[onednn] elementwise add broadcasting support (#24594)
5 years ago
Yiqun Liu 560c815390
Add some check for CUDA Driver API and NVRTC (#22719)
5 years ago
Guo Sheng fe8d006f7e
API/OP(sequence_expand_as) error message enhancement (#23712)
5 years ago
Leo Chen d980d251f0
specify outs, test=develop (#24537)
5 years ago
yaoxuefeng 16817c70fc
OP(datanorm lookupsparsetable lookuptable) error message enhancement (#24506)
5 years ago
liu zhengxi fff9faae72
API(dynamic_gru, chunk_eval, BeamSearchDecoder) error message enhancement (#24513)
5 years ago
wangchaochaohu c4dd596de6
cusum op optimization for GPU kernel (#24321)
5 years ago
Leo Chen d43e4047ff
refine err msg, test=develop (#24550)
5 years ago
liym27 615a8bfcf4
Support int32 int64 and fix bug (#24407)
5 years ago
Zhaolong Xing f68d4fb3f1
fix bert bug using trt6 when compile with CUDA_ARCH_NAME=All (#24517)
5 years ago
lilong12 a13a4dbc7b
Improving error reporting messages for ops (#24438)
5 years ago
123malin 45ef6ff338
test=develop, error info improvement (#24496)
5 years ago
Adam dcf17f4813
Add isCached() mechanism to elementwise_add DNNL (#24563)
5 years ago
pawelpiotrowicz db2b6b6568
Hide globals & redesign restore PR (#24279)
5 years ago
Jiawei Wang 4a105f803e
SamplingID Op fix error print (#24521)
5 years ago
ShenLiang 53e3c53423
fix error message, test=develop (#24425)
5 years ago
ruri ea2c498712
Fix err message (#24507)
5 years ago
Wilber 4ec7287602
fix compile when with_nccl=off. test=develop (#24444)
5 years ago
chajchaj e5a624935f
Enhance error message of cross_entropy_op, sigmoid_cross_entropy_with_logits_op (#24485)
5 years ago
FDInSky 526a211774
update conv error info (#24430)
5 years ago
Yibing Liu a9520db24e
Format error message for ops (#24482)
5 years ago
Yang Zhang 2644cb819d
Remove extraneous comma in error messages (#24478)
5 years ago
suytingwan 05c9642dc3
Update paddle enforce message (#24498)
5 years ago
lijianshe02 9f83f0fe69
API/OP (group_norm, layer_norm, random_crop, unpool) error message enhancement (#24413)
5 years ago
hutuxian 7f85dfeb18
Upgrade Error Message for AucOP & MultiplexOP (#24458)
5 years ago
WangXi 027f9953bc
Optimize error message, include dgc, nccl, size op (#24456)
5 years ago
gongweibao f1c57d648c
Enhance error message of prefetch_op, proximal_adagrad_op, proximal_gd_op (#24436)
5 years ago
danleifeng d98e1182a1
fix check and error message for flatten hash is_empty op (#24434)
5 years ago
liu zhengxi 417b576c34
API(dynamic_lstm, dynamic_lstmp) error message enhancement (#24450)
5 years ago
wangchaochaohu 53bdee64e6
add tensor support for gaussian_random_op test=develop (#24389)
5 years ago
qingqing01 da4a1db7bb
Refine error message in some OPs (#24443)
5 years ago
Xing Wu 479c47f316
fix error info for transpose sequence_conv_pool max_sequence_len sequ… (#24437)
5 years ago
WuHaobo 1c00732dd0
Polish PADDLE_ENFORCE of unfold_op (#24423)
5 years ago
MRXLT 71ff32b65d
update error message for unstack op and lamb op; test=develop (#24439)
5 years ago
ForFishes 05c3bc3bf6
fix error message, test=develop (#24447)
5 years ago
ceci3 633aaea07e
Update err message (#24422)
5 years ago
Chen Weihang 05d20e5793
API/OP (Some SL API) error message enhancement (#24441)
5 years ago
tangwei12 a97d5a6153
fix op error, test=develop (#24451)
5 years ago
Yang Zhang 7c17ed57e0
Refine `squared_l2_distance_grad` error message (#24409)
5 years ago
huangjun12 046b7ebc03
refine Error message of modified_huber_loss (#24417)
5 years ago
zhang wenhui 621a4085b9
enhance cvm bpr_loss adam adagrad adamax ftrl error message, test=develop (#24452)
5 years ago
Li Fuchen e58619295e
OP(rank_loss, similarity_focus, squeeze) error message enhancement (#24448)
5 years ago
wangguanzhong abb49df26f
Enhance yolo_box & yolov3_loss (#24370)
5 years ago
mapingshuo d1bb76a2f7
fix error log, test=develop (#24419)
5 years ago
Jacek Czaja 8b88cd5167
[oneDNN] Fix to inplace pass (#24442)
5 years ago
Bai Yifan f1c4c14c7a
Fix fsp_op error message,test=develop (#24405)
5 years ago
Jacek Czaja d0307145a3
[oneDNN] disabling oneDNN inplace pass (#24406)
5 years ago
Kaipeng Deng 6435d1b4b8
fix PADDLE_THROW in spectral_norm_op.h. test=develop (#24414)
5 years ago
wangguanzhong cd327e66df
optimize error message, test=develop (#24420)
5 years ago
FDInSky 7fedf26b87
add linear interpolate operator (#23357)
5 years ago
Liufang Sang e24575c8ba
fix error message in accuracy_op.h test=develop (#24416)
5 years ago
wawltor 2de5075a6d
Fix the elementwise ops in broadcast in the process of backward (#24319)
5 years ago
Wojciech Uss fddf442470
add batch size to the mkldnn matmul cache key (#24408)
5 years ago
Tao Luo 72c370c8d2
remove unused test_multi_thread_helper.h (#24399)
5 years ago
hutuxian 123255cf9f
change InitializeGPU to InitializeGPUAndLoadModel (#24377)
5 years ago
Tao Luo 63da846de0
remove old inference C++ tests (#24368)
5 years ago
Chen Weihang aa0f254fbe
Add macro BOOST_GET to enrich the error information of boost :: get (#24175)
5 years ago