Commit Graph

5691 Commits (8f2656ef5ca4ab16f06d94b8ca9392d3f0f760ae)

Author SHA1 Message Date
lilong12 bf461fa524
Improving error report message for sequence_expand op (#27245)
5 years ago
Zhong Hui bbad3414e8
Enhance the error messages for files in operators/math
5 years ago
Pei Yang aae41c6fca
refine error message related to paddle-TRT (#27256)
5 years ago
Zhen Wang d708b21074
Update amp_check_finite_and_scale_op and add an updating_loss_scaling op for static graph amp training. (#26240)
5 years ago
Adam cc3f4b813a
Add int8 GRU kernel (#27220)
5 years ago
Jack Zhou 9437ce36c4
Error description optimize for math dir
5 years ago
Zhang Ting 5c1bafbbc6
use eval to improve performance, test=develop (#25459)
5 years ago
lidanqing 5c4eed66fd
Fix GRU mkldnn kernel fail on look_table_v2 (#27198)
5 years ago
Chen Weihang 33ff833af2
fix loaded no params layer run error (#27241)
5 years ago
Wilber 1b84c0bf43
Lite subgraph refine predictor (#27167)
5 years ago
furnace 2e59769612
add empty op (c++, python, unit test) (#26659)
5 years ago
lilong12 c5f957ae38
add double grad for tile op and expand_v2 op (#27114)
5 years ago
lilong12 58a88ba9af
add double grad for expand (#27183)
5 years ago
Qi Li 7c7fbd3218
fix error msg of fused_embedding_fc_lstm_op, test=develop (#27231)
5 years ago
Jacek Czaja e005861598
[oneDNN]Introducing oneDNN 1.6 (#27137)
5 years ago
ShenLiang 5bd84b22c4
revert divide (#27202)
5 years ago
wawltor fde5cfe881
fix the CudaPinMemory bug for the equal op (#27176)
5 years ago
zhupengyang cc3306f7c8
restruct logsumexp to speed up compiling (#27191)
5 years ago
Steffy-zxf 50e60e8779
update error info for selected_rows_functor
5 years ago
JZ-LIANG 5d039f4086
modified the implement of Lars optimizer (#26733)
5 years ago
wangchaochaohu c71d79b1d2
[cuda11 support] change the CMakeLists to support the cuda11 (#27124)
5 years ago
Qinghe JING 43b0445b29
Add double grad in reduce sum (#27115)
5 years ago
kinghuin ed292695c5
optimize the error message for math dir
5 years ago
yongqiangma 4558d395e9
fix Norm op error (#26771)
5 years ago
LielinJiang 4d7d661249
Fix kl and summary bug (#27132)
5 years ago
whs eb01976037
[2.0 API]Add checker in grid_sample_grad op (#27126)
5 years ago
wangguanzhong a28ae86e11
Enhance ops to support LoD as input for dygraph detection models. (#25316)
5 years ago
LielinJiang 8df5b4d608
Add correlation api to contrib (#27015)
5 years ago
kinghuin 1b102dd552
optimize the error message for unpooling.cc
5 years ago
xiaoting 58f3ef982a
fix typo for interp_v2,test=develop (#26843)
5 years ago
wangchaochaohu 5af81f833c
fix gpu kernel for numel Op (#27085)
5 years ago
zhupengyang 19ca6d9dd2
add .part to speed up compile (#27044)
5 years ago
GaoWei8 4ff16eb201
Add padding cudnn interface (#26370)
5 years ago
wawltor 8857e3911f
add the dynamic dtype check for the argmin/argma
5 years ago
wangchaochaohu 041f4ab842
refine linspace Op for dtype setting(#27071)
5 years ago
yaoxuefeng 9aa39584fe
fix cuda generator hard-coded offset step (#27027)
5 years ago
Jacek Czaja f6653c71e9
[oneDNN] Fix to conv2d grad with groups (#27006)
5 years ago
Chengmo a72752263b
support heter-xpu-ps (#27018)
5 years ago
whs 2660ea379d
Fix cuda kernel of affine grid (#27003)
5 years ago
ShenLiang ff3dc8ac73
fix the remainder (#26995)
5 years ago
yaoxuefeng 7f3e6ca596
add cuda generator (#26786)
5 years ago
joanna.wozna.intel 95e1434bb2
Add bfloat16 data type (#25402)
5 years ago
Yang Zhang 29b844ad5e
Fix clip op attr (#26924)
5 years ago
huangjun12 e480168fae
fix dropout bug in backward when input is 1d tensor (#26837)
5 years ago
Jacek Czaja 5e874cc333
- Cosmetic fixes to align with PADDLE_ENFORCE guidelines (#26891)
5 years ago
Thunderbrook 5205748481
fix eigen in push sparse; fix hadoop command (#26872)
5 years ago
wawltor 0a29fc85d6
fix the argmin,argmax op for the paddlepaddle 2.0
5 years ago
Chengmo d0962abd20
supplement bug fix of parameter server (#26217)
5 years ago
Leo Chen 60ffc22026
Refine bernoulli and unsqueeze op (#26842)
5 years ago
tangwei12 ebc5f99789
add embedding 2.0 (#26649)
5 years ago
hong19860320 40378edfa8
Add the AddCheckpoint macro to softplus op (#26809)
5 years ago
GaoWei8 11fb8a1c10
Refine cudnn softmax (#25757)
5 years ago
wawltor 7ee70a47b8
update the doc for the some ops
5 years ago
zhupengyang 0f1ad9b06c
leaky_relu and hardshrink add checkpoint for behavior changed (#26802)
5 years ago
Chengmo 7f2aa2db3c
【paddle.fleet】Support Heter Parameter Server (#25998)
5 years ago
Jiawei Wang a1b99fae07
Adadelta Optimizer (#26590)
5 years ago
LielinJiang 346689c6f1
Register conv_transpose Op version for compatible Op upgrades (#26745)
5 years ago
Wojciech Uss 7afb1df11e
Decouple weights and bias from fc primitive in MKLDNN cache (#26708)
5 years ago
Leo Chen 844583c8fd
Refine paddle.manual_seed (#26496)
5 years ago
ShenLiang 29494d703d
fix remainder, floor_div (#26732)
5 years ago
lilong12 5f524efe56
modify error report message, test=develop (#26743)
5 years ago
wangchaochaohu 4561fc37e2
Add check point for gather Op (#26696)
5 years ago
LutaoChu 1ec30cb160
register cumsum Op version for compatible Op upgrades (#26734)
5 years ago
Jack Zhou c282db3a93
add broadcast feature for elementwise logical op
5 years ago
Yang Zhang 63eef7632e
Fix clip input check (#26683)
5 years ago
joejiong f311d3c1cf
Fix pow api type error with python side method, merge elementwise_pow and pow. (#26163)
5 years ago
yongqiangma e4cc6a28b0
Norm op support 2-axis (#26492)
5 years ago
xiaoting 89d7d86684
add intepolte_v2 (#26520)
5 years ago
Zhang Ting 97cebfa4d3
add dtype for unique (#26655)
5 years ago
lilong12 1c68138327
[api 2.0] add collective op for cpu using gloo and paddle.distributed.* apis (#26552)
5 years ago
joanna.wozna.intel 559e43eee4
Small change in conv2d and quantize pass (#26671)
5 years ago
Bai Yifan 8986a82131
fix adaptive gpu grad bug, add doc refine (#26660)
5 years ago
wawltor 286eca2d9e
update the code for the topk v2
5 years ago
whs f82384113b
Fix atomicAdd in grid sample op and affine grid op (#26647)
5 years ago
Wilber 32ba8602c6
Enhance py_func error info message. (#26557)
5 years ago
Zhang Ting 0a895bc0df
improve unique op (#26537)
5 years ago
whs a004dfde3d
Use atomicAdd defined in paddle framework (#26631)
5 years ago
zhupengyang c80fcf901e
reduce_mean error if keepdim=True and reduce_all=True (#26614)
5 years ago
whs a065a24232
【2.0 API】Enhance affine grid operator (#26385)
5 years ago
Qi Li 6f69fbc8ea
fix elu grad when alpha less than zero, test=develop (#26543)
5 years ago
whs 786373ba29
Use atomicAdd defined in paddle framework (#26628)
5 years ago
ruri 1f82c0cd62
[Api2.0] add pixel shuffle (#26071)
5 years ago
whs 79539cf198
【2.0 API】Add CUDA kernel and enhance options for grid_sample (#26576)
5 years ago
Guanghua Yu 8645591d66
support fp64 in huber_loss cuda kernel (#26583)
5 years ago
yaoxuefeng efee426742
support generator seed in related kernels test=develop (#26495)
5 years ago
Zhong Hui bf4a4636f1
change to use bce_loss op, add shape check for bce_loss
5 years ago
ShenLiang 0e81626081
add div, floor_div, remainder (#26562)
5 years ago
qingqing01 24566e951c
Support empty bbox in bipartite math op (#26488)
5 years ago
Jack Zhou 199b0c7c1b
Add isfinite v2 op (#26344)
5 years ago
wangchaochaohu ebf9b2125e
add paddle.gather for API2.0 (#26455)
5 years ago
wangchaochaohu 9219b79104
gather_nd Op for API 2.0 refine (#26540)
5 years ago
zhupengyang 9b14117cac
logsumexp: impl kernel, refine docs (#26307)
5 years ago
Wojciech Uss 5c2b9258a6
Fix (de/re)quantize cache keys (#26549)
5 years ago
wawltor 6b28456ed0
add the argmax, argmin for the api2.0
5 years ago
LielinJiang d26ae9ad87
Update conv_transpose api (#26427)
5 years ago
lilong12 faa9b97b78
fix cscatter, test=develop (#26554)
5 years ago
WangXi 45711dade7
【API】rename div to divide, add floor_divide, remainder (#26434)
5 years ago
LutaoChu 4e0c6d91aa
add paddle.tensor.linalg.diag API, diag_v2 OP and CUDA kernel
5 years ago
zhupengyang f8863e0603
leaky_relu and LeakyReLU: alpha->negative_slope (#26216)
5 years ago
ShenLiang c609066074
Add Matmul op (#26411)
5 years ago
Leo Chen aa2a9b5d89
add bernoulli op (#26511)
5 years ago
Adam f3909020de
Add mechanism for blocking oneDNN cache clearing (#26502)
5 years ago
ShenLiang b6eb37f5b3
add error message for cholesky (#26444)
5 years ago
QingshuChen 138ecf24aa
support Baidu Kunlun AI Accelerator (#25959)
5 years ago
yaoxuefeng 4f259354d2
mod cvm test=develop (#25146)
5 years ago
wangchaochaohu e167e87974
【API2.0】add masked_select Op for API2.0 (#26374)
5 years ago
zhupengyang 6e5670b8bd
mean: not support int32, int64; add check for axis (#26401)
5 years ago
zhupengyang 4ad504e7c7
hardshrink: support threshold < 0 (#26403)
5 years ago
lilong12 e92f770c42
Add collective ops (reduce) (#26340)
5 years ago
wangchaochaohu bdb805505e
【API2.0】add numel API for paddle test=develop (#26311)
5 years ago
wangchaochaohu 2073ffc04d
Enhance the data type of linspace API (#26247)
5 years ago
hong19860320 40d193ed17
Add the ReLU6, Tanhshrink, SELU, Softplus, Softshrink and Softsign for the api 2.0 (#26376)
5 years ago
Zhaolong Xing f00f982a02
add cub impl for arg max, min (#25941)
5 years ago
Zhang Ting 6914a12f82
rename the inputs of allclose (#26360)
5 years ago
littletomatodonkey bcf03273f6
add pad func (#26106)
5 years ago
Chengmo eeeef957c7
Fix ps gpu (#26218)
5 years ago
Zhong Hui 6cbeafb6c0
add zero norm, inf norm support for p_norm op (#26364)
5 years ago
GaoWei8 1fbee267d4
remove scope in cudnn lstm (#25188)
5 years ago
cc 3f816bc8b4
[Quantization] Conv2d_transpose and mul support channelwise quantization (#25639)
5 years ago
lilong12 638bbb6153
Improve expand as (#26290)
5 years ago
zhupengyang 586a6dd358
log_softmax and LogSoftmax: impl kernel and refine docs (#26088)
5 years ago
yaoxuefeng 23261ff44b
add cpu random Generator (#26013)
5 years ago
Sylwester Fraczek 69742bd9a4
Enable mkldnn layout conversion (#25778)
5 years ago
Jack Zhou 6d22f5c73e
Add PADDLE_ENFORCE in nll loss cuda kernel (#26294)
5 years ago
lilong12 241b44db14
[API 2.0] adaptive expand op to use shape instead of expand_times (#26206)
5 years ago
lilong12 fbd4d3cc97
[API 2.0] add paddle.tile op (#26245)
5 years ago
Yang Zhang a2d3e5c03b
Fix `paddle.abs` docstring (#25942)
5 years ago
Yang Zhang 22165934bc
Fix `paddle.acos` docstring (#25958)
5 years ago
Yang Zhang a5b5b00e02
Fix `paddle.asin` docstring (#25967)
5 years ago
Yang Zhang c758765769
Fix `paddle.atan` docstring (#25968)
5 years ago
Yang Zhang c4e480efc5
Fix `paddle.cos` docstring (#25969)
5 years ago
wawltor 2d6cc0b125
support the tuple for attribute of axis in min, max for api2.0
5 years ago
Leo Chen ffe52b4452
[OpDevOptimize] Add common infershape functions (#26096)
5 years ago
Leo Chen 2d95280e1f
Feature/Enable Auto-Mixed-Precision in dynamic graph (#24903)
5 years ago
wawltor 9c17b3c9f8
Add the max, min, maximum, minimum api for the API 2.0
5 years ago
Yiqun Liu 1be6bf45ae
Add assign to fusion_group and enhance inplace execution in fusion_group. (#26121)
5 years ago
lilong12 8caee2ad51
【paddle.fleet】add the support for multi-node training for pipeline (#25907)
5 years ago
LutaoChu bf2db646de
fix cumsum op for API 2.0, optimize performance
5 years ago
Adam 1893cd6bb8
Add oneDNN relu6 op (#26037)
5 years ago
Zhaolong Xing 50f149a48e
fix cudnn workspace size problem during inference. (#26021)
5 years ago
Chen Weihang 3c8daa9b89
Add pin memory control for BufferedReader (#26026)
5 years ago
Chen Weihang ad4a0466a5
Add cuda pinned place branch in slice op GetExpectedKernelType (#26027)
5 years ago
Feiyu Chan e853ece0a2
update document template for unary elementwise layers (#25896)
5 years ago
joanna.wozna.intel 734cf1c3e9
Change use_quantizer attribute name and data type (#25838)
5 years ago
Leo Chen 5258d53d65
refine unsqueeze, test=develop (#25470)
5 years ago
yaoxuefeng 224620071b
add new flatten op test=develop (#25393)
5 years ago
Adam 68c6160e63
Add oneDNN fusion_gru kernel (#25594)
5 years ago
Zhong Hui dca56f47f5
fix invalid read of pnorm gradient function
5 years ago
Zhaolong Xing 358bc06c72
[CUDNN8 support] : support CUDNN8 (#25664)
5 years ago
Zhaolong Xing 5970871a64
add eltwise clip cuda impl. (#25689)
5 years ago
Pei Yang b717895f64
Fix registering trt plugin (#25744)
5 years ago
wawltor a697e94693
Update the code of the compare ops for the broadcast function
5 years ago
wangchaochaohu ff717d5158
Add support for tuple of concat Op test=develop (#25800)
5 years ago
tangwei12 253fd407e8
Fix/distibuted heart beat (#25902)
5 years ago
xujiaqi01 d11c140e28
fix dump, fix cvm check (#25400)
5 years ago
Zhang Ting 6486fe8a94
improve GPU performance of transpose, test=develop (#25862)
5 years ago
Zhang Ting 2d24f56a7a
avoid data transfer, test=develop (#25810)
5 years ago
ShenLiang bca303165a
fix inverse bug (#25641)
5 years ago
Aurelius84 e52dae6ef6
Using input.place() in GetExpectedKernel in slice_op (#25595)
5 years ago
wawltor 595a719795
Update the api for the compare_ops
5 years ago
wangchaochaohu 32b9577b2a
refine the split op for API 2.0 test=develop (#25320)
5 years ago
lilong12 ce506930c3
Fix the bug that Input(Offsets) and attr(offsets) cannot be set at the same time. (#24975)
5 years ago
tangwei12 2d9dbd31ad
Fix/mkl dnn (#25835)
5 years ago
tangwei12 caa90a6510
Integrated Trainer of Parameter Server (API add `fluid.contrib.layers.sparse_embedding` only) (#22957)
5 years ago
cc 42189be67b
[Quant] Remove the output for moving_average_abs_max_scale op (#25697)
5 years ago
Chen Weihang 23d1228c4d
remove ProgramTranslator.save_inference_model (#25740)
5 years ago
Chen Weihang 1b3081b1b4
Simplify BufferedReader to improve DataLoader performance (#25648)
5 years ago
Zhou Wei e0a9115e28
fix random compile failure due to missing file (#25661)
5 years ago
Sylwester Fraczek 1aaa26f102
add dnnl sigmoid (logistic) activation (#25745)
5 years ago
wangchaochaohu 1e4ab728fb
refine the concat Op for API 2.0 test=develop (#25307)
5 years ago
Adam 98899b73d2
Fix FC + GRU fuse pass (#25687)
5 years ago
Leo Chen 4ec1251a1e
Refine squeeze, test=develop (#25281)
5 years ago
joanna.wozna.intel e5bbffa84c
Add NOMINMAX define due to windows.h max/min macro conflict (#25637)
5 years ago
cnn 70cee22fde
New features, add sinh and cosh op, test=develop (#25495)
5 years ago
Zhang Ting a1350744eb
register fp16 kernel, test=develop (#25630)
5 years ago
mapingshuo 5453a912fe
add fp64 support in sequence_pool, test=develop (#25662)
5 years ago
GaoWei8 6e86fd3750
fix concat dimension (#25606)
5 years ago
donproc 95fa383df2
optimize embedding cuda kernel lookup_table_v2,test=develop (#25587)
5 years ago
石晓伟 7206417259
supports xpu runtime, test=develop (#25554)
5 years ago
Zhang Ting 30d1ff3bb4
call cublasGemmStridedBatchedEx when using fp16, test=develop (#25553)
5 years ago
Aurelius84 ca1185d06b
[Dy2Stat] Fix scope in run_program_op (#25579)
5 years ago
Jacek Czaja 7dbc441eab
[oneDNN] cache cosmetics improvement (#25576)
5 years ago
hong e362095e45
fix softmax with cross entropy out of bound; test=develop (#25549)
5 years ago
Huihuang Zheng d8fe517bf8
Add Support for SelectedRows for Transpose OP and Fix a Bug That SelectedRows Cannot be Supported in SimNet (#25536)
5 years ago
Wilber 848aca7ae8
[CI] [Lite-Subgraph] CI add lite subgraph check. (#25346)
5 years ago
Shibo Tao 71c71e684c
fix logical_* ops' doc (#25479)
5 years ago
Aurelius84 4717bdbcfb
Fix hang in seq_topk_avg_pooling op (#25522)
5 years ago
LielinJiang 7129f544f0
Add bilateral_slice op (#25401)
5 years ago
Zhang Ting ca725c82f2
improve fp16 performance of slice_grad, test=develop (#25523)
5 years ago
yaoxuefeng 5d3766ff3d
modify flip test=develop (#25312)
5 years ago
Chen Weihang 41d2247275
[Dy2static] Refactor ProgramTranslator save_inference_model API (#24989)
5 years ago
yaoxuefeng aaa7cbd56f
modify trace api test=develop (#25397)
5 years ago
Huihuang Zheng f9ac5fb992
[Dy2stat] Fix Memory Optimization in run_program_op and Add SimNet as Unit Test (#25383)
5 years ago
yaoxuefeng c42d662e2a
modify roll test=develop (#25321)
5 years ago
Zhen Wang 548cdbc544
Quantization-aware training for dygraph (#24634)
5 years ago
Chen Weihang 0b54d54fd8
Fix index overflow bug of the CUDA kernel loop increment (#25435)
5 years ago
zlsh80826 e528392de9
[Paddle-TRT] SkipLayernorm vectorized memory optimization (#25117)
5 years ago
zhupengyang 5b573c58e2
randperm API: remove out, device, stop_gradient; add name (#25410)
5 years ago
Zhen Wang bb45af02ac
add the c++ part of Imperative QAT. test=develop (#25446)
5 years ago
Jacek Czaja 050a9bf79d
[oneDNN] LRN cleanup (#25416)
5 years ago
GaoWei8 1974aadcf0
fix concat shape error (#25414)
5 years ago
tangwei12 4b3778a3ee
Revert/barrier for sync (#25417)
5 years ago
ceci3 52be62c5ae
fix instance norm in dy (#24717)
5 years ago
zhupengyang eb3173e2b6
rand API: remove out, device, stop_gradient; add name (#25246)
5 years ago
zhupengyang 6de75082cb
fix test_hsigmoid windows ci (#25311)
5 years ago
WuHaobo f593c3fb2f
fix the formula of floor OP and ceil OP (#25292)
5 years ago
Zhang Ting bc7610583b
use eval() to improve CPU performance (#25243)
5 years ago
Kaipeng Deng 74468bf428
add mish op. (#24565)
5 years ago
Yang Zhang 6d6efafeeb
Add `matrix_nms_op` (#24400)
5 years ago
Chengmo e85fcaa712
Fix fluid.embedding in Distributed Training (#25174)
5 years ago
Yiqun Liu c00f827843
Avoid data transforming ShapeTensor from CPU to GPU in fill_constant op. (#25267)
5 years ago
123malin f1a9593d69
test=develop, bug fix for index_select and roll op (#25251)
5 years ago
FDInSky c2e072587c
test=develop fix generate_proposals's error (#25227)
5 years ago
Wilber 4c964abdf7
support build on arm. test=develop (#25212)
5 years ago
liym27 1458cc0c68
Fix bug: Don't check dims if contain_unknown_dim of cross_entropy_grad_op in compile time (#25221)
5 years ago
liu zhengxi 68e93d8a17
Fix beam_search InferShape (#25169)
5 years ago
Adam bd0b38e671
Refactor of conv fp32 oneDNN operator (#25137)
5 years ago
Shibo Tao 19c4db1b56
don't re-generate header file if content doesn't change (#25130)
5 years ago
Jacek Czaja a7944904d3
[oneDNN]elementwise_add and elementwise_mul int8 support (#24984)
5 years ago
Leo Chen fa657b3dbb
fix bug of prelu when rank not equal 4, test=develop (#25067)
5 years ago
zlsh80826 479c8834f7
[Paddle-TRT] Fixes #24731, opt for SoftmaxKernelWithEltadd kernel, test=develop (#24834)
5 years ago
Leo Chen 028de857d4
fix dtype error of compare op, test=develop (#25059)
5 years ago
tianshuo78520a 770c11a117
fix make device_context error (#25045)
5 years ago
tangwei12 be6a315fbd
Fix/sync barrier (#25016)
5 years ago
ceci3 8db66fc3f6
fix cos_sim, test=develop (#25017)
5 years ago
Zhang Ting 621b638550
improve performance of instance_norm, test=develop (#25005)
5 years ago
wangchaochaohu 613303dbf6
refine the slice Op to improve the performance of xlnet for fp16 training (#24967)
5 years ago
Chen Weihang d152d7231e
clear old var in scope, test=develop (#24976)
5 years ago
wawltor 0eb1b0bc01
Add support the 5d, 6d tensor support for the reduce ops
5 years ago
mapingshuo 24e24987f0
fixes the place info in the Print op (#24934)
5 years ago
Aurelius84 6be0ee159e
Support LoDTensorArray in reverse_op (#24797)
5 years ago
Leo Chen a7cb97a1a5
Fix/isfinite on windows (#24927)
5 years ago
whs 4c01d6d53e
Enhance checking in some operator. (#24473)
5 years ago
lilong12 6e10022781
add queue_generator_op, dequeue_op, enqueue_op and ut (#24481)
5 years ago
Leo Chen 1e818158f5
Feature/add amp_checkout_finite_and_scale op (#24875)
5 years ago
leesusu a6beb96dd0
FTRL with sparse update, test=develop (#22092)
5 years ago
Chen Weihang d1062d5278
Replace all errors thrown by LOG(FATAL) with PADDLE_THROW (#24759)
5 years ago
Michał Gallus 23a85f030c
Remove old mkldnn_elementwise_mul test (#24855)
5 years ago
Leo Chen b67ded04f2
Support gradient accumulation of fp16 in imperative mode (#24823)
5 years ago
Qi Li 704cad6a66
Add histc op (#24562)
5 years ago
Yi Liu 12bffdc086
Enhance error message of checkpoint_notify_op, fake_init_op gen_nccl_id_op and listen_and_serv_op (#24554)
5 years ago
Adam b490e41c1d
Add isCached() mechanism for BatchNorm and LRN oneDNN operators (#24798)
5 years ago
Aurelius84 a7e21cbed3
Move input_size check into RunTime phase of gru_unit_op and refine error message (#24776)
5 years ago
Zhou Wei f66594a558
fix bug that diag API can't use on Windows(#24762)
5 years ago
Leo Chen c0911fdd32
rename inplace/no_need_buffer inferer, part4, test=develop (#24781)
5 years ago
Chen Weihang be82de4c79
polish two error message, test=develop (#24778)
5 years ago
Leo Chen b0e7439fbc
rename inplace/no_need_buffer inferer, part2, test=develop (#24733)
5 years ago
Leo Chen a6fbba65ff
rename inplace/no_need_buffer inferer, part3, test=develop (#24734)
5 years ago
wangchaochaohu 355caee18b
fix conv_transpose Op fp16 error test=develop (#24695)
5 years ago
Adam 56a714a19b
Add isCached() mechanism to oneDNN pooling primitive (#24724)
5 years ago