Commit Graph

5199 Commits (05c3bc3bf616731a2da15747b8da9cb8064e39b9)

Author SHA1 Message Date
Zhaolong Xing 430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. (#23138)
5 years ago
liym27 6af480ca33
Support int64 for op assign_value. test=develop (#23179)
5 years ago
Zeng Jinle 53e6f8e1da
rename macro, test=develop (#23161)
5 years ago
Zeng Jinle bba740710d
add cuda resource pool for BufferedReader, test=develop (#23152)
5 years ago
Zeng Jinle 7d8d50b6cc
rename no_need_buffer_vars macro, test=develop (#23160)
5 years ago
Liufang Sang a486a739e1
fix compile error in win gpu (#23196)
5 years ago
Zeng Jinle 7ca77a90ac
add Tensor::IsSharedBufferWith method, test=develop (#23175)
5 years ago
Zeng Jinle b8886bf122
rename no_need_buffer_vars_macro, test=develop (#23159)
5 years ago
wangchaochaohu b721e23b25
transpose cudnn using cudnn v7 api (#19738)
5 years ago
Adam 4f5e4540f8
Improve SGD jit code to work with large data (#23120)
5 years ago
Liufang Sang 4db031902d
add dequantize_log_op and make pyramid hash support int8 weight (#22548)
5 years ago
Zeng Jinle 9258e96094
fix read op comments, test=develop, test=document_fix (#23122)
5 years ago
Zeng Jinle acfc9b8a70
Reader sequential and inference partial feed (#22699)
5 years ago
Wilber 95b356a069
update embedding_eltwise_layernorm fuse and kernel. test=develop (#23114)
5 years ago
Zeng Jinle a31d7328b7
Add dygraph double grad implementation (#22939)
5 years ago
songyouwei 2e2da7124b
high-performance dygraph slice (#22879)
5 years ago
Sylwester Fraczek abee05a8c8
added mkldnn swish activation (#23041)
5 years ago
Zhaolong Xing 8c6fde9e69
fix align error (#23090)
5 years ago
Liufang Sang 915b892a15
Fix div zero in fake quantize op (#22966)
5 years ago
Feiyu Chan 01ab8a0619
add approximation for gelu, test=develop (#22961)
5 years ago
Adam 5842ae6785
Revert "Change ShareDataWith() to TensorCopy() in conv_mkldnn (#22695)" (#22985)
5 years ago
GaoWei8 1dc1f9270e
Fix lod error of concat op for axis = 0 (#22538)
5 years ago
Zhang Ting 714b0076b6
Override GetKernelTypeForVar to avoid device transform, test=develop (#23032)
5 years ago
wangchaochaohu 112e3edbf6
fix the conv group problem test=develop (#23025)
5 years ago
wangchaochaohu 3757e0687c
Add Unittest for backward of fusion group (#22932)
5 years ago
chengjuntao 63f3ada7b9
fix bug which input shape (#22965)
5 years ago
wangchaochaohu f0d193a23c
Cast fusion for fusion group (#22876)
5 years ago
yaoxuefeng 29a7a52d38
Fix instag (#22632)
5 years ago
wawltor f154d5860f
Speed up the matmul op, use the gemm replace the batch gemm (#22926)
5 years ago
Adam 056edf3929
Change ShareDataWith() to TensorCopy() in conv_mkldnn (#22695)
5 years ago
Zhaolong Xing 8d6dc102fe
[Ernie GPU Optimize]: Embedding_eltwise_layernorm Fuse (#22494)
5 years ago
guofei 3d8571e884
modify assign op and add unittest of assign op (#22769)
5 years ago
Zeng Jinle d33c4343e1
Imperative tracer refactoring (#22457)
5 years ago
tangwei12 ad9c8f6d2d
fix communicator when break under pyreder mode (#22911)
5 years ago
mapingshuo 5ba9dfc16a
add lookup_table_dequant_op (#22900)
5 years ago
Zhaolong Xing dd67d44a50
[Paddle-TRT] : (Part1) Dynamic shape support (#22868)
5 years ago
tangwei12 07e13b84cd
remove vlog, test=develop (#22898)
5 years ago
Wilber f686310d81
fix concat_mkldnn op. test=develop (#22692)
5 years ago
Zhaolong Xing 1a533ed2de
[BUG]: Multihead matmul op's ouput size should be BxSx(N*H) (#22848)
5 years ago
石晓伟 ddb9b46fec
change the function in op_teller, test=develop (#22794)
5 years ago
tianshuo78520a 433cef03e5
fix typo word (#22784)
5 years ago
Kaipeng Deng ebc7ffc300
fix detection_map. test=develop (#22705)
5 years ago
zhaoyuchen2018 72dde4abde
Refine adam op to improve performance, test=develop (#22346)
5 years ago
wangguanzhong f2d1cd119a
fix lod level, test=develop (#22755)
5 years ago
FlyingQianMM 79d712346f
Correct CPU gradients of the argsort op (#22739)
5 years ago
guofei ae8b5f11a3
Change ShareDataWith() to TensorCopy() in ref_by_trainer_id (#22717)
5 years ago
chengjuntao 15c2667143
register fp16 for assign op (#22744)
5 years ago
dyning 1c0653462d
fix generate_mask_labels lod level (#22743)
5 years ago
GaoWei8 ba140222d6
fix compile&runtime lod_equality of lod_reset (#22737)
5 years ago
ShenLiang 3132681e8a
add partial_sum op in contrib (#22292)
5 years ago
ShenLiang e136661304
add partial_concat op in contrib (#22528)
5 years ago
tianshuo78520a d2ba91aad1
fix typo words (#22653)
5 years ago
Yibing Liu 6e7bfe30a6
register fp16 kernel for some ops (#22650) (#22696)
5 years ago
tangwei12 66a3150135
SYNC with communicaotor (#22344)
5 years ago
Yiqun Liu 22bbd54719
Add the support of fp16 in fusion_group (#22239)
5 years ago
Huihuang Zheng adfa5b8354
Add PADDLE_ENFORCE to Check Sequence Length of RecurrentOp (#22673)
5 years ago
lidanqing d926214535
[UT coverage] improve the mul_mkldnn_op line coverage (#22408)
5 years ago
Zhaolong Xing a06d75a280
[Paddle-TRT] Refine the error log about runtime batch and max_batch_size. (#22535)
5 years ago
Adam 608447bfd5
Update MKLDNN to v1.2 (#22521)
5 years ago
Adam ab610a34ff
transpose_mkldnn code change to meet Paddle standards (#22591)
5 years ago
Jiawei Wang 8f035fb637
Add TopK Op Grad CPU&GPU Kernel test=develop (#22628)
5 years ago
Steffy-zxf 90ee366653
update ops's unittest data type from float32 to float64 and shape over 100 (#22544)
5 years ago
Zhaolong Xing 8acd745c25
[Ernie GPU Optim]: Fuse three fc to multihtead matmul (#22486)
5 years ago
Guo Sheng 31b5464632
Add support for dynamic_decode(while) training. (#22231)
5 years ago
Wojciech Uss 4cddb43c5c
Add support for Ernie NLP model to the Slim QAT (#22506)
5 years ago
Double_V 58d99247f4
support slice double grad, test=develop (#22166)
5 years ago
hutuxian 1a7962be97
Paddlebox about box_wrapper (#22497)
5 years ago
huzhiqiang 9e29d3ebed
【OpPorting Example】DEMO OF FIX COMPILE&RUNTIME LOD_EQUALITY (#22460)
5 years ago
zhaoyuchen2018 54970444ce
Improve transpose performance with tile sm copy, test=develop (#22311)
5 years ago
Wilber a90fa54092
Compile without nccl deps. [1/2] (#22509)
5 years ago
Wilber de009152a7 Compile without nccl deps. [2/2] (#22484)
5 years ago
Yiqun Liu 4b2227e958 Fix dismatch of std::max's arguments type on windows. (#22507)
5 years ago
Wilber 870f465887 fix test_fusion_seqpool_concat lod level between compile and runtime (#22488)
5 years ago
Zhong Hui a61d09527b
Fix the integer overflow problem of sequence2batch (#22479)
5 years ago
cc 197913ebe1
Add weight quantization in post_training_quanzitaion (#22445)
5 years ago
Tao Luo 7c9ce097f1
refine reshape_op shape error message (#22480)
5 years ago
LielinJiang 2b1386b2b2
optimize performance of interpolate op (#22436)
5 years ago
Yiqun Liu 44b45b9f07
Correct the use of DeviceContext in unittest sequence_pooling_test and sequence_padding_test (#22456)
5 years ago
Wilber 7bc4b09500
add WITH_NCCL option for cmake. (#22384)
5 years ago
Tao Luo 943cb8c664
fix sigmoid cudnn bug (#22439)
5 years ago
石晓伟 e1b0d7cbb1
remove anakin from code, test=develop (#22420)
5 years ago
liu zhengxi 0404e7a985
Update the precision of pad, pad2d, pad_constant_like's unit tests from fp32 to fp64 (#22394)
5 years ago
Michał Gallus 269db0d1d1
[DNNL] Fix accuracy in INT8 FC (#22404)
5 years ago
joanna.wozna.intel fb3086fd57
[UT coverage]Remove unnecessary transpose op registration (#22402)
5 years ago
lidanqing ade5022681 [UT Coverage]Improve sum_mkldnn_op line coverage (#22275)
5 years ago
Wojciech Uss 92462e948d improve elementwise_add_mkldnn_op test code coverage (#22359)
5 years ago
ceci3 20f30dd604
add benchmark flag for conv_transpose (#22389)
5 years ago
Chengmo 8f36c39537
Fix GEO-SGD init & send Bug (#22375)
5 years ago
zhupengyang c6f888e5a5 update unittest accuracy to float64 for relu, prelu, maxout (#22273)
5 years ago
wangchaochaohu 0d8b222b79
Optimize the depthwise op test=develop (#22265)
5 years ago
qingqing01 2d20869c94 Fix infer_shape in compling for elementwise_op (#22291)
5 years ago
tangwei12 82bc814a57
integrated HALF_ASYNC to communicator (#21869)
5 years ago
wangchaochaohu 1e932eccfa
remove unused code test=develop (#22327)
5 years ago
Leo Chen 3e5744aa65
Remove unused inputs for some operators (#22284)
5 years ago
zhangchunle 805328e13b fix typo in error message (#22312)
5 years ago
lidanqing 895f8da7d6 change std::cout to log(INFO), vlog (#22316)
5 years ago
Bai Yifan faba4b116a
Remove disable flag in test_fsp_op.py (#22171)
5 years ago
Adam 9942d9ed5c Add caching mechanizm to requantize_mkldnn_op (#22223)
5 years ago
123malin 985bceac53
Bug fix for sparse recorder (#21969)
5 years ago
FlyingQianMM 443a713c9e
add backward gradient computation for op argsort (#22203)
5 years ago
Zhen Wang 46189b166d Add bn and relu fuse pass (#22048)
5 years ago
baojun 298ee7d28a Improve ngraph file line coverage (#22155)
5 years ago
zhongpu d0f0a2520c test Optimizer in dygraph (#21949)
5 years ago
石晓伟 ad0dfb17c1
[Feature] Lite subgraph (#22114)
5 years ago
zhaoyuchen2018 3d4f2aa689
Refine stack op to improve xlnet performance, test=develop (#22142)
5 years ago
liu zhengxi 64a4044292
add double register op_data_type of pad2d and fix compile error, test=develop (#22075)
5 years ago
Double_V 6ea3809143 Support prroi_pool_op with Tensor and LoDTensor rois (#20649)
5 years ago
zhaoyuchen2018 3dbd4087fe
Fix windows build not kernel issue, test=develop (#22105)
5 years ago
Chengmo 418abc92f4
Update pyramid related OP (#21372)
5 years ago
Feiyu Chan 14aebc7a95
add erf op (#21785)
5 years ago
Chen Weihang ba8414d3a5
replace CUDNN_ENFORCE with PADDLE_ENFORCE_CUDA_SUCCESS, test=develop (#22109)
5 years ago
Double_V fab4b0765a support elu_op double grad (#21822)
5 years ago
Pei Yang 0a51098a71
Add TRT support for BERT (#21135)
5 years ago
Jacek Czaja b0b27ff699 [MKL-DNN] Conv grad and Batch Norm grad NHWC support (#22088)
5 years ago
123malin 7fb817d447
add distributed_strategy (#21710)
5 years ago
Jacek Czaja ad8a9cb82c [MKL-DNN] Pool & LRN Grad Ops NHWC support (#21747)
5 years ago
Kaipeng Deng 34c57120eb polish cross_entropy ENFORCE (#22056)
5 years ago
SunAhong1993 7f4abaf2f5
register int/int64_t/float16 in pow/square kernel,test=develop (#22023)
5 years ago
Leo Chen 3f653c8323
register NoNeedBufferVarsInference for max_pool_grad_op, test=develop (#22055)
5 years ago
Yiqun Liu d48320777e
Add the first implememtation of fusion_group op (#19621)
5 years ago
Michał Gallus 6192108408 [DNNL] 3D Fully-Connected (#21746)
5 years ago
FDInSky aa2ed0dcc6 fix generate_proposal_labesl op (#21793)
5 years ago
ceci3 95d79b6d00
update error log for batch_norm_grad (#22017)
5 years ago
Aurelius84 c53b62eb8e
fix integer overflow in match_matrix (#22036)
5 years ago
wangchaochaohu 64baee4144
polish code test=develop (#22014)
5 years ago
danleifeng b7697f6218 fix broadcast bug;test=develop (#21898)
5 years ago
zhaoyuchen2018 8859ddd6cf
Refine multihead kernel, align block to 32 (#21961)
5 years ago
zhoushiyu cee2ccb078
add shuffle batch op (#21674)
5 years ago
mapingshuo c3e1954918
make reverse op support negative axis (#21925)
5 years ago
Aurelius84 10d6846900
Remove double registered dataType in Pad2d (#21942)
5 years ago
hutuxian 27decacb8a
fix aucop stat shape (#21846)
5 years ago
Aurelius84 5cb2c74127
add register op_data_type of pad/expand_as et.al (#21718)
5 years ago
hong 30d000f8c2
fix matmul error message; test=develop (#21885)
5 years ago
zhouwei25 a01663ca1f remove patch command and file of cares to Improved quality of Paddle Repo (#21776)
5 years ago
Aurelius84 51a86d2b6b Optimize adam speed (#21777)
5 years ago
FDInSky 6b9fbcf3ad Update iou_similarity op to support non-normalized bbox (#21671)
5 years ago
guofei 46f9184aff Modify the while_loop API (#21844)
5 years ago
Guo Sheng 7689b6aaa4
Fix default label dim of label_smooth_op. test=develop (#21862)
5 years ago
GaoWei8 d4dda8628e optimize fc jit (#21878)
5 years ago
Chen Weihang 2b941736f3 fix softmax_with_cross_entropy_fix bug, test=develop (#21810)
5 years ago
Chengmo a86f11b5f5
Speed GEO dense calc & communication (#21579)
5 years ago
Wojciech Uss 666c3bb9b0 handle multi-inputs with empty inputs for mkldnn_concat_op (#21827)
5 years ago
guofei 8b7c50f49a Make While Op could run on GPU place and add while_loop unittest (#21672)
5 years ago
WangXi 17299b8d21 fix batch_norm_grad infer shape=0 & add allreduce enforce shape, test=develop (#21801)
5 years ago
Huihuang Zheng 0677a1c1c1
Fix That conditional_block_op Doesn't Have InferShape (#21733)
5 years ago
zhaoyuchen2018 a5a8d14414
Fix softmax cuda bug (#21720)
6 years ago
Kaipeng Deng 943a44492b
yolo_box OP add Attr(clip_bbox). (#21620)
6 years ago
Leo Chen 7181afd75c
Fix elementwise_pow bug on CUDA place with integer (#21675)
6 years ago
Chen Weihang 1fd1f06f11 Rename paddle throw error macro (#21657)
6 years ago
joanna.wozna.intel d419b859c0 Add reshape int8 mkldnn op (#21428)
6 years ago
tangwei12 9ad940fdfe
memory leak for cpu (#21174)
6 years ago
GaoWei8 5af0c7ba89 Modify padding strategy: remove weight copy in fc padding (#21650)
6 years ago
wangchaochaohu 5eec8cf5af
fix the mean grad OP performance improvement test=develop (#21658)
6 years ago
Zeng Jinle 29f64c8c9e
refine some grad op makers, test=develop (#21629)
6 years ago
mapingshuo e2d849b989
Dropout with seed (#21590)
6 years ago
Adam e81f0228df MKL-DNN 1.0 Update (#20162)
6 years ago
wangchaochaohu 95b95a284b
Mean gpu optimize (#21643)
6 years ago
Zeng Jinle 0f8888360e
Polish op registry codes (#21561)
6 years ago
Aurelius84 3d9dee575e
Set lod_level of Out in compile time of sequence_pool_op (#21604)
6 years ago
Huihuang Zheng 1dcf6a7212
Add Much Complex Test and Fix Bugs for Control Flow cond API (#21532)
6 years ago
Jacek Czaja 8f5a93a07b - Fix to regression in performance of ResNet-50 training (#21588)
6 years ago
Jacek Czaja 9ce0e29dc3 [MKL-DNN] Batch norm mkl-dnn NHWC support (#21553)
6 years ago
Youwei Song cdba41af4d dygraph Embedding layer use lookuptable v2 (#21209)
6 years ago
wangchaochaohu 4c9b3dafa7
fill_constant_batch_size_like OP precious problem fix (#21337)
6 years ago
WangXi 768f9242e9 Fix dgc clip & rampup step, test=develop (#21491)
6 years ago
Zeng Jinle 3662fb71a7
remove eval() calls in Eigen, test=develop (#21498)
6 years ago
Jacek Czaja 18a5d30754 [MKL-DNN] Conv2d and Conv2d transpose MKL-DNN NHWC support (#21466)
6 years ago
Tao Luo 70eb397677
remove unused snappy/snappystream depends in distributed codes (#21484)
6 years ago
lilong12 0bc8bdf724
set dim[0] to -1 if dim[0] < 0 during compiling for c_allgather op (#21402)
6 years ago
tangwei12 0bddb951c2
fix async mode, test=develop (#21367)
6 years ago
Leo Chen b3090ad406
fix synchronization problem in softmax_with_cross_entropy_op, test=develop (#21480)
6 years ago
Tao Luo 01fa4ead61
fix -Wno-error=sign-compare warning in gcc8 (#21434)
6 years ago
Lv Mengsi 37f3e56dea
Fix transpose conv (#21406)
6 years ago
hutuxian 7e68bc896b
refactor AUC OP and add its CUDA Kernel (#21336)
6 years ago
wawltor dbbe6e9cb6
fix the device supported of the op unique and unique_with_counts. (#21395)
6 years ago
wangguanzhong 379e3febf2
fix shape check in density_prior_box, test=develop (#21414)
6 years ago
Adam 76b55da15a Fix bug in UpdatePadding for int64_t type (#21465)
6 years ago
Pei Yang 7b28d938bf
show shape diff in wrong trt input shape errmsg, test=develop (#21451)
6 years ago
Jie Fang 5e813b53c5 nhwc optimization for batchnorm (#21090)
6 years ago
Leo Chen e0c9d856fb
add unused input vars check for OpWithKernel, test=develop (#21169)
6 years ago
Chen Weihang 664f958a02
Fix optimizer op infershape failed in dygraph multi-cards mode (#21374)
6 years ago
Huihuang Zheng 630be31952
Fix Cond Bug for Nested Control Flow (#21340)
6 years ago
Jacek Czaja cd43c4440e [MKL-DNN] LRN and Pool2d (FWD) NHWC support (#21375)
6 years ago
Leo Chen add62acfd1
remove kDepXOut for abs_grad op, test=develop (#21407)
6 years ago
Adam 9107bf209f Add template version of UpdatePadding (#21426)
6 years ago
zhaoyuchen2018 b16274556a
Add dscending for argsort (#21400)
6 years ago
hong ac8546701d
Add dygraph execution context (#20157)
6 years ago
hutuxian a6b089c614
add macro to ban windows (#21422)
6 years ago
Kaipeng Deng ebfb720a63
add Adam beta1/beta2 support Variable (#21234)
6 years ago
Zeng Jinle 09696d5df8
Use system allocator in OpTest (#21335)
6 years ago
Kaipeng Deng 67c836fb5c
batch_norm momentum support variable (#21246)
6 years ago
ShenLiang e2c6f434ec Add Lod information for gather_nd & scatter_nd (#21404)
6 years ago
Tao Luo c0656dcb1a
remove -Wno-error=sign-compare, make warning as error (#21358)
6 years ago
Zeng Jinle b97fc16d21
fix lod_reset bug, test=develop (#21392)
6 years ago
Zeng Jinle 89966525f1
Polish reference count pass (#21324)
6 years ago
hutuxian 47a82e38e3
Support data_norm gpu kernel (#21325)
6 years ago
GaoWei8 8493f20ebc Polish the codes of fc when needs padding (#21378)
6 years ago
Michał Gallus 5d7d548275 INT8 Fully-connected (#17641)
6 years ago
Zeng Jinle b639a882c3 fix syn bn grad maker, test=develop, test=document_fix (#21317)
6 years ago
Youwei Song 4d0f5ab1a8 add axis check for concat op (#21288)
6 years ago
zhaoyuchen2018 afb134847d
Fix ernie python infer diff (#21311)
6 years ago
Lv Mengsi b6ce4f8b2f
Fix mistake of batch norm op (#21237)
6 years ago
lilong12 41d13209d7
add the framework support for distfc (#21197)
6 years ago
GaoWei8 234060f88f Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
6 years ago
Jacek Czaja f4cf028a8c [MKL-DNN] Error throwing for NHWC layout for MKL-DNN ops (#21207)
6 years ago
Michał Gallus ed9ceb9f98 Refactor MKL-DNN ElementwiseMul (#21061)
6 years ago
zhouwei25 345b67b5e2 remove warning LNK4006 and warning LNK4221 (#21226)
6 years ago
wangchaochaohu 6514f52e46
fix the fill_constant op precious problem (#21322)
6 years ago
zhaoyuchen2018 08c19c585d
Improve argsort performance. (#21267)
6 years ago
WangXi 8ac7687e36 Fix dgc accuracy by mv regularization to local (#21278)
6 years ago
Leo Zhao b19e1a1b56 use prefetch to load next mem into cache (#21206)
6 years ago
gongweibao ed2a185248
optimize nhwc for tensor core in ConvOp and ConvGradOp (#20597)
6 years ago
Yihua Xu 69dd5152cf Fix the crash issue when scale or bias was null-pointer. (#21284)
6 years ago
Zhang Ting 698b8b73ad
optimize lod_reset op to avoid data transform
6 years ago
Liufang Sang f0b1518438 add dequantize_abs_max op and modify lookup_table op (#20899)
6 years ago
hutuxian a6ce2306f9 support cvm_op run in gpu (#21300)
6 years ago
Yihua Xu b085ecc258 Avoid the string as the key of map to improve the jit performance (#21292)
6 years ago
zhongpu c4ede95c74 open dygraph op test, test=develop (#19787)
6 years ago
danleifeng 6fc3e8ec84 edit elementwise_mul doublegrad inplace (#21245)
6 years ago
zhaoyuchen2018 3ff5cc2d5e
Fix topk compile failed on windows (#21243)
6 years ago
Zhang Ting 01a9646323
optimize assign op to avoid copy data from GPU to GPU (#21181)
6 years ago
danleifeng 0e7baabe59 extend elementwise broadcast function (#20957)
6 years ago
Adam d623e863c9 Fix GELU grad error (#21204)
6 years ago
yaoxuefeng b5d8ba8394
fix data_norm op to avoid impractical normalization result test=develop (#21152)
6 years ago
Zhang Ting 9cbe7bccba modified error message and API doc for channel_last supported Op (#21002)
6 years ago
guofei 56b5d14704 Fix the error of init variable in StaticRNN when stop_gradient=ON (#21118)
6 years ago
WangXi 3c98ec90ce Fix INF bug of softmax_cross_entropy_op (#21165)
6 years ago
Yihua Xu eec9c9cbe7 Fix jit tls issue (#21151)
6 years ago
ruri aeb887911f
Refine edit distance cn (#21121)
6 years ago
Kaipeng Deng 98b59cb82c
fix elementwise_mod float point kernel. test=develop (#21183)
6 years ago
whs cfdd1fc2cd
Fix warpctc in padding mode. (#21033)
6 years ago
Chen Weihang 8da0cd537a
Add examples for error message writing specification - NotFound, OutOfRange, AlreadyExists, PermissionDenied (#21134)
6 years ago
zhaoyuchen2018 b93870e696
Improve topk performance. (#21087)
6 years ago
Chen Weihang 8414575b78
Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137)
6 years ago
Chen Weihang 7e5f74b825
Add examples for error message writing specification - InvalidArgument (#21132)
6 years ago
zhaoyuchen2018 4a544762a2
Add Asypadding for conv fusion. (#21041)
6 years ago
WangXi de5d3ff688 Fix dgc buffer illegal & reuse velocity (#21012)
6 years ago
ceci3 f62a929151
fix instance norm (#21042)
6 years ago
lilong12 e249d9a3e2
fix the computation for dx (grad for x) for prelu operation. (#20949)
6 years ago
Zhang Ting e0285eae64 add check for input channels and Attr(groups), test=develop (#21095)
6 years ago
Yiqun Liu 35f17ae28f
Add the check of lod_level between compile-time and runtime. (#20961)
6 years ago
Chen Weihang 826254f664
Add pre-condition check for fuse optimizer op pass (#21005)
6 years ago
Aurelius84 1cd6721873 Optimizer mmcpy if _rand_len=16 and remove data copy in GradKernel (#21099)
6 years ago
joanna.wozna.intel 77c2083586 Add transpose2 INT8 for mkl-dnn (#19424)
6 years ago
LielinJiang 06063b7001 add op locality_aware_nms, test=develop (#20976)
6 years ago
wangchaochaohu fc385777e4
fix the compile cost long time test=develop (#21064)
6 years ago
Chen Weihang 2f27b10331
Add dependency for error_codes.proto (#21084)
6 years ago
wangchaochaohu 149a1e3124
Expand refine (#21063)
6 years ago
Wojciech Uss af3ff422cc Fix dst memory allocation in elementwise_add (#21059)
6 years ago
liym27 26a6e27afe fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation, _get_padding_with_SAME and conv2dtranspose_forward_naive. (#20997)
6 years ago