Commit Graph

4994 Commits (05c00af5f16da64d1e8953711c647512121ef3d2)

Author SHA1 Message Date
Michał Gallus ed9ceb9f98 Refactor MKL-DNN ElementwiseMul (#21061)
5 years ago
zhouwei25 345b67b5e2 remove warning LNK4006 and warning LNK4221 (#21226)
5 years ago
wangchaochaohu 6514f52e46
fix the fill_constant op precious problem (#21322)
5 years ago
zhaoyuchen2018 08c19c585d
Improve argsort performance. (#21267)
5 years ago
WangXi 8ac7687e36 Fix dgc accuracy by mv regularization to local (#21278)
5 years ago
Leo Zhao b19e1a1b56 use prefetch to load next mem into cache (#21206)
5 years ago
gongweibao ed2a185248
optimize nhwc for tensor core in ConvOp and ConvGradOp (#20597)
5 years ago
Yihua Xu 69dd5152cf Fix the crash issue when scale or bias was null-pointer. (#21284)
5 years ago
Zhang Ting 698b8b73ad
optimize lod_reset op to avoid data transform
5 years ago
Liufang Sang f0b1518438 add dequantize_abs_max op and modify lookup_table op (#20899)
5 years ago
hutuxian a6ce2306f9 support cvm_op run in gpu (#21300)
5 years ago
Yihua Xu b085ecc258 Avoid the string as the key of map to improve the jit performance (#21292)
5 years ago
zhongpu c4ede95c74 open dygraph op test, test=develop (#19787)
5 years ago
danleifeng 6fc3e8ec84 edit elementwise_mul doublegrad inplace (#21245)
5 years ago
zhaoyuchen2018 3ff5cc2d5e
Fix topk compile failed on windows (#21243)
5 years ago
Zhang Ting 01a9646323
optimize assign op to avoid copy data from GPU to GPU (#21181)
5 years ago
danleifeng 0e7baabe59 extend elementwise broadcast function (#20957)
5 years ago
Adam d623e863c9 Fix GELU grad error (#21204)
5 years ago
yaoxuefeng b5d8ba8394
fix data_norm op to avoid impractical normalization result test=develop (#21152)
5 years ago
Zhang Ting 9cbe7bccba modified error message and API doc for channel_last supported Op (#21002)
5 years ago
guofei 56b5d14704 Fix the error of init variable in StaticRNN when stop_gradient=ON (#21118)
5 years ago
WangXi 3c98ec90ce Fix INF bug of softmax_cross_entropy_op (#21165)
5 years ago
Yihua Xu eec9c9cbe7 Fix jit tls issue (#21151)
5 years ago
ruri aeb887911f
Refine edit distance cn (#21121)
5 years ago
Kaipeng Deng 98b59cb82c
fix elementwise_mod float point kernel. test=develop (#21183)
5 years ago
whs cfdd1fc2cd
Fix warpctc in padding mode. (#21033)
5 years ago
Chen Weihang 8da0cd537a
Add examples for error message writing specification - NotFound, OutOfRange, AlreadyExists, PermissionDenied (#21134)
5 years ago
zhaoyuchen2018 b93870e696
Improve topk performance. (#21087)
5 years ago
Chen Weihang 8414575b78
Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137)
5 years ago
Chen Weihang 7e5f74b825
Add examples for error message writing specification - InvalidArgument (#21132)
5 years ago
zhaoyuchen2018 4a544762a2
Add Asypadding for conv fusion. (#21041)
5 years ago
WangXi de5d3ff688 Fix dgc buffer illegal & reuse velocity (#21012)
5 years ago
ceci3 f62a929151
fix instance norm (#21042)
5 years ago
lilong12 e249d9a3e2
fix the computation for dx (grad for x) for prelu operation. (#20949)
5 years ago
Zhang Ting e0285eae64 add check for input channels and Attr(groups), test=develop (#21095)
5 years ago
Yiqun Liu 35f17ae28f
Add the check of lod_level between compile-time and runtime. (#20961)
5 years ago
Chen Weihang 826254f664
Add pre-condition check for fuse optimizer op pass (#21005)
5 years ago
Aurelius84 1cd6721873 Optimizer mmcpy if _rand_len=16 and remove data copy in GradKernel (#21099)
5 years ago
joanna.wozna.intel 77c2083586 Add transpose2 INT8 for mkl-dnn (#19424)
5 years ago
LielinJiang 06063b7001 add op locality_aware_nms, test=develop (#20976)
5 years ago
wangchaochaohu fc385777e4
fix the compile cost long time test=develop (#21064)
5 years ago
Chen Weihang 2f27b10331
Add dependency for error_codes.proto (#21084)
5 years ago
wangchaochaohu 149a1e3124
Expand refine (#21063)
5 years ago
Wojciech Uss af3ff422cc Fix dst memory allocation in elementwise_add (#21059)
5 years ago
liym27 26a6e27afe fix bug in pool/conv/conv_transpose: UpdatePaddingAndDilation, _get_padding_with_SAME and conv2dtranspose_forward_naive. (#20997)
5 years ago
Chen Weihang 7ee25189c3
Enrich the type of error and declare the error type interfaces (#21024)
5 years ago
Adam 3fda695bb0 Add support for asymetric padding in MKLDNN pool, conv and conv_transpose (#21062)
5 years ago
Huihuang Zheng 1957192f05
Add select_input_op and select_output_op (#21016)
5 years ago
Liufang Sang e5e699ecc0 set lod level for compile time test=develop (#21022)
5 years ago
liym27 f0e95a6049 Polish error messages of pool_2d/3d and add Raises in English document. test=develop (#21017)
5 years ago
zhaoyuchen2018 0059404e77
Fix ce ocr_recognition test fails (#20987)
5 years ago
Chengmo bc8e600ce5
Fix rpc not wait in GEO communicator (#20967)
5 years ago
Tao Luo 25ffa8445d
refine murmurhash3_x64_128 for bloom_filter (#20996)
5 years ago
Zeng Jinle 878a40f57d
Support NoNeedBufferVarsInference in dygraph backward (#20868)
5 years ago
wangchaochaohu bf379fef96
refine code for code reuse test=develop (#20988)
5 years ago
Zhang Ting de9bec607e lrn supports channel_last input, test=develop (#20954)
5 years ago
Liufang Sang 9b666cae67 fix diff in dequantize op between cpu and gpu test=develop (#20953)
5 years ago
Zhang Ting f4f85831d3 fix the bug of conv_transpose cudnn kernel, test=develop (#20958)
5 years ago
zhaoyuchen2018 7f3a445e9a
Fix gru as small frame_size has error. (#20922)
5 years ago
123malin 20cdff0e02
Optimize decay (#20816)
5 years ago
Chengmo 16596f6498
Fix Paddle Cloud role maker (#20860)
5 years ago
liym27 59de8e1214 Compatible int32 and int64 for attr in concat/split/unsqueeze. test=develop (#20912)
5 years ago
Zhang Ting 8d1e9f0f7e maxout supports channel_last input (#20846)
5 years ago
Yihua Xu b6260f3866 Optimize the kernel implementation of layernorm with openmp (#20895)
5 years ago
hong 8c4573a3cb
GradMaker for dygraph (#19706)
5 years ago
Chen Weihang 768551b25d Add parameter init check add run_startup_progrom error message for fc(mul) (#20906)
5 years ago
Zhang Ting c18f1bd716 fix the bug of conv_transpose:compatible with Anylayout setting, test=develop (#20897)
5 years ago
Wilber b489760099
fix jit_matmul bug test=develop (#20886)
5 years ago
Yiqun Liu 03ba0fdae6
Move the codes of fused operators to operators/fused directory. (#20881)
5 years ago
zhang wenhui d428912503
fix select_rows mergeadd bug, test=develop (#20876)
5 years ago
liym27 6802539a2e support Tensor for split and concat, support -1 in num_or_sections, add check num_or_sections (#20780)
5 years ago
wangchaochaohu 28ca2e5ffa
strided_slice perforamnce improvement test=develop (#20852)
5 years ago
Yiqun Liu 6fcfd32e6c
Check and correct the output's lod_level in DynamicRNN related operators (#19144)
5 years ago
liym27 84d221b667 improve unsqueeze op to support int, Tensor for argument axes (#20824)
5 years ago
silingtong123 03d7f3ddb2 Make shape tensor support int32 (#20757)
5 years ago
Huihuang Zheng 95ba4bd2ab
Add shape and type check at read_op (#20754)
5 years ago
Aurelius84 aacd16dbb4 add pyramid_hash_op (#20698)
5 years ago
Chen Weihang 8b59ac3ad0 delete paddle infershape enforce marco (#20832)
5 years ago
whs c8e49be2f1
Fix roi_perspective_transform op (#20764)
5 years ago
Chen Weihang 26cc1fe508
Replace risky GetInputType method with secure IndicateVarDataType interface (#20668)
5 years ago
Yamei-Lee cf717fd6dd fix bug in reshape: (#20781)
5 years ago
Zhang Ting 5a8d885d72 All elements in attr(shape) of crop_tensor can be -1 and int32/64 kernel registered (#20756)
5 years ago
danleifeng 9171f73714 fix fp16 grid_size for size=1; test=develop (#20812)
5 years ago
Tao Luo efbdad0596
make search_compute support avx default (#20779)
5 years ago
WangXi 250e72d254 Fix DGC algorithm flow to make it the same as paper (#20758)
5 years ago
zhaoyuchen2018 6e6eab07e8
Fix multihead op bug. (#20783)
5 years ago
lvmengsi dfa0549f87
Revert "fix_depthwise_conv_cudnn, test=develop (#20712)" (#20782)
5 years ago
whs 4c7d196d83
Add norm_by_time for warpctc op in padding mode. (#17580)
5 years ago
Pei Yang e89c16b90d
Bug Fix: Paddle-TRT cannot handle adaptive pooling in pool2d op converter and "num" attribute in split op converter (#20733)
5 years ago
石晓伟 37cd43545a
update the infer shape of matmul, test=develop (#20717)
5 years ago
Adam 67b59ddb38 Minor MKL-DNN conv int8 performance fixes (#20753)
5 years ago
wangchaochaohu 0687bcd64f
Refine getitem of Variable (#20729)
5 years ago
danleifeng 79e08ecebf add assertions on whether elementwise_div divison is zero (#20618)
5 years ago
123malin 95e90aa102
test=develop, add communicator_is_sgd_optimizer flag (#20677)
5 years ago
Aurelius84 74a28f5ea4
fix fill_constant shape with -1 and enhance cross_entropy test=develop (#20722)
5 years ago
lvmengsi dc229b4195
fix_depthwise_conv_cudnn, test=develop (#20712)
5 years ago
gongweibao c1710e91b2
Disable GRPC_ARG_ALLOW_REUSEPORT to avoid potencial problem. (#20690)
5 years ago
lidanqing 46e93f7c86 Revert "Refactor conv computeINT8" (#20640)
5 years ago
Zeng Jinle ab575de725 Fix op run log when memory optimization strategy is enabled (#20695)
5 years ago
Jacek Czaja a1cd27f13f [MKL-DNN] Added mkl-dnn cache clearing when creating Executor instance (#20241)
5 years ago
Zeng Jinle 10505faf4e
polish codes, test=develop (#20672)
5 years ago
Zeng Jinle 34e3adaece
Refine reduce codes to save compiling time and binary size (#20676)
5 years ago
whs a3e641e93c
Fix infer shape of warpctc op. (#20653)
5 years ago
Zeng Jinle 4922eb6da5
make_conv_workspace_size_configurable, test=develop (#20662)
5 years ago
zhongpu efa10937bd fix elementwise_floordiv_op and elementwise_mod_op (#20534)
5 years ago
tangwei12 04384502a8
fix bug with heart beat , test=develop (#20654)
5 years ago
wangchaochaohu 7783d3bd43
Conv refine (#20644)
5 years ago
Chen Weihang 003f369bb2
Add IndicateVarDataType interface to block tensor is not initialized problem in OP GetExceptedKernelType (#20044)
5 years ago
gongweibao f3f52fc1e2
Retry when failed to bind address. (#20642)
5 years ago
qingqing01 01eddc1a04
Support fp16 in GPU impl of fused_elemwise_activation_op. (#20636)
5 years ago
Chengmo 940c6ff1c8
Fix communicator slow bug & fix communicator stop bug (#20366)
5 years ago
zhaoyuchen2018 8314e64a8b
Fix sum op fails as no memory in tensor(#20602)
5 years ago
Yibing Liu ee2869cae9
Remove redundant infershape in linear chain crf grad, test=develop (#20629)
5 years ago
123malin b4a3b75002
bug fix: invalid learning rate decay in pserver async mode (#20325)
5 years ago
石晓伟 a4753f3a79 Optimize error message of mean_op and matmul_op (#20413)
5 years ago
Leo Chen d6c1d6ca56 update class name, test=develop (#20578)
5 years ago
Double_V 0b39218749 memory optimizer for reshape op,test=develop (#20569)
5 years ago
chengduo 36c85ef492
Add sub-scope check in RecurrentOp (#20468)
5 years ago
JesseyXujin 2ff18e537f
add expand_as op, test=develop (#20565)
5 years ago
Zeng Jinle 40effc61af
Refine py_reader exit (#20331)
5 years ago
Zhang Ting 78910480c1 fix conv_transpose's bug: compatible with Anylayout setting, test=develop (#20589)
5 years ago
Yuan Shuai 172e91c008
Refine error message of transpose_op (#20437)
5 years ago
liym27 fc6ec3b9f6 fill_constant support Tensor; (#20521)
5 years ago
Zhang Ting 0130cc969c fixed group_norm's bug and modified unittest (#20506)
5 years ago
zhaoyuchen2018 8fb569e5b9
Fix api doc example bug and polish square doc (#20491)
5 years ago
Guo Sheng dfd1eee7f7 Add seq2seq api related code (#19820)
5 years ago
lvmengsi 2384589383
Fix conv_grad_grad (#20469)
5 years ago
Double_V 8299203370 Support reshape_op double gradient (#20304)
5 years ago
hong19860320 4d0d5e4cc7
refine eng doc for hard_sigmoid op (#20442)
5 years ago
Aurelius84 22823df2e2
enhance embedding error message test=develop (#20246)
5 years ago
zhupengyang 3997743a5b add input type and dtype check, enhance shape error message for concat_op (#20101)
5 years ago
zhupengyang 95524a4d30 fix APIs: relu, relu6, hash (#20416)
5 years ago
JesseyXujin 843bdbaae1
add input type and dtype check for accuracy_op (#20399)
5 years ago
lijianshe02 211f5b0319
enhance mul_op input error message test=develop (#20414)
5 years ago
GaoWei8 5ea2cc6733 fix API:cos, exp, ceil, elu, brelu English doc (#20032)
5 years ago
wopeizl 3044a62f2a
fix the precise roi poop op test=develop (#20126)
5 years ago
Wilber 2893cd1ae0
modify english api (#20159)
5 years ago
zhouwei25 b1218d056b fix English Doc of API:layers.py_func/sum (#20329)
5 years ago
qingqing01 63194d6e67
Enhance InferShape in deformable_conv and prior_box op (#20372)
5 years ago
tangwei12 a010d883b4
doc fix, test=develop, test=document_fix (#20239)
5 years ago
huzhiqiang 6a8e54047f
fix reorder_lod_tensor_by_rank doc en (#20256)
5 years ago
Yibing Liu 899ab30df0
Fix several api docs (#20282)
5 years ago
wangchaochaohu 1288ac2983
fix expand bug (#20340)
5 years ago
SunGaofeng a73e1f68b4
fix document of 11 APIs (#20278)
5 years ago
Pei Yang 057d782d51
fix en api doc of [round, sin, sqrt], test=develop, test=document_fix (#20296)
5 years ago
Kaipeng Deng 3833b511a6
refine en API doc (#20206)
5 years ago
wangchaochaohu bc6126dd07
fix the reduce bug test=develop (#20102)
5 years ago
FDInSky e2c7b6821a test=develop enhance uniform_random op python api (#20295)
5 years ago
danleifeng 3a0f93b3f9 fix error message for elementwise_add/mul (#20283)
5 years ago
liym27 670937e11d add input type and dtype check for reshape op. (#20099)
5 years ago
Zeng Jinle 48029ab06c
Remove some DefaultGradOpDescMaker (#20185)
5 years ago
Aurelius84 729f5846cc
enhance shape error message of fc API (#20172)
5 years ago
wangguanzhong 6fbf441001
enhance input check for roi_align, test=develop (#20238)
5 years ago
Yibing Liu d849e9835f
Add detailed error messages for nce layer (#20231)
5 years ago
Double_V 98da70f63f fix API en doc (#20261)
5 years ago
zhaoyuchen2018 5ebf4078dc
add input type and dtype check for squeeze (#20100)
5 years ago
liuwei1031 e03c1d8a9e
fix conv_op compilation issue on windows (#20230)
5 years ago
JesseyXujin 407efcf7b1
fix API doc, solve conflict, test=develop, test=document_fix (#20196)
5 years ago
liym27 ad60b3b8ac mv two function in conv op for good code style (#20116)
5 years ago
liym27 869cef6dc0 fix bug of infer shape in pool op. test=develop (#20213)
5 years ago
lvmengsi 59a7c222ea
refine en doc (#20088)
5 years ago
Zeng Jinle 3eebd5b391
refine sequence_softmax grad maker, test=develop (#20127)
5 years ago
Chengmo eb05db7104
Speed GEO-SGD (#20158)
5 years ago
Zhang Ting cf6919bf6e conv_transpose supports channel_last input, test=develop, test=document_preview (#20072)
5 years ago
tangwei12 c9139c3db3
trainer from dataset fetch targets (#19760)
5 years ago
tangwei12 b5a410466c
Trainer heartbeat for async mode (#19600)
5 years ago
lvmengsi 76ba55e891
add error log for python api and c++ (#20061)
5 years ago
Yibing Liu 01ad8d2e06
Refactor linear chain crf op & crf decoding op (#19982)
5 years ago
wangchaochaohu 6e73e90bfb
fix the error message for reduce_mean and reduce_sum op (#20063)
5 years ago
wangchaochaohu 9a76f3f916
Fill constant error message fix (#20075)
5 years ago
zhaoyuchen2018 e867366805
Add multihead op for ernie opt (#19933)
5 years ago
Chengmo 728ec1b43d
Add GEO-SGD distribute training algorithm (#20018)
5 years ago
Li Fuchen 5365cd2f14
Set lod level of sequence_unpad's output to 1 in compile time (#20068)
5 years ago
danleifeng 425279a57b Improve elementwise operators performance in same dimensions. (#19763)
5 years ago
liuwei1031 292aae4385 fix windows compilation issue when compile with VS2015, test=release/1.6 (#20114)
5 years ago
Wilber 276b5e3440
fix compile paddle with anakin bug
5 years ago
silingtong123 649bcd5fe2 Modify the style of function names (#20071)
5 years ago
liym27 3aa331d97e fix conv2d and conv3d: (#20042)
5 years ago
chengjuntao 6f184775e8 Fix compling warning in deformable conv. (#20036)
5 years ago
wangguanzhong da892cafd5
Refine api doc (#20037)
5 years ago
silingtong123 f1eebf75aa improve op uniform_random, argument shape support tensor and tensor in list (#19786)
5 years ago
liym27 24010472d4 fix pool2d pool3d,support asymmetric padding and channel_last (#19739)
5 years ago
Adam fe581b0e8a Minor GetMKLDNNFormat changes (#20055)
5 years ago
lvmengsi c92348c3b9
fix conv_grad_grad (#20054)
5 years ago
Kaipeng Deng e7a6567be5
polish pool infer shape (#20038)
5 years ago
chengduo fb2a9cdf83
Add fp16 support for pad and split (#19881)
5 years ago
lvmengsi 647ff784e2
fix mul double grad (#20040)
5 years ago
tangwei12 8f0b3c0516
the integrated communicator (#19849)
5 years ago
danleifeng 5cef7a2f25 Polish English docs of elementwise_add/sub/mul/div (#20027)
5 years ago
Li Fuchen c8e125872c
Fixed warpctc, test=develop (#20011)
5 years ago
wangchaochaohu 3409db950c
fix reduce bug test=develop (#19971)
5 years ago
Adam 4b65af7719 MKLDNN BatchNorm operator refactor (#20012)
5 years ago
joanna.wozna.intel 1d32897c5c Fix test pool2d int8 mkldnn (#19976)
5 years ago
Aurelius84 f58c8db668
Require x.dims=label.dims in huber_loss (#20017)
5 years ago
Aurelius84 137e6336ef
Remove constraint that last dimension is forced to be 1 in rank_loss (#19997)
5 years ago
chengduo 101a2b610a Add dtype for coalesce_tensor_op (#20016)
5 years ago
Zhaolong Xing f04f2b232a
fix if else error info (#19974)
5 years ago
gongweibao a7512db2bc
Polish elementwise max min pow document to add more examples. (#19946)
5 years ago
Aurelius84 2b5b4b3c5e
fix dataType in C++ comment in embedding op (#20004)
5 years ago
Tao Luo bcb2903e60
enhance shape error message of mul_op (#19998)
5 years ago
Chen Weihang 1409586eaa Add LoD empty check for all related sequence ops (#19980)
5 years ago
zhongpu b1bb23841e add kernel for fill_op, test=develop (#19719)
5 years ago
wangchaochaohu 382d099dcb
add support tensor and tensorlist for strided_slice OP (#19929)
5 years ago
lvmengsi 619a241bd0
Fix OpTest of bn (#19062)
5 years ago
Bob Zhu c670058a8d add support of matmul with multiple head even different width and height (#19708)
5 years ago
Liufang Sang 6884dc800a refine ctc align op with padding (#19926)
5 years ago
Aurelius84 99a9615a4b
Removing length dims constraints of seq_pad and seq_unpad (#19497)
5 years ago
jhjiangcs 766bd529d1 add optimizer:dpsgd,test=develop (#19915)
5 years ago
Yang Zhang ebff68fa74
Add float16 support to `sync_batch_norm_op` (#19681)
5 years ago
Aurelius84 039b9710d5
Remove constraint that last dimension is forced to be 1 by adding lookup_table_v2 (#19735)
5 years ago
xujiaqi01 cedc04775c
support change shuffle and train thread num (#19841)
5 years ago
Kaipeng Deng 14625ffe9e
add elementwise mod support float/double. test=develop (#19570)
5 years ago
Jacek Czaja 5b07ca9cdd - ReImplemented pooling fwd mkldnn (#19911)
5 years ago
Zeng Jinle b1e83b33b0
fix huber loss op attr type, test=develop (#19937)
5 years ago
Zeng Jinle cc157d5990
add inplace to assign op, test=develop (#19927)
5 years ago
Leo Chen 57606205f5 Make OpTest check grad inplace even if forward has no inplace (#19847)
5 years ago
Zhang Ting cb8f3c03a7 resize Ops support data_layout:channel_last, test=develop, test=document_preview (#19914)
5 years ago
Kaipeng Deng 3f021781a1
fix softmax CE time limit check failed (#19846)
5 years ago
石晓伟 30adea0a23
tensor_array_to_tensor_op.cc, test=develop (#19289)
5 years ago
lvmengsi 4155e62559
add instance norm (#19500)
5 years ago
Adam cb65439da8 Add support for other axes in MKLDNN softmax op (#19907)
5 years ago
Pei Yang baccd7e2ca
Add TRT input shape check between model and runtime (#19864)
5 years ago
Aurelius84 fcf53e55ff
support 2-level lod of input in sequence_pool (#19839)
5 years ago
Zhang Ting 93364b45c1 group_norm support data_layout:NHWC, test=develop, test=document_preview (#19614)
5 years ago
Jacek Czaja 619c797a7f [MKL-DNN] LRN refactoring (#19798)
5 years ago
Zhang Ting 439d95e157 modified interpolate op to support tensor attribute, test=develop, test=document_preview (#19287)
5 years ago
Zhang Ting b38889413d add crop_tensor_op, test=develop, test=document_preview (#19314)
5 years ago
lidanqing 2c32c2d649 Refactor conv computeINT8 (#19574)
5 years ago
Adam c7e688921b Add template functions for Acquire primitive/primitive_desc (#19867)
5 years ago
Aurelius84 b125e327aa
Remove constraint that last dimension is forced to be 1 in cross_entropy (#19606)
5 years ago
wopeizl a7c440d303
add precise roi pooling op test=develop (#18960)
5 years ago
Yiqun Liu 3cd985a669
Add a pass to fuse fc+elementwise_add+layernorm (#19776)
5 years ago
wangchaochaohu 47af618f70
Strided slice (#19642)
5 years ago
123malin 1bc285a53a
add retry function to try to solve grpc error code 14 (#19661)
5 years ago
LielinJiang 6d72a86b14 fix_roi_transform_bug (#19785)
5 years ago
Zeng Jinle 3fd3b663a8
fix gc bug in controlflow ops, test=develop (#19827)
5 years ago
Leo Chen 982e61f5ff Update elementwise double grad to save gpu memory (#19509)
5 years ago
Adam dfdd73cbc0 Add MKLDNNhandlerT templatized class (#19801)
5 years ago
Zeng Jinle cabb9501bd
fix leaky_relu op when alpha is zero, test=develop (#19833)
5 years ago
chengjuntao 00efd1d8a9
add deformable conv v1 op and cpu version of deformable conv v2 (#18500)
5 years ago
liym27 677e714425 fix pow op, support tensor for agument factor. (#19313)
5 years ago
liym27 bd89a27308 add tensor support for argument shape in reshape op; (#19268)
5 years ago
liym27 88628016b2 add tensor(tensor and tensor in list) support for argument starts and ends in slice op; (#19208)
5 years ago
liym27 e9e3c08777 fix expand op: (#19302)
5 years ago
lvmengsi b76343c3b7
cpu Conv double grad (#19672)
5 years ago
翟飞跃 93c85c930a Implement FusedEmbeddingSeqPoolGradKernel with cblas_saxpy (#19770)
5 years ago
Yiqun Liu c67c8758cb
Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733)
5 years ago
zhongpu 52673956de add kernel for squeeze_op, test=develop (#19656)
5 years ago
zhongpu 2a81c3679a add kernel for unstack_op, test=develop (#19538)
5 years ago
Kaipeng Deng 99c78b772a
fix softmax axis!=-1. test=develop (#19800)
5 years ago