Commit Graph

4783 Commits (b085ecc25896c0a4aea70bcfff316683a76ec5e4)

Author SHA1 Message Date
Kaipeng Deng 14625ffe9e
add elementwise mod support float/double. test=develop (#19570)
6 years ago
Jacek Czaja 5b07ca9cdd - ReImplemented pooling fwd mkldnn (#19911)
6 years ago
Zeng Jinle b1e83b33b0
fix huber loss op attr type, test=develop (#19937)
6 years ago
Zeng Jinle cc157d5990
add inplace to assign op, test=develop (#19927)
6 years ago
Leo Chen 57606205f5 Make OpTest check grad inplace even if forward has no inplace (#19847)
6 years ago
Zhang Ting cb8f3c03a7 resize Ops support data_layout:channel_last, test=develop, test=document_preview (#19914)
6 years ago
Kaipeng Deng 3f021781a1
fix softmax CE time limit check failed (#19846)
6 years ago
石晓伟 30adea0a23
tensor_array_to_tensor_op.cc, test=develop (#19289)
6 years ago
lvmengsi 4155e62559
add instance norm (#19500)
6 years ago
Adam cb65439da8 Add support for other axes in MKLDNN softmax op (#19907)
6 years ago
Pei Yang baccd7e2ca
Add TRT input shape check between model and runtime (#19864)
6 years ago
Aurelius84 fcf53e55ff
support 2-level lod of input in sequence_pool (#19839)
6 years ago
Zhang Ting 93364b45c1 group_norm support data_layout:NHWC, test=develop, test=document_preview (#19614)
6 years ago
Jacek Czaja 619c797a7f [MKL-DNN] LRN refactoring (#19798)
6 years ago
Zhang Ting 439d95e157 modified interpolate op to support tensor attribute, test=develop, test=document_preview (#19287)
6 years ago
Zhang Ting b38889413d add crop_tensor_op, test=develop, test=document_preview (#19314)
6 years ago
lidanqing 2c32c2d649 Refactor conv computeINT8 (#19574)
6 years ago
Adam c7e688921b Add template functions for Acquire primitive/primitive_desc (#19867)
6 years ago
Aurelius84 b125e327aa
Remove constraint that last dimension is forced to be 1 in cross_entropy (#19606)
6 years ago
wopeizl a7c440d303
add precise roi pooling op test=develop (#18960)
6 years ago
Yiqun Liu 3cd985a669
Add a pass to fuse fc+elementwise_add+layernorm (#19776)
6 years ago
wangchaochaohu 47af618f70
Strided slice (#19642)
6 years ago
123malin 1bc285a53a
add retry function to try to solve grpc error code 14 (#19661)
6 years ago
LielinJiang 6d72a86b14 fix_roi_transform_bug (#19785)
6 years ago
Zeng Jinle 3fd3b663a8
fix gc bug in controlflow ops, test=develop (#19827)
6 years ago
Leo Chen 982e61f5ff Update elementwise double grad to save gpu memory (#19509)
6 years ago
Adam dfdd73cbc0 Add MKLDNNhandlerT templatized class (#19801)
6 years ago
Zeng Jinle cabb9501bd
fix leaky_relu op when alpha is zero, test=develop (#19833)
6 years ago
chengjuntao 00efd1d8a9
add deformable conv v1 op and cpu version of deformable conv v2 (#18500)
6 years ago
liym27 677e714425 fix pow op, support tensor for agument factor. (#19313)
6 years ago
liym27 bd89a27308 add tensor support for argument shape in reshape op; (#19268)
6 years ago
liym27 88628016b2 add tensor(tensor and tensor in list) support for argument starts and ends in slice op; (#19208)
6 years ago
liym27 e9e3c08777 fix expand op: (#19302)
6 years ago
lvmengsi b76343c3b7
cpu Conv double grad (#19672)
6 years ago
翟飞跃 93c85c930a Implement FusedEmbeddingSeqPoolGradKernel with cblas_saxpy (#19770)
6 years ago
Yiqun Liu c67c8758cb
Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733)
6 years ago
zhongpu 52673956de add kernel for squeeze_op, test=develop (#19656)
6 years ago
zhongpu 2a81c3679a add kernel for unstack_op, test=develop (#19538)
6 years ago
Kaipeng Deng 99c78b772a
fix softmax axis!=-1. test=develop (#19800)
6 years ago
Adam d4413a54bc Add common CreateKey for mkldnn handlers (#19767)
6 years ago
Aurelius84 8c7e411908
Remove constraint that last dimension is forced to be 1 by adding one_hot_v2 (#19716)
6 years ago
Jacek Czaja 9e4c958552 Refactoring activation mkldnn op (#19748)
6 years ago
Huihuang Zheng 12542320c5
Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989)
6 years ago
Zeng Jinle 0daa5c9772
Make leaky relu inplacable (#19676)
6 years ago
Zeng Jinle 078a678219
refine math_op_patch, test=develop (#19727)
6 years ago
Jacek Czaja 47f670d58c - Softmax mkl-dnn refactoring (#19615)
6 years ago
Yiqun Liu a65c728e5d
Implement the GPU kernel of fc operator (#19687)
6 years ago
Aurelius84 22301115d0
Remove constraint that last dimension is forced to be 1 in huber_loss op (#19562)
6 years ago
Tao Luo ec9bc1bd9f
paddle::framework::vectorize() templatization (#19730)
6 years ago
Adam 428b2b9e17 MKLDNN handler cleanup (#19713)
6 years ago
Zeng Jinle 1c25c88aba
refine memory usage of some operators, test=develop (#19700)
6 years ago
wangguanzhong 25dcd74d34
merge empty lod tensor, test=develop (#19228)
6 years ago
yaoxuefeng c6756ed225 fix instag op (#19591)
6 years ago
zhongpu 5f627488db add kernel for unsqueeze_op and Add unsqueezed op test, test=develop (#19436)
6 years ago
Tao Luo f05d2c519d paddle::framework::vectorize() templatization [PART3] (#19643)
6 years ago
hutuxian 1ca6ea0318
fix cmakelist deps (#19668)
6 years ago
Tao Luo bcddbc78d4
remove -Wmaybe-uninitialized warning (#19653)
6 years ago
wangchaochaohu 4440d7ced0
test=develop cuda realization of label smooth op (#19175)
6 years ago
chengduo 31c5a5ee26 Remove linear_chain_crf_op.cu (#19645)
6 years ago
wangchaochaohu ed8f44ea21
codegen for fused elementwise operation (#19520)
6 years ago
Chen Weihang 73daa3d6c0
Code Cleanup: delete three useless raw variables in Conv2D (#19644)
6 years ago
123malin 2f037c3189
fix the diff between async mode and async_half mode (#19535)
6 years ago
tangwei12 f45cb1c2ca
fix bug of communicator flag, test=develop (#19635)
6 years ago
Tao Luo 3ae939e48a
unify PADDLE_ASSERT_MSG into PADDLE_ENFORCE(error_message) (#19631)
6 years ago
Leo Chen af692c9140 update reduce_sum and reduce_mean to save memory, test=develop (#19608)
6 years ago
Zeng Jinle 710767d894
Enable inplace support for some ops (#19612)
6 years ago
Tao Luo d6c85c96dc
paddle::framework::vectorize() templatization (#19627)
6 years ago
danleifeng 8672e15363 elementwise broadcast function enhancement (#19536)
6 years ago
Chen Weihang 8cb54ede8c
Add user-friendly error message in optimizer ops to give a hint about the position sensitive problem of run(startup_program) (#19605)
6 years ago
zhongpu 118bb897cf add kernel for flatten_op, test=develop (#19472)
6 years ago
Tao Luo 0a46d34538
refine some PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19607)
6 years ago
ShenLiang 2cd3fa3e9a add scatter_nd op and scatter_nd_add op (#19571)
6 years ago
wawltor 364c44422e
Add the support the int64 data type of `scatter_op` input Index(#18804) (#19508)
6 years ago
Adam 8d6d95cc2b paddle::framework::vectorize() templatization (#19611)
6 years ago
Tao Luo 75d1571995
refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603)
6 years ago
Adam e94b26daf5 using MKLDNNMemoryFormat = mkldnn::memory::format changes (#19568)
6 years ago
baojun f2ad30c4dd Some ngraph op and unittest fix (#19515)
6 years ago
Tao Luo 49523ea189
replace PADDLE_ASSERT with PADDLE_ASSERT_MSG (#19586)
6 years ago
gongweibao abaf87be2b
Change backward_guard to optimize_guard to maximize the allreduce overlap. (#19506)
6 years ago
gongweibao 57f0f0f2dc
Delete pserver complete file before executor running. (#19468)
6 years ago
JesseyXujin 4a7e6deb63
add padding in linear_chain_crf op (#19583)
6 years ago
zhouwei25 84c728013c fix the compilation issue on windows caused by mkl_CSRMM (#19533)
6 years ago
Jacek Czaja cef95ee30d [MKL-DNN] Refactoring Softmax (#19312)
6 years ago
hutuxian c756b5d231
Paddlebox Framework (#18982)
6 years ago
Zeng Jinle 5dce1da680 remove reset recordio usage (#19519)
6 years ago
ShenLiang 85914f7a88 add gather_nd op and unit test (#19366)
6 years ago
Jacek Czaja ecd9f330c9 [MKL-DNN] Fix to face model on AVX512 platforms (#19282)
6 years ago
GaoWei8 e8405e5c61 Modify the dropout op to multi-thread (#19504)
6 years ago
Huihuang Zheng 2916caa2c4
Change ugly PADDLE_ENFORCE_EQ in recurrent_op.cc (#19470)
6 years ago
Liufang Sang 9dde564097 change var name padding_num to padding_value (#19498)
6 years ago
Aurelius84 5b5379b32a
Add sequence_topk_avg_pooling Op (#19442)
6 years ago
Tao Luo 02270b3eb1
remove unused assert.h (#19529)
6 years ago
lidanqing ba368bf696 clean up intel labeled TODOs (#19476)
6 years ago
Zeng Jinle 11f2f78458
fix sofmax seg fault in AVX, test=develop (#19487)
6 years ago
Zeng Jinle 5c8f210ce3
refine inplace inference registry, test=develop (#19032)
6 years ago
chengduo b6d1d8901f
Increase num_iteration_per_drop_scope (#19075)
6 years ago
Double_V 1d0f04315a fix row_conv_op to force it support lodtensor and tensor input simultaneously, test=develop (#19412)
6 years ago
tangwei12 65c7368400
Fix the correctness of async mode at distributed training (#18863)
6 years ago
baojun 6421c61ae2 Update ngraph engine for multiple threading (#19155)
6 years ago
Yi Liu efb05ba258
supports multiple NCCL communicators preserved in NCCLCommContext (#19407)
6 years ago
Huihuang Zheng 56dd76538c
Delete useless ex-scope in recurrent op (#19426)
6 years ago
vincentXiyu 482ce818bb Support Tensor input with padding for warpctc op (#19322)
6 years ago
Huihuang Zheng 12d29f4d2a
Change TensorCopy in recurrent_op to ShareDataWith (#19319)
6 years ago
tangwei12 19dac67e9f
fix distribute transpiler GRPC error code 4, RPC Deadline (#18984)
6 years ago
翟飞跃 2e3ee57954 Use sparse matrix to implement FusedEmbeddingSeqPoolGradKernel (#19153)
6 years ago
Leo Chen a9d5fc5142 Enhance OpTest to check the consistency of operators when using and not using inplace (#19101)
6 years ago
Aurelius84 0d29cf18f4
Supports diagonal initialization in uniform_random op (#19299)
6 years ago
Adam 97d1db1874 Add generalized Conv+Activation MKLDNN fuse pass creation Part2 (#19237)
6 years ago
wangguanzhong 37428952c6
fix generate mask fpn, test=develop (#19301)
6 years ago
zhaoyuchen2018 5296294dae
Fix elementwise performance poor issue (#19278)
6 years ago
Yihua Xu b920395842 Use sparse matrix to implement fused emb_seq_pool operator (#19064)
6 years ago
wangchaochaohu 6e326ca2c6
optimize the realization of cuda dropout (#19136)
6 years ago
Zhaolong Xing 76c95af000
Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213)
6 years ago
qingqing01 5fc8de449a
Remove warning in batch_norm_op (#19260)
6 years ago
Aurelius84 78a3d837f8
Add match_matrix_tensor op (#18525)
6 years ago
Zeng Jinle 5b6673c44d
merge develop to solve conflict, also fix API doc, test=develop (#18823)
6 years ago
zhang wenhui 539c870753
add fl_listen_and_serv &fl_transpiler,test=develop (#19091)
6 years ago
silingtong123 af0fbd9012 change PADDLE_ENFORCE to PADDLE_ENFORCE_CUDA_SUCCESS (#19205)
6 years ago
gongweibao fd4b15a2f6
Unset unittests http_proxy env to avoid timeout. (#19269)
6 years ago
Kaipeng Deng 2848cb791e
fix temporal_shift OP PADDLE_ENFORCE. test=develop (#19161)
6 years ago
Zeng Jinle 708bd9798d
move_flags_to_unified_files_for_management, test=develop (#19224)
6 years ago
Adam b837689e97 Add generalized Conv+Activation MKLDNN fuse pass creation (#19072)
6 years ago
Yibing Liu 50b1cab122
Add padding support for crf_decoding (#19057)
6 years ago
chengduo b5ba801ef0
Fix gather op bug (#19168)
6 years ago
Leo Chen 80eab822c1 Remove unused DefaultGradOpDescMaker in REGISTER_OPERATOR() (#19166)
6 years ago
chengduo c70a97f46e Use CUDAPinnedPlace in buffered_reader (#19112)
6 years ago
Jiawei Wang 6ac32d0981 Instag Implemention (#18394)
6 years ago
huangjun12 20f18930ae Add hard swish op (new op) (#19001)
6 years ago
joanna.wozna.intel bce72c7fea Replace Relu with bounded Relu in MobileNetV2 quantization (#18988)
6 years ago
wangguanzhong 1fc242a7ed
refine infer shape in box decoder and assign op, test=develop (#19118)
6 years ago
gongweibao 29d8781240
Polish fleet API to support cuda collective mode and nccl2 mode. (#18966)
6 years ago
Kevin 945f3cf631
fix code too big test=develop (#19111)
6 years ago
Zeng Jinle 88f111f885
remove unused inplace act codes, test=develop (#19079)
6 years ago
ShenLiang 4397cb318e add eye op, kernel and unitest test=develop (#18980)
6 years ago
Kaipeng Deng f86fead693
Add trilinear_interp OP (#18711)
6 years ago
Zhang Ting c2063217e7 optimize error message for "embedding" and "cross_entropy" OP (#18765)
6 years ago
Yiqun Liu a445c33552
Add the check of lod in sequence_softmax kernel. (#18996)
6 years ago
Kevin e681d65515 Add var_conv_2d op (#18518)
6 years ago
pawelpiotrowicz e53f517a44 fix for multithreading test_analyzer_image_classification --num_threads=X (#18265)
6 years ago
Liufang Sang faf6890b6c support tensor input for ctc align op (#18887)
6 years ago
hutuxian b62c4f9b04
fix concat check info typo (#18975)
6 years ago
Zeng Jinle 7ac748adb4
Open gc by default (#18836)
6 years ago
石晓伟 ee2f296ef8
Fusion: seqpool_cvm_concat (#18471)
6 years ago
wawltor 3ab1866ca5
Add the op of unique_with_counts, expand count function of the op unique (#18720)
6 years ago
Jacek Czaja 5cf2d38594 - Removed passing X from FWD to GRAD via device context (#18911)
6 years ago
LielinJiang 22fa4c2d24 Fix depthwise conv gpu kernel bug (#18582)
6 years ago
liuwei1031 0d99690809
fix several security bugs reported by security team (#18831)
6 years ago
Zhaolong Xing 61238d31f7
Trt fp16 support (#18860)
6 years ago
chengduo 20859c08e8
[DyGraph] Make multi-card program faster (#18892)
6 years ago
HaoRen 24f8543106 Add center Loss Op Support (#18681)
6 years ago
Leo Zhao 86e494eb64 use mkl to accelerate gelu_grad (#18099)
6 years ago
wopeizl dfd6a62a9a
Optimize the error report information when loadcombine fail to open model files test=develop (#18888)
6 years ago
baojun adcfc53b18 upgrade ngraph version and simplify ngraph engine (#18853)
6 years ago
Jacek Czaja cfcb96d2df [MKL-DNN] Fix int8 performance regression (#18758)
6 years ago
danleifeng e0a2d4dfec Add elementwise_pow_op backward implementation and the unit test codes of it. (#18848)
6 years ago
Zeng Jinle 9a8a7a1ddc
fix affine_channel no_need buffer bug, test=develop (#18844)
6 years ago
Adam ee02227949 Add LeakyReLU MKLDNN support (#18762)
6 years ago
lidanqing b05bdda0cf remove unused TransposeINT8Op for higher UT coverage (#18791)
6 years ago
Physher c5f47c2107 fix mul_mkldnn_op build failure (#18816)
6 years ago
Physher a5c986301c clarify MKLDNN INT8 Mul Op attributes (#18685)
6 years ago
FDInSky cff5e2c173 fix roi_align_op cpu backward's bug (#18789)
6 years ago
Bai Yifan d3ac561d65
fix deformable_conv_op compile error, test=develop (#18793)
6 years ago
lidanqing 9ecd8ee789 change ComputeINT8 to template version to remove checking dst_datatype code (#18756)
6 years ago
JesseyXujin d9e7b5b5e9
fix bug of swish op formula,test=develop (#18772)
6 years ago
Bob Zhu 220eef602e Extend Matmul to support matrix multiplication with multiple heads (#18570)
6 years ago
whs 075e1cf78e
Add python API for appending LoD level (#18702)
6 years ago
Jacek Czaja 95c1816ec0 [MKL-DNN] Extended LRN with reusing via Acquire API (#18675)
6 years ago
chengduo fd3aad6cb3
Make fuse_optimizer_op_pass also work when the model contains sparse gradients. (#18664)
6 years ago
wangchaochaohu 6b78e00da4
Cudnn convolution reconstruction (#18284)
6 years ago
Yi Liu 157211c4e1
supports distributed classification (#18690)
6 years ago
qingqing01 3429e65aa8
Fix CPU implementation of roi_align_op backward (#18728)
6 years ago
Tao Luo bd22453f20
Revert "Add LeakyRelu MKLDNN support (#18656)" (#18723)
6 years ago
whs 189b08dc0d
Make infer shape of pad2d support for input with negative dims in compile time. (#18695)
6 years ago
Bai Yifan 7e3963f295 add license, test=develop (#18709)
6 years ago
cjt222 ccf06a48b0
test=develop (#18701)
6 years ago
wangguanzhong 185b3acea1
fix clip_by_norm doc (#18688)
6 years ago
Huihuang Zheng 89bc3fd841
Support memory eager deletion on recurrent OP (#17710)
6 years ago
Adam d6b6a337a9 Add LeakyRelu MKLDNN support (#18656)
6 years ago
hutuxian bb2f5d24a2
hash_op support int64 hash_size (#18674)
6 years ago
guru4elephant 5ed713d519
remove ctr reader, all functions are satisfied in dataset (#18672)
6 years ago
Yang Zhang ce1ec33299
Add cuda implementation for `prelu` backward pass (#18633)
6 years ago
Yihua Xu 97549a4f13 [CPU] Fix the compiling issue with AVX512F macro. (#18634)
6 years ago
baojun 256ba7cbb8 [NGraph] handle dim element 0 of ngraph op (#18568)
6 years ago
Jacek Czaja 71d883b8ef [MKL-DNN] Reimplemented pool2d mkl-dnn to use Acquire API (#18585)
6 years ago
chengduo f4ec7d54c8
fix bug of scatter op (#18640)
6 years ago
guru4elephant ab57d3893e
make auc op compatible with 1 dim (#18551)
6 years ago
Hongyu Liu a20b2b43fc
fix cudnn lstm shape bug; test=develop (#18492)
6 years ago
Zeng Jinle d3003a1620
Feature/buffer_shared_inplace (#17911)
6 years ago
Zeng Jinle be24e5b391
Clean unused code of dim and place (#18565)
6 years ago
Jacek Czaja 8869d7f735 Activations MKLDNN ops refactoring (#18191)
6 years ago
Yibing Liu b86234fc0b
Register fp16 for concat_op (#18563)
6 years ago
Physher 5e1220ef37 fix compile error which caused by gcc4.8 related commit;test=develop (#18567)
6 years ago
Jiabin Yang 667f88f9a6
Fix/gcc 4.8 ubt link error (#18558)
6 years ago
Physher 0caa08ea40 Add mkldnn int8 mul-op kernel (#17834)
6 years ago
LielinJiang 24d1c44a0c Fix roi_perspective_transform_op bug (#18522)
6 years ago
Zhaolong Xing 88b52a27fe
Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532)
6 years ago
zhaoyuchen2018 832d8191ff
Fix topk cannot handle 1D vector bug (#18466)
6 years ago
qingqing01 7ac4818a98
Refine Infershape in activation_op for double_grad. (#18485)
6 years ago
chengduo 7453857324 Make fuse_all_reduce_op_pass support mix_precision (#17652)
6 years ago
zhoukunsheng 7c6f2350b9 support Tensor input for edit_distance op (#18162)
6 years ago
zhoukunsheng 26318544d2 support Tensor input for chunk_eval op (#18226)
6 years ago
zhoukunsheng 206c44e2a8 add unique kernel and op (#17557)
6 years ago
zhoukunsheng 71af72b1c2 upgrade hash op to support Tensor and LoDTensor input (#17998)
6 years ago
zhoukunsheng d3b3443d10 add ones_like op (#17388)
6 years ago
zhoukunsheng 67b48d7fe7 add size op (#17412)
6 years ago
Leo Zhao 8f5fffca0a rename mkldnn set/get_cur_thread_id() to set/get_cur_mkldnn_session_id() (#18453)
6 years ago
Yi Liu a873fa84ce
supports collective training with programs (#18392)
6 years ago
chengduo e0d8c6ac68
Add find_no_grad_vars in backward.py (#17942)
6 years ago
LielinJiang 449c7a9f98 Make roi_perspective_transform op return mask and transform matrix (#18371)
6 years ago
Brian Liu 4bc2987d2f Fix bug in quantize kernel which cause crash in vgg16/19 model (#17964)
6 years ago
Leo Zhao 681d3553f1 Fix potential mkldnn concat/pool/conv kernel issues (#18393)
6 years ago
Zeng Jinle f5641000bb
Add a unittest to inplace elementwise_add (#18385)
6 years ago
tangwei12 999d9a59a5
fix communicator with pyreader (#18350)
6 years ago
HaoRen b7128bac5f supports collective communicated training (#18175)
6 years ago
Sylwester Fraczek 9252e8fa08 add int8 mkldnn prior_box (#17242)
6 years ago
Jacek Czaja c2efdfd5bc [MKL-DNN] Extending reusing to Elementwise_add_mkldnn op (#18146)
6 years ago
qingqing01 9047ac687e
Simplify multi_box_head API in detection.py and remove assign op. (#18310)
6 years ago
Yibing Liu 23941e43ec
Update lamb optimizer (#18333)
6 years ago
tensor-tang 81ec538279
fix softrelu doc (#18324)
6 years ago
Hongyu Liu df2eee71d8
Sequence mask support tensor (#18249)
6 years ago
Qiao Longfei 0e08e91c18
optimize communicator merge sparse gradient test=develop (#18159)
6 years ago
Yibing Liu f57ee3693b
Fix the bug of sequence_unpad op (#18290)
6 years ago
chengduo 5489216eba
Clean build strategy (#18148)
6 years ago
songhao 6b3d96254d fix some bug when merge sparse embedding parameters, test=develop (#18223)
6 years ago
xiaoting b58bb80248 set src_idx > 0 for bilinear_interp_op (#18238)
6 years ago
Hongyu Liu cefd0fb598
Fix slice op shape=-1 bug (#18107)
6 years ago
翟飞跃 802ea50956 fix spelling errors (#17941)
6 years ago
FlyingQianMM 944c3165ec
fix type error of std::pow in sigmoid_focal_loss_op.cu and sigmoid_focal_loss_op.h (#18152)
6 years ago
Zeng Jinle 6eec66a1b1
Fix py_reader iterable bug (#18108)
6 years ago
qingqing01 80d2e66f9e
Update backward appending stragety to support double backward and fix some bug. (#18104)
6 years ago
FlyingQianMM ff83655f7e
add detection output operator for supporting retinanet (#17896)
6 years ago
FlyingQianMM 0aee1f0074
add sigmoid focal loss operator for supporting retinanet (#17895)
6 years ago
FDInSky 9e4b9d9798 Update generate_proposal_labels_op to support CascadeRCNN. (#17200)
6 years ago
FlyingQianMM 9ed2f936f1
add target assign operator for supporting retinanet (#17893)
6 years ago
chengduo 24e988a471
Fix bug of scope_buffered_ssa_graph_executor (#18100)
6 years ago
whs 354643d8d9
Add warning for cudnn warpctc kernel in CUDA9\CUDA10. (#18046)
6 years ago
Yiqun Liu 660c1a65f3
Optimize fused_elewise_activation_grad op. (#18041)
6 years ago
lidanqing f8ecc3de89 refactor the function ConvFwdPrimitiveDesc (#17897)
6 years ago
Wojciech Uss 78e932862c Added unit test for QAT FP32 & INT8 comparison (#17814)
6 years ago
tensor-tang 566bf2ec56
concat op support negative axis (#18045)
6 years ago
Yiqun Liu 7e463c84a6
Optimize the concat and split cuda implementation for cases when the number of inputs/outputs is less than 5. (#17979)
6 years ago
tangwei12 101f74cb19
fix save/load in fleet (#17675)
6 years ago
Guo Sheng a06b316b94
Fix GetExpectedKernelType of add_position_encoding_op (#17935)
6 years ago
wawltor 8eb134c3c1
Fix scatter and gather op when has duplicate index (#17952)
6 years ago
lujun 75fcd29220
update load_error_info, test=develop (#18000)
6 years ago
wawltor 2ae8decc90
test=develop (#17984)
6 years ago
cjt222 871af28d6c
add deformable psroi pooling (#17827)
6 years ago
SunGaofeng 40885c225b
add unfold op (new op),test=develop (#17944)
6 years ago
Jacek Czaja 84bb45c054 [MKL-DNN] Thread-Safety for MKL-DNN reusing Part 1 (#17965)
6 years ago
石晓伟 bce259e5bf
Update the Anakin interfaces for content-dnn and MLU (#17890)
6 years ago