Commit Graph

4572 Commits (fcf53e55ff22c54175efdb32ac367c6e04f19900)

Author SHA1 Message Date
Aurelius84 fcf53e55ff
support 2-level lod of input in sequence_pool (#19839)
6 years ago
Zhang Ting 93364b45c1 group_norm support data_layout:NHWC, test=develop, test=document_preview (#19614)
6 years ago
Jacek Czaja 619c797a7f [MKL-DNN] LRN refactoring (#19798)
6 years ago
Zhang Ting 439d95e157 modified interpolate op to support tensor attribute, test=develop, test=document_preview (#19287)
6 years ago
Zhang Ting b38889413d add crop_tensor_op, test=develop, test=document_preview (#19314)
6 years ago
lidanqing 2c32c2d649 Refactor conv computeINT8 (#19574)
6 years ago
Adam c7e688921b Add template functions for Acquire primitive/primitive_desc (#19867)
6 years ago
Aurelius84 b125e327aa
Remove constraint that last dimension is forced to be 1 in cross_entropy (#19606)
6 years ago
wopeizl a7c440d303
add precise roi pooling op test=develop (#18960)
6 years ago
Yiqun Liu 3cd985a669
Add a pass to fuse fc+elementwise_add+layernorm (#19776)
6 years ago
wangchaochaohu 47af618f70
Strided slice (#19642)
6 years ago
123malin 1bc285a53a
add retry function to try to solve grpc error code 14 (#19661)
6 years ago
LielinJiang 6d72a86b14 fix_roi_transform_bug (#19785)
6 years ago
Zeng Jinle 3fd3b663a8
fix gc bug in controlflow ops, test=develop (#19827)
6 years ago
Leo Chen 982e61f5ff Update elementwise double grad to save gpu memory (#19509)
6 years ago
Adam dfdd73cbc0 Add MKLDNNhandlerT templatized class (#19801)
6 years ago
Zeng Jinle cabb9501bd
fix leaky_relu op when alpha is zero, test=develop (#19833)
6 years ago
chengjuntao 00efd1d8a9
add deformable conv v1 op and cpu version of deformable conv v2 (#18500)
6 years ago
liym27 677e714425 fix pow op, support tensor for agument factor. (#19313)
6 years ago
liym27 bd89a27308 add tensor support for argument shape in reshape op; (#19268)
6 years ago
liym27 88628016b2 add tensor(tensor and tensor in list) support for argument starts and ends in slice op; (#19208)
6 years ago
liym27 e9e3c08777 fix expand op: (#19302)
6 years ago
lvmengsi b76343c3b7
cpu Conv double grad (#19672)
6 years ago
翟飞跃 93c85c930a Implement FusedEmbeddingSeqPoolGradKernel with cblas_saxpy (#19770)
6 years ago
Yiqun Liu c67c8758cb
Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733)
6 years ago
zhongpu 52673956de add kernel for squeeze_op, test=develop (#19656)
6 years ago
zhongpu 2a81c3679a add kernel for unstack_op, test=develop (#19538)
6 years ago
Kaipeng Deng 99c78b772a
fix softmax axis!=-1. test=develop (#19800)
6 years ago
Adam d4413a54bc Add common CreateKey for mkldnn handlers (#19767)
6 years ago
Aurelius84 8c7e411908
Remove constraint that last dimension is forced to be 1 by adding one_hot_v2 (#19716)
6 years ago
Jacek Czaja 9e4c958552 Refactoring activation mkldnn op (#19748)
6 years ago
Huihuang Zheng 12542320c5
Replace TemporaryAllocator by CUDADeviceContextAllocator (#18989)
6 years ago
Zeng Jinle 0daa5c9772
Make leaky relu inplacable (#19676)
6 years ago
Zeng Jinle 078a678219
refine math_op_patch, test=develop (#19727)
6 years ago
Jacek Czaja 47f670d58c - Softmax mkl-dnn refactoring (#19615)
6 years ago
Yiqun Liu a65c728e5d
Implement the GPU kernel of fc operator (#19687)
6 years ago
Aurelius84 22301115d0
Remove constraint that last dimension is forced to be 1 in huber_loss op (#19562)
6 years ago
Tao Luo ec9bc1bd9f
paddle::framework::vectorize() templatization (#19730)
6 years ago
Adam 428b2b9e17 MKLDNN handler cleanup (#19713)
6 years ago
Zeng Jinle 1c25c88aba
refine memory usage of some operators, test=develop (#19700)
6 years ago
wangguanzhong 25dcd74d34
merge empty lod tensor, test=develop (#19228)
6 years ago
yaoxuefeng c6756ed225 fix instag op (#19591)
6 years ago
zhongpu 5f627488db add kernel for unsqueeze_op and Add unsqueezed op test, test=develop (#19436)
6 years ago
Tao Luo f05d2c519d paddle::framework::vectorize() templatization [PART3] (#19643)
6 years ago
hutuxian 1ca6ea0318
fix cmakelist deps (#19668)
6 years ago
Tao Luo bcddbc78d4
remove -Wmaybe-uninitialized warning (#19653)
6 years ago
wangchaochaohu 4440d7ced0
test=develop cuda realization of label smooth op (#19175)
6 years ago
chengduo 31c5a5ee26 Remove linear_chain_crf_op.cu (#19645)
6 years ago
wangchaochaohu ed8f44ea21
codegen for fused elementwise operation (#19520)
6 years ago
Chen Weihang 73daa3d6c0
Code Cleanup: delete three useless raw variables in Conv2D (#19644)
6 years ago