Commit Graph

3729 Commits (a7a4f053dacd028469c0fd9a2a9e6e54eb3fa55d)

Author SHA1 Message Date
jerrywgz 41471d28ac add box_coder_and_assign, test=develop
6 years ago
lidanqing 02c106c717 MKLDNN: Add UT for conv_transpose_mkldnn op. (#16030)
6 years ago
dengkaipeng 3eab9e4b95 fix statement. test=develop
6 years ago
dengkaipeng e37f5ab5b1 fix API.spec. test=develop
6 years ago
dengkaipeng 54bbbfa71f fix doc statement. test=develop
6 years ago
dengkaipeng c1a69e3ea0 refine doc. test=develop
6 years ago
dengkaipeng 65d375a09f fix format. test=develop
6 years ago
dengkaipeng 82d514345c fix spectral_norm doc. test=develop
6 years ago
dengkaipeng 2ea5843cbf add doc and test_layers. test=develop
6 years ago
dengkaipeng 037855f42d fix attr dim calc. test=develop
6 years ago
dengkaipeng 70dbd59839 add grad kernel for spectral_norm. test=develop
6 years ago
dengkaipeng 72509ec3bd add unittest for spectral_norm. test=develop
6 years ago
dengkaipeng 3bf1ae9b59 add spectral_norm forwarn kenel
6 years ago
Zhen Wang 545247d7b4 add channel wise quantize op.
6 years ago
tensor-tang b16dabd7e0 refine vbroadcast jitcode
6 years ago
tensor-tang c2e56e6bbc Merge remote-tracking branch 'ups/develop' into op/embgrad
6 years ago
chengduo e2da3a5b22
Revert "Add Event for TensorCopy" (#16022)
6 years ago
baojun 9aaea38c0a fix cpplint test=develop (#16028)
6 years ago
chengduo 7235fd662b
Add Event for TensorCopy (#15953)
6 years ago
Tink_Y 31d830de9f refine image_resize annotation (#15976)
6 years ago
tensor-tang 641b3cccce add vbroadcast mkl code and jitcode
6 years ago
tensor-tang 41a1270856 add vbroadcast jitkernel refer code and use it
6 years ago
tensor-tang 867e93b21a add jitkernel vcopy and speedup unit test time
6 years ago
jerrywgz c31da7899a refine code, test=develop
6 years ago
Yiqun Liu 798925453e
Revert "Optimize while_op when is_test is true. (#15811)" (#15968)
6 years ago
Yiqun Liu 87248281f7
Fix error in CUDA kernel of beam_search. (#15957)
6 years ago
jerrywgz e8a8fe07e7 fix code for windows CI, test=develop
6 years ago
jerrywgz 149411762a add gpu kernel, test=develop
6 years ago
Tao Luo 4efdebc6f6
Merge pull request #15931 from yihuaxu/develop_2c5c7b2a7_gelu_mkl_opt
6 years ago
tensor-tang e5f9d3a47c
Merge pull request #15892 from tensor-tang/jit/sgd
6 years ago
Tao Luo e6bab55f1b
Merge pull request #15959 from luotao1/infershape_refine
6 years ago
Yiqun Liu 613d9d0756
Optimize while_op when is_test is true. (#15811)
6 years ago
xiaolil1 1abddd8d97 Optimize Quantize Op with primitive reuse. (#15929)
6 years ago
Tao Luo 7ec97a0a7e
Merge pull request #15930 from xiaolil1/dequantize-reuse
6 years ago
nhzlx 2eff3e26b6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_static_model_load_for_trt
6 years ago
nhzlx 06a088a199 fix comments and fix cpplint
6 years ago
luotao1 34404f9c31 refine infershape of sequence_enumerate, hash and fuse_emb_seq_pool
6 years ago
baojun f285191fb3 Added adam op test=develop (#15710)
6 years ago
jerrywgz b92ef45fe9
Merge pull request #15678 from jerrywgz/refine_softmax_with_cross_entropy
6 years ago
mozga-intel 558f94cd77 Register sum operator (#15889)
6 years ago
tensor-tang 58b8231338
added concat op test=develop (#15946)
6 years ago
Tao Luo 47d36b2008
Merge pull request #15924 from baojun-nervana/ngraph_v14
6 years ago
jerrywgz 0f652f304c add distribute fpn proposals op, test=develop
6 years ago
dzhwinter 225c11a91f polish cudnn related code and fix bug. (#15164)
6 years ago
Yiqun Liu 454f4f2140
Rewrite is_empty op to avoid unnecessary data transform. (#15509)
6 years ago
xiaolil1 6724be2b0d INT8 Pool kernel Key Creation Optimization. (#15883)
6 years ago
xiaoli.liu@intel.com c4187dbd7c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dequantize-reuse
6 years ago
Tao Luo ba90e05281
Merge pull request #15917 from jczaja/prv-tensor-mkldnn-ops
6 years ago
baojun-nervana e4ab40a7b9 added concat op test=develop
6 years ago
colourful-tree 7d8f639883
Merge pull request #15902 from colourful-tree/new_develop
6 years ago
Tao Luo effec86600
Merge pull request #15913 from liangan1/func_coverage
6 years ago
tensor-tang 8bc6381546 fix jitcodekey and refine test
6 years ago
tensor-tang 7044cfa7c7 add sgd jitcode and op test
6 years ago
tensor-tang 8e04133719 add benchmark and mkl sgd implement
6 years ago
tensor-tang 07efdb5139 Merge remote-tracking branch 'ups/develop' into jit/sgd
6 years ago
Jacek Czaja c63f6b2039 - MKL-DNN pooling updated to set_prim_desc
6 years ago
qingqing01 8e439ccfff
Fix bug in fake_quantize_op and add more unit testing (#15912)
6 years ago
qingqing01 f4846bf3dc
loosly check in the InferShape of cross_entropy_op. (#15863)
6 years ago
Yihua Xu 7396788694 Optimize gelu operation with mkl erf.
6 years ago
nhzlx 0ed63b2108 6. delete useless predictor id
6 years ago
xiaoli.liu@intel.com 70759d181b Optimize INT8 DeQuantize Op with primitive reuse.
6 years ago
Yiqun Liu f4634d76d7
Optimize the CUDA implementation of sequence_expand op by reduce the times of copying lod data from CPU to GPU. (#15493)
6 years ago
guomingz 630c1e8317 This PR improve performance of prior_box op about 1.25x faster on CPU. (#15909)
6 years ago
Tao Luo 9c05421c97
Merge pull request #15914 from Sand3r-/mgallus/mkldnn-sum-code-reuse
6 years ago
chengduo 7ca8553d4e
Add alloc_continuous_space_op (#15900)
6 years ago
baojun-nervana 2ffacdebc2 Update ngraph version to v0.14 test=develop
6 years ago
Michal Gallus 6ebe9877bb Improve code reuse at MKL-DNN sum
6 years ago
liangan1 4acc522087 Enable function coverage for U8/S8 ConvMKLDNNOpKernel
6 years ago
Xin Pan 44e7fcddc5
Merge pull request #15844 from panyx0718/infer
6 years ago
Jacek Czaja dec9cf53c8 [MKL-DNN] MKL-DNN specific Tensor modification (#15429)
6 years ago
heqiaozhi 08c96d1b48 remove mkldnn & fix commit
6 years ago
Xin Pan 5dd281f738 polish
6 years ago
heqiaozhi fab09ac0b8 Merge branch 'new_develop' of https://github.com/colourful-tree/Paddle into new_develop
6 years ago
heqiaozhi da4f5a2f18 remove mkl & fix commit
6 years ago
colourful-tree f2d6473ef8
Merge branch 'develop' into new_develop
6 years ago
heqiaozhi 04f876f5bc remove mkl & fix commit
6 years ago
dengkaipeng 373cfb0ccf use kernel size in global_pooling. test=develop
6 years ago
dengkaipeng 60305196b8 fix spell mistakes. test=develop
6 years ago
Tao Luo 8a7efc78f1
Merge pull request #15882 from sfraczek/unique_ptr_dereference
6 years ago
tensor-tang a0c37662b9 enable sgd jitkernel refer code and test
6 years ago
xuezhong 1dad36f6aa
Merge pull request #15609 from xuezhong/add_sample_logits_op
6 years ago
Kaipeng Deng 9e524a7b51
Merge pull request #15870 from heavengate/fix_adaptive_pool_doc
6 years ago
dengkaipeng 14df92fe8f fix spell error. test=develop
6 years ago
dengkaipeng 144016fcfc fix adaptive_pool and yolov3_loss. test=develop
6 years ago
Sylwester Fraczek 74672d1aff Change *(smart_ptr.get()) -> *smart_ptr
6 years ago
tensor-tang ee2321debd
Revert 15770 develop a6910f900 gelu mkl opt (#15872)
6 years ago
xuezhong 81870723c6
Merge pull request #15605 from xuezhong/fix_bug_for_lstmp
6 years ago
dengkaipeng eb65b4e47d \frac -> \frac. test=develop
6 years ago
nhzlx 1d5ef7c9ee 5. add static trt load model
6 years ago
dengkaipeng 8167588f14 add blank after math::. test=develop
6 years ago
dengkaipeng d9ec605873 use math:: instead of 29. test=develop
6 years ago
dengkaipeng 19292ac6a1 fix adaptive pool doc.test=develop
6 years ago
Yiqun Liu 7d96c74ab2
Initialize the benchmark tester for operator. (#15772)
6 years ago
Yihua Xu 676995c86c Optimze Gelu with MKL Erf function (#15770)
6 years ago
mozga-intel 5d132ecf83 Auto-cmake generator, auto-fill map (#15402)
6 years ago
Krzysztof Binias 1578c60bdd Add new ut and remove unnecessary code
6 years ago
Xin Pan 5eb87506bc add per kernel config and remove const_cast.
6 years ago
Dun a83e470405
Profiler refine and add CUDA runtime api tracer (#15301)
6 years ago
mozga-intel 13ec2d331b Enable momentum operator for a ngraph engine (#15673)
6 years ago
xuezhong eb7bc3e7ea remove non-ascii charactor
6 years ago