Commit Graph

3706 Commits (065b68b6ca53b3eb140a9f3ebe95b8cdd856fef4)

Author SHA1 Message Date
dzhwinter 225c11a91f polish cudnn related code and fix bug. (#15164)
7 years ago
Yiqun Liu 454f4f2140
Rewrite is_empty op to avoid unnecessary data transform. (#15509)
7 years ago
xiaolil1 6724be2b0d INT8 Pool kernel Key Creation Optimization. (#15883)
7 years ago
xiaoli.liu@intel.com c4187dbd7c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dequantize-reuse
7 years ago
Tao Luo ba90e05281
Merge pull request #15917 from jczaja/prv-tensor-mkldnn-ops
7 years ago
Qiao Longfei 02425b2f64 fix compile
7 years ago
baojun-nervana e4ab40a7b9 added concat op test=develop
7 years ago
colourful-tree 7d8f639883
Merge pull request #15902 from colourful-tree/new_develop
7 years ago
Tao Luo effec86600
Merge pull request #15913 from liangan1/func_coverage
7 years ago
tensor-tang 8bc6381546 fix jitcodekey and refine test
7 years ago
tensor-tang 7044cfa7c7 add sgd jitcode and op test
7 years ago
tensor-tang 8e04133719 add benchmark and mkl sgd implement
7 years ago
tensor-tang 07efdb5139 Merge remote-tracking branch 'ups/develop' into jit/sgd
7 years ago
Jacek Czaja c63f6b2039 - MKL-DNN pooling updated to set_prim_desc
7 years ago
qingqing01 8e439ccfff
Fix bug in fake_quantize_op and add more unit testing (#15912)
7 years ago
qingqing01 f4846bf3dc
loosly check in the InferShape of cross_entropy_op. (#15863)
7 years ago
Qiao Longfei 49f2f4f91d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator
7 years ago
Qiao Longfei f768fbf715 support multi graph
7 years ago
Yihua Xu 7396788694 Optimize gelu operation with mkl erf.
7 years ago
xiaoli.liu@intel.com 70759d181b Optimize INT8 DeQuantize Op with primitive reuse.
7 years ago
Yiqun Liu f4634d76d7
Optimize the CUDA implementation of sequence_expand op by reduce the times of copying lod data from CPU to GPU. (#15493)
7 years ago
guomingz 630c1e8317 This PR improve performance of prior_box op about 1.25x faster on CPU. (#15909)
7 years ago
Tao Luo 9c05421c97
Merge pull request #15914 from Sand3r-/mgallus/mkldnn-sum-code-reuse
7 years ago
chengduo 7ca8553d4e
Add alloc_continuous_space_op (#15900)
7 years ago
baojun-nervana 2ffacdebc2 Update ngraph version to v0.14 test=develop
7 years ago
Qiao Longfei cf0511f21e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
7 years ago
Michal Gallus 6ebe9877bb Improve code reuse at MKL-DNN sum
7 years ago
Qiao Longfei b8491bfd4e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-communicator
7 years ago
liangan1 4acc522087 Enable function coverage for U8/S8 ConvMKLDNNOpKernel
7 years ago
Xin Pan 44e7fcddc5
Merge pull request #15844 from panyx0718/infer
7 years ago
Jacek Czaja dec9cf53c8 [MKL-DNN] MKL-DNN specific Tensor modification (#15429)
7 years ago
heqiaozhi 08c96d1b48 remove mkldnn & fix commit
7 years ago
Xin Pan 5dd281f738 polish
7 years ago
heqiaozhi fab09ac0b8 Merge branch 'new_develop' of https://github.com/colourful-tree/Paddle into new_develop
7 years ago
heqiaozhi da4f5a2f18 remove mkl & fix commit
7 years ago
colourful-tree f2d6473ef8
Merge branch 'develop' into new_develop
7 years ago
heqiaozhi 04f876f5bc remove mkl & fix commit
7 years ago
dengkaipeng 373cfb0ccf use kernel size in global_pooling. test=develop
7 years ago
dengkaipeng 60305196b8 fix spell mistakes. test=develop
7 years ago
Tao Luo 8a7efc78f1
Merge pull request #15882 from sfraczek/unique_ptr_dereference
7 years ago
tensor-tang a0c37662b9 enable sgd jitkernel refer code and test
7 years ago
xuezhong 1dad36f6aa
Merge pull request #15609 from xuezhong/add_sample_logits_op
7 years ago
Kaipeng Deng 9e524a7b51
Merge pull request #15870 from heavengate/fix_adaptive_pool_doc
7 years ago
dengkaipeng 14df92fe8f fix spell error. test=develop
7 years ago
dengkaipeng 144016fcfc fix adaptive_pool and yolov3_loss. test=develop
7 years ago
Sylwester Fraczek 74672d1aff Change *(smart_ptr.get()) -> *smart_ptr
7 years ago
tensor-tang ee2321debd
Revert 15770 develop a6910f900 gelu mkl opt (#15872)
7 years ago
xuezhong 81870723c6
Merge pull request #15605 from xuezhong/fix_bug_for_lstmp
7 years ago
dengkaipeng eb65b4e47d \frac -> \frac. test=develop
7 years ago
dengkaipeng 8167588f14 add blank after math::. test=develop
7 years ago
dengkaipeng d9ec605873 use math:: instead of 29. test=develop
7 years ago
dengkaipeng 19292ac6a1 fix adaptive pool doc.test=develop
7 years ago
Yiqun Liu 7d96c74ab2
Initialize the benchmark tester for operator. (#15772)
7 years ago
Yihua Xu 676995c86c Optimze Gelu with MKL Erf function (#15770)
7 years ago
mozga-intel 5d132ecf83 Auto-cmake generator, auto-fill map (#15402)
7 years ago
Krzysztof Binias 1578c60bdd Add new ut and remove unnecessary code
7 years ago
Xin Pan 5eb87506bc add per kernel config and remove const_cast.
7 years ago
Qiao Longfei 31a05d3efd Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
7 years ago
Dun a83e470405
Profiler refine and add CUDA runtime api tracer (#15301)
7 years ago
Qiao Longfei cc71e89499 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-ssa-graph-executor
7 years ago
mozga-intel 13ec2d331b Enable momentum operator for a ngraph engine (#15673)
7 years ago
xuezhong eb7bc3e7ea remove non-ascii charactor
7 years ago
tensor-tang e1c707fe9c
fix warnings (#15790)
7 years ago
xuezhong d328660304 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp
7 years ago
xuezhong f2262d7336 update comment
7 years ago
Tao Luo 6402424f7a
Merge pull request #15773 from chengduoZH/fix_shape_api_doc
7 years ago
xuezhong d12252e6a6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sample_logits_op
7 years ago
xuezhong c5360a3f6b refine code
7 years ago
tensor-tang 5aea2cd2e0
Merge pull request #15652 from tensor-tang/refine/pyramiddnn
7 years ago
mozga-intel df23a6f894 Enable cross_entropy operator for a ngraph engine (#15674)
7 years ago
Yiqun Liu 56a5039e24
Correct the doc in Python API (#15725)
7 years ago
baojun 72061b0ac0 Add ngraph op coverage (#15721)
7 years ago
chengduozh d79d2f686c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_shape_api_doc
7 years ago
xuezhong 4424021623 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sample_logits_op
7 years ago
Yihua Xu 685a20ef56 Add JIT CRF_decoding and Layer_norm unit-test (#15699)
7 years ago
tensor-tang 75fc792d40 fix when table width larger than 64
7 years ago
tensor-tang 40402d5e68 add emb seqpool jitcode
7 years ago
tensor-tang 2ccbcb157d Merge remote-tracking branch 'ups/develop' into refine/pyramiddnn
7 years ago
chengduozh 3ce12b1b8e fix shape api doc
7 years ago
Dun 5e6834d891
inplace group_norm (#15754)
7 years ago
Hongyu Liu 8c0292dead
Merge pull request #15717 from phlrain/fix_leak
7 years ago
Tao Luo 4da291c6a3
Merge pull request #15726 from qingqing01/fix_api_doc
7 years ago
Dun e4b9fcdbd2
More restrict check load_combine_op. (#15479)
7 years ago
qingqing01 48a5cccbcd
Fix debug mode in prior_box_op (#15702)
7 years ago
Dang Qingqing 2868232556 Fix row_conv doc
7 years ago
tensor-tang a3a3d3d861 add embseqpool jitkernel mkl impl and use it
7 years ago
tensor-tang 15da2f9a0d add embseqpool jitkernel refer code, test and benchmark
7 years ago
tensor-tang c2ccf14590 Merge remote-tracking branch 'ups/develop' into refine/pyramiddnn
7 years ago
qingqing01 abcefe7211
Fix debug mode in fake_quantize_op (#15693)
7 years ago
liuhongyu 029be5fda9 fix lstmp bug; test=develop
7 years ago
liuhongyu 393fa6021e set lstm lstmp unsed pointer to nullptr; test=develop
7 years ago
liuhongyu 869f00ffc6 set lstm lstmp unsed pointer to null
7 years ago
jerrywgz 6f11f35abe
Merge pull request #15703 from jerrywgz/enhance_expand_op
7 years ago
Tao Luo 3086502522
Merge pull request #15704 from Sand3r-/mgallus/old-fc-mkldnn-branch-fix-develop
7 years ago
baojun c47e258ea4 Add ngraph sum, sigmoid, relu_grad and tanh_grad op (#15642)
7 years ago
tensor-tang 33d0cebbff
Merge pull request #15695 from tensor-tang/fix/name
7 years ago
Michal Gallus 7a8eff36a6 Fix old FC backward weights descriptor creation
7 years ago
chengduo ad61e1b22c
fix potential bug (#15688)
7 years ago
dzhwinter f9ac88e1a0
Merge pull request #15694 from liuwei1031/fix_security_issue
7 years ago
jerrywgz 8fc0fc314a support multiple var types for expand op, test=develop
7 years ago