Commit Graph

6818 Commits (f31d515ce3293d95c3e4a01fba789b12f4d21f7f)

Author SHA1 Message Date
dzhwinter 225c11a91f polish cudnn related code and fix bug. (#15164)
6 years ago
ceci3 4b7bf06e1f test=develop
6 years ago
Yiqun Liu 454f4f2140 Rewrite is_empty op to avoid unnecessary data transform. (#15509)
6 years ago
xiaolil1 6724be2b0d INT8 Pool kernel Key Creation Optimization. (#15883)
6 years ago
xiaoli.liu@intel.com c4187dbd7c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dequantize-reuse
6 years ago
Tao Luo d5a888e15c Merge pull request #15943 from kbinias/kbinias/add-placement-pass-tester
6 years ago
Tao Luo ba90e05281 Merge pull request #15917 from jczaja/prv-tensor-mkldnn-ops
6 years ago
mozga-intel 68a9ead17a The flag of mkldnn is enabled iff it is necessary
6 years ago
baojun-nervana e4ab40a7b9 added concat op test=develop
6 years ago
Krzysztof Binias 72253391b6 Add MKL-DNN placement pass tester
6 years ago
minqiyang cb85ee987b Remove var op deps in imperative mode
6 years ago
colourful-tree 7d8f639883 Merge pull request #15902 from colourful-tree/new_develop
6 years ago
Tao Luo effec86600 Merge pull request #15913 from liangan1/func_coverage
6 years ago
Zhen Wang e00c7a2e26 Merge pull request #15830 from wzzju/add_ir_node_encapsulation
6 years ago
tensor-tang 8bc6381546 fix jitcodekey and refine test
6 years ago
tensor-tang 7044cfa7c7 add sgd jitcode and op test
6 years ago
tensor-tang 8e04133719 add benchmark and mkl sgd implement
6 years ago
tensor-tang 07efdb5139 Merge remote-tracking branch 'ups/develop' into jit/sgd
6 years ago
Jacek Czaja c63f6b2039 - MKL-DNN pooling updated to set_prim_desc
6 years ago
shippingwang 3398293272 add API.spec. test=develop
6 years ago
shippingwang 5ce46c637a fix api.spec, test=develop
6 years ago
qingqing01 8e439ccfff Fix bug in fake_quantize_op and add more unit testing (#15912)
6 years ago
qingqing01 f4846bf3dc loosly check in the InferShape of cross_entropy_op. (#15863)
6 years ago
minqiyang 28077c4da6 Add gperftools into imperative tracer
6 years ago
Yihua Xu 7396788694 Optimize gelu operation with mkl erf.
6 years ago
xiaoli.liu@intel.com 70759d181b Optimize INT8 DeQuantize Op with primitive reuse.
6 years ago
minqiyang efb2f2baf8 Fix bugs
6 years ago
Yiqun Liu f4634d76d7 Optimize the CUDA implementation of sequence_expand op by reduce the times of copying lod data from CPU to GPU. (#15493)
6 years ago
Tao Luo 60546b78cc Merge pull request #15923 from Sand3r-/mgallus/conv-residual-ut
6 years ago
guomingz 630c1e8317 This PR improve performance of prior_box op about 1.25x faster on CPU. (#15909)
6 years ago
Tao Luo 9c05421c97 Merge pull request #15914 from Sand3r-/mgallus/mkldnn-sum-code-reuse
6 years ago
chengduo 7ca8553d4e Add alloc_continuous_space_op (#15900)
6 years ago
wopeizl 2192c46436 Merge pull request #15916 from wopeizl/win/fixevent1
6 years ago
baojun-nervana 2ffacdebc2 Update ngraph version to v0.14 test=develop
6 years ago
Michal Gallus 6a2bc9a275 Add Conv Residual Connection UT for Projection
6 years ago
Zhen Wang 548931456c update some functions' names according to the suggestion. test=develop
6 years ago
Michal Gallus 6ebe9877bb Improve code reuse at MKL-DNN sum
6 years ago
dzhwinter 660e410655 Merge pull request #15855 from dzhwinter/fix/nightly_test
6 years ago
peizhilin c6472579c0 test=develop
6 years ago
peizhilin b5d6e38b05 fix build issue for cudaEvent_t
6 years ago
minqiyang b420ec3a92 invoke backward_hooks after reduce op's depcounts map
6 years ago
Qiyang Min 4bd28b304b Merge pull request #15831 from velconia/imperative_engine
6 years ago
Xin Pan a6e3cd5eb7 Merge pull request #15425 from panyx0718/api
6 years ago
liangan1 4acc522087 Enable function coverage for U8/S8 ConvMKLDNNOpKernel
6 years ago
wopeizl 3ccd8964a4 Merge pull request #15905 from wopeizl/win/fix_eigen
6 years ago
chengduo 8e904d322f Remove unnecessary dependence for profiler (#15899)
6 years ago
Zhen Wang 9261cf39db update with develop. test=develop
6 years ago
Zhen Wang 0bf809c9b3 add set_attr for IrOpNode. test=develop
6 years ago
qingqing01 d8128930ef Refine doc of uniform_random and fix dtype (#15873)
6 years ago
Xin Pan 44e7fcddc5 Merge pull request #15844 from panyx0718/infer
6 years ago
dzhwinter a71f2fbe4f fix default value. test=develop
6 years ago
Jacek Czaja dec9cf53c8 [MKL-DNN] MKL-DNN specific Tensor modification (#15429)
6 years ago
heqiaozhi 08c96d1b48 remove mkldnn & fix commit
6 years ago
minqiyang 84bf4d7b06 Move ClearBlock into OpBase and VarBase's destructor
6 years ago
minqiyang 2b3510bc50 Add imperative python tracer
6 years ago
minqiyang e9fdf9090d Polish code
6 years ago
Xin Pan 5dd281f738 polish
6 years ago
heqiaozhi 725b98f2c2 remove mkldnn & fix commit
6 years ago
peizhilin 6ccdb1b947 fix build issue on windows for sample prop op
6 years ago
heqiaozhi fab09ac0b8 Merge branch 'new_develop' of https://github.com/colourful-tree/Paddle into new_develop
6 years ago
heqiaozhi da4f5a2f18 remove mkl & fix commit
6 years ago
colourful-tree f2d6473ef8 Merge branch 'develop' into new_develop
6 years ago
heqiaozhi 04f876f5bc remove mkl & fix commit
6 years ago
dengkaipeng 373cfb0ccf use kernel size in global_pooling. test=develop
6 years ago
dengkaipeng 60305196b8 fix spell mistakes. test=develop
6 years ago
Dun c6bd434ffe add memset CUPTI && test=develop (#15868)
6 years ago
乔龙飞 Qiao Longfei ec8e878200 Merge pull request #15840 from jacquesqiao/revert-15684-revert-15661-fix-cpu-broadcast
6 years ago
minqiyang a15a3fc314 Polish code
6 years ago
Tao Luo 8a7efc78f1 Merge pull request #15882 from sfraczek/unique_ptr_dereference
6 years ago
tensor-tang a0c37662b9 enable sgd jitkernel refer code and test
6 years ago
xuezhong 1dad36f6aa Merge pull request #15609 from xuezhong/add_sample_logits_op
6 years ago
Qiao Longfei 2b7931d5c9 refine code test=develop
6 years ago
Kaipeng Deng 9e524a7b51 Merge pull request #15870 from heavengate/fix_adaptive_pool_doc
6 years ago
Xin Pan 8d83e38a6b remove mutex
6 years ago
shippingwang eb932f717a add cosine decay op, test=develop
6 years ago
Xin Pan 0362ef75f4 fix
6 years ago
dengkaipeng 14df92fe8f fix spell error. test=develop
6 years ago
dengkaipeng 144016fcfc fix adaptive_pool and yolov3_loss. test=develop
6 years ago
minqiyang 9dc64edfd9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_engine
6 years ago
Xin Pan 12a0e2ed9d polish codes
6 years ago
Xin Pan 19d78f6797 polish
6 years ago
Sylwester Fraczek 74672d1aff Change *(smart_ptr.get()) -> *smart_ptr
6 years ago
tensor-tang ee2321debd Revert 15770 develop a6910f900 gelu mkl opt (#15872)
6 years ago
xuezhong 81870723c6 Merge pull request #15605 from xuezhong/fix_bug_for_lstmp
6 years ago
dengkaipeng eb65b4e47d \frac -> \frac. test=develop
6 years ago
chengduo 3b08c9abf4 enhance profiler (#15842)
6 years ago
dengkaipeng 8167588f14 add blank after math::. test=develop
6 years ago
Xin Pan 32d5a16036 resolve conflicts
6 years ago
Qiao Longfei 3f9263f67e optimize style test=develop
6 years ago
dengkaipeng d9ec605873 use math:: instead of 29. test=develop
6 years ago
Qiao Longfei 4233d0a820 add more comment test=develop
6 years ago
dengkaipeng 19292ac6a1 fix adaptive pool doc.test=develop
6 years ago
Yiqun Liu 7d96c74ab2 Initialize the benchmark tester for operator. (#15772)
6 years ago
Yihua Xu 676995c86c Optimze Gelu with MKL Erf function (#15770)
6 years ago
Michał Gallus c4faf36e7a MKL-DNN: Add test for conv bias fuse pass (#15824)
6 years ago
Qiao Longfei 3bccc1e6e2 optimize broadcast logic test=develop
6 years ago
mozga-intel 5d132ecf83 Auto-cmake generator, auto-fill map (#15402)
6 years ago
Tao Luo 3831a4695d Merge pull request #15862 from sfraczek/add-override-to-apply_impl
6 years ago
Tao Luo 075b5502b0 Merge pull request #15856 from kbinias/kbinias/activation-mkldnn-ut
6 years ago
Tao Luo 4774dad806 Merge pull request #15857 from sfraczek/fix-typo
6 years ago
Tao Luo e3dd6970fc disable dam temporarily (#15860)
6 years ago
Xin Pan 26e32e095a allow compiler to use graph
6 years ago
Sylwester Fraczek 1943119fc5 fix typo memeroy->memory
6 years ago
Sylwester Fraczek 0b926114c0 add override to ApplyImpl
6 years ago
minqiyang 8fe0c0c52c implement backward refs
6 years ago
Sylwester Fraczek 8bc604571f fix typo seriazlized->serialized
6 years ago
Sylwester Fraczek 543e53db05 fix typo releated->related
6 years ago
Krzysztof Binias 1578c60bdd Add new ut and remove unnecessary code
6 years ago
Dun Liang 35a90e06bf test=develop
6 years ago
Dun Liang c9080f516b test=develop
6 years ago
Dun Liang 1c7bb0e40c test=develop
6 years ago
Xin Pan 5eb87506bc add per kernel config and remove const_cast.
6 years ago
Qiao Longfei 62f1248ff5 fix use gpu test=develop
6 years ago
xuezhong a7e7d95203 Merge pull request #15630 from xuezhong/fix_bug_adagrad2
6 years ago
Xin Pan 6019054cdd Merge pull request #15716 from Yancey1989/refine_pg
6 years ago
dzhwinter 98ec579d79 Merge pull request #15786 from liuwei1031/develop
6 years ago
Dun a83e470405 Profiler refine and add CUDA runtime api tracer (#15301)
6 years ago
minqiyang 74551758cc Polish code
6 years ago
Zeng Jinle c420375185 Merge pull request #15806 from sneaxiy/fix-compiler
6 years ago
minqiyang f53e1d5c4b implement ClearBlock
6 years ago
mozga-intel 13ec2d331b Enable momentum operator for a ngraph engine (#15673)
6 years ago
Yiqun Liu e38dd91f04 Refine cmake's download function. (#15512)
6 years ago
xuezhong eb7bc3e7ea remove non-ascii charactor
6 years ago
Tao Luo 50ffed27f6 Merge pull request #15813 from luotao1/legacy_any
6 years ago
Tao Luo b28afef73e Merge pull request #15801 from luotao1/legacy_option
6 years ago
sneaxiy d331e97af8 fix compiler place compare
6 years ago
Tao Luo c797a1f050 remove legacy any.cmake
6 years ago
dzhwinter f2e8409f5a Merge pull request #15795 from dzhwinter/fix/block_desc
6 years ago
Tao Luo bd2fa73620 Merge pull request #15794 from sneaxiy/fix-warnings
6 years ago
Tao Luo b9d1bf2364 remove leacy WITH_FLUID_ONLY option
6 years ago
xuezhong 6b83845c41 update for backward compatibility
6 years ago
tensor-tang e1c707fe9c fix warnings (#15790)
6 years ago
xuezhong d328660304 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_for_lstmp
6 years ago
xuezhong 57294fa890 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_bug_adagrad
6 years ago
xuezhong 794b90c93f for backward compatibility
6 years ago
xuezhong f2262d7336 update comment
6 years ago
Tao Luo 6402424f7a Merge pull request #15773 from chengduoZH/fix_shape_api_doc
6 years ago
xuezhong ef44f1b81d update api spec
6 years ago
xuezhong d12252e6a6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sample_logits_op
6 years ago
sneaxiy 9b8e0e2f17 fix enforce_test
6 years ago
xuezhong c5360a3f6b refine code
6 years ago
Yancey1989 4b193db14c polish code test=develop
6 years ago
dzhwinter 6deb17ed8c fix default value. test=develop
6 years ago
dzhwinter 089d262c41 fix default value. test=develop
6 years ago
dzhwinter 9c92d0304f fix default value. test=develop
6 years ago
sneaxiy 209b355762 fix many warning
6 years ago
Yancey1989 d5090c892d polish code test=develop
6 years ago
tensor-tang 5aea2cd2e0 Merge pull request #15652 from tensor-tang/refine/pyramiddnn
6 years ago
liuwei1031 4c7b6e2e67 fix comment, test=develop
6 years ago
Zeng Jinle adea672b0e Merge pull request #15783 from sneaxiy/develop-fix-api-arg0
6 years ago
mozga-intel df23a6f894 Enable cross_entropy operator for a ngraph engine (#15674)
6 years ago
liuwei1031 b20a21e299 fix comments of PR 15529, test=develop
6 years ago
Yiqun Liu 56a5039e24 Correct the doc in Python API (#15725)
6 years ago
sneaxiy e6ff549849 small fix doc
6 years ago
sneaxiy 796e221efc fix api arg0
6 years ago
baojun 72061b0ac0 Add ngraph op coverage (#15721)
6 years ago
chengduozh d79d2f686c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_shape_api_doc
6 years ago
dzhwinter 28609b3435 Merge pull request #15696 from dzhwinter/cherry-pick/memory
6 years ago
xuezhong 4424021623 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_sample_logits_op
6 years ago
Yihua Xu 685a20ef56 Add JIT CRF_decoding and Layer_norm unit-test (#15699)
6 years ago
tensor-tang 75fc792d40 fix when table width larger than 64
6 years ago
tensor-tang 40402d5e68 add emb seqpool jitcode
6 years ago
tensor-tang 2ccbcb157d Merge remote-tracking branch 'ups/develop' into refine/pyramiddnn
6 years ago
chengduozh 3ce12b1b8e fix shape api doc
6 years ago
minqiyang 52e5ee60bd Add debug info
6 years ago
Dun 5e6834d891 inplace group_norm (#15754)
6 years ago
Yancey1989 0f8bd73cc9 cleanup code test=develop
6 years ago
Yancey1989 5677c9d4ee update comment test=develop
6 years ago
Yancey1989 642fd68ce0 update by comment test=develop
6 years ago
dzhwinter d94a314db5 add reference. test=develop
6 years ago
dzhwinter 591ad33e32 polish code for reading. test=develop
6 years ago
dzhwinter 18afb77e78 polish code for reading. test=develop
6 years ago
Yan Chunwei 077d12b939 fix scale cleaner (#15742)
6 years ago
dzhwinter 684b572307 polish code for reading. test=develop
6 years ago
Tao Luo daab5ad77e Merge pull request #15719 from wojtuss/wojtuss/remove-duplicated-code
6 years ago
Hongyu Liu 8c0292dead Merge pull request #15717 from phlrain/fix_leak
6 years ago
Zeng Jinle fc87ef741b Merge pull request #15687 from sneaxiy/fix_enforce
6 years ago
dzhwinter 3787e61fca polish code for reading. test=develop
6 years ago
dzhwinter c1455e606d Merge remote-tracking branch 'origin/develop' into cherry-pick/memory
6 years ago
dzhwinter d376cf71b7 polish code for reading. test=develop
6 years ago
Xin Pan 69f5c0eee2 Merge pull request #15557 from panyx0718/imperative
6 years ago
Zhen Wang 832bd720d1 Merge pull request #15610 from wzzju/quantization_inference_passes
6 years ago
Tao Luo 4da291c6a3 Merge pull request #15726 from qingqing01/fix_api_doc
6 years ago
Dun e4b9fcdbd2 More restrict check load_combine_op. (#15479)
6 years ago
qingqing01 48a5cccbcd Fix debug mode in prior_box_op (#15702)
6 years ago
Dang Qingqing 2868232556 Fix row_conv doc
6 years ago
Xin Pan 989138378d add sugar for fetching parameters
6 years ago
tensor-tang a3a3d3d861 add embseqpool jitkernel mkl impl and use it
6 years ago
tensor-tang 15da2f9a0d add embseqpool jitkernel refer code, test and benchmark
6 years ago
tensor-tang c2ccf14590 Merge remote-tracking branch 'ups/develop' into refine/pyramiddnn
6 years ago
qingqing01 abcefe7211 Fix debug mode in fake_quantize_op (#15693)
6 years ago
Yancey1989 7cd6de37f5 fix cpu test=develop
6 years ago
Yancey1989 bd0d44af24 fix build failed test=develop
6 years ago
liuhongyu 029be5fda9 fix lstmp bug; test=develop
6 years ago
Yancey1989 ecdd1166b8 cleanup code test=develop
6 years ago
Yancey1989 73005ee00d cleanup code test=develop
6 years ago
liuhongyu 393fa6021e set lstm lstmp unsed pointer to nullptr; test=develop
6 years ago
Yancey1989 88d3dc949e Merge branch 'develop' of github.com:PaddlePaddle/Paddle into refine_pg
6 years ago
liuhongyu 869f00ffc6 set lstm lstmp unsed pointer to null
6 years ago
Yancey1989 f3463ecb6e refine pg execution
6 years ago