Commit Graph

11905 Commits (a8d3aaae2a648ee552d60869fc5117e61d4ce1b0)

Author SHA1 Message Date
Xin Pan 5839e3236b add program check
7 years ago
gmcather ba22624d7e position encoding && log loss
7 years ago
Tao Luo 3a96d41d72 remove with_inference option
7 years ago
sneaxiy 2494ca83ab test=develop
7 years ago
dzhwinter ebfe5a02b3 merge develop branch
7 years ago
JiabinYang 7c45e77c41 test=develop
7 years ago
barrierye b5f78ce42d update paddle/fluid/API.spec
7 years ago
qingqing01 cb27a9219d
Merge pull request #13971 from sefira/FasterOpDoc
7 years ago
sneaxiy 5e5d2223a1 test=develop
7 years ago
tensor-tang 3c957af139
Merge pull request #14080 from tensor-tang/refine/jit/crf2
7 years ago
Xin Pan aa87a989ec
Merge pull request #14119 from Superjomn/fix/api-impl-tester
7 years ago
barrierye 5f3acac9b3 update paddle/fluid/API.spec
7 years ago
Xin Pan 9ef19d4919
Merge pull request #14106 from luotao1/fix_cmake_warning
7 years ago
sneaxiy f2eed667c0 test=develop
7 years ago
Xin Pan 16dfedb8b8
Merge pull request #14103 from jacquesqiao/cpu-for-1.1-merge-with-shape
7 years ago
sneaxiy cef8cc81db merge develop
7 years ago
Jacek Czaja 458b16f42a Rebase of seqpool-max optimization
7 years ago
superjomn 5f7fda0b07 disable some tests
7 years ago
dengkaipeng ff6329bd5f fix some inappropriate expressions in api doc for grid_sampler. test=develop
7 years ago
Tao Luo d3534d2b14 refine warning message
7 years ago
Xin Pan 177720a737
Merge pull request #14116 from chengduoZH/release/1.1.0
7 years ago
chengduozh acec4cb8ca [1.1]fix op_role value
7 years ago
barrierye 73671379cd update paddle/fluid/API.spec
7 years ago
dengkaipeng 8f1e398824 move param exclusive to the last in pool2d/pool3d for forward compatibility:. test=develop
7 years ago
dengkaipeng 593e1b18d7 fix some bugs and add some doc for GridSampleOp
7 years ago
dengkaipeng 0bb0e0c10f add Grid Sampler Operator for STN.
7 years ago
Qiao Longfei 3d4e050802 fix compile, optimize code test=develop
7 years ago
Yu Yang c01696f8c2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
Qiao Longfei d26ff8cb2d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpu-for-1.1-merge-with-shape
7 years ago
JiabinYang e0a89503f8 test=develop
7 years ago
JiabinYang 0e3038680b test=develop
7 years ago
Tao Luo 79da263b11
Merge pull request #14032 from sfraczek/sfraczek/fix-test-multithreading-mkldnn
7 years ago
Wu Yi 26200f2e42
[1.1] [project] train imagenet using large batch size (#13766)
7 years ago
sneaxiy 2414f92f54 test=develop
7 years ago
barrierye 8c1e304307 merge nn.py
7 years ago
sneaxiy 45559d042c move to pass
7 years ago
dengkaipeng c93e044ae0 add inclusive/exclusive mode in PoolOp avg pool type
7 years ago
JiabinYang 9a74c4489f test=develop
7 years ago
barrierye 9dc28179a4 add similarity_focus op
7 years ago
Qiao Longfei 7cd2417fe2 Merge branch 'develop' into cpu-for-1.1-merge-with-shape
7 years ago
Xin Pan 0a80f06ec4
Merge pull request #14086 from panyx0718/fix6
7 years ago
sneaxiy a314a80cdb merge develop
7 years ago
Tao Luo 4928ff32a9 fix cmake warning when ON_INFER=false
7 years ago
dzhwinter c8adc2c6fe cudnn version. staged.
7 years ago
Qiao Longfei 06ffbc4f28 Merge branch 'shape_int_to_int64' of https://github.com/seiriosPlus/Paddle into cpu-for-1.1-merge-with-shape
7 years ago
seiriosPlus 06de824ba8 fix shape in floats
7 years ago
Yan Chunwei ee74be3a49
[1.1] Bugfix/tensorarray (#14044)
7 years ago
Qiyang Min 33b4920d2d
Merge pull request #14057 from velconia/continue_hash_op
7 years ago
Qiyang Min 209f24a241
Merge pull request #14051 from velconia/accelerate_embedding_grad
7 years ago
minqiyang 2fec8c5d9a Polish code
7 years ago
minqiyang 7f7af5d412 Add xxhash deps to inference demo and trainer demo
7 years ago
Qiao Longfei 7cfc3c4415 Merge branch 'optimize-sum-seq-pooling-op' of ssh://github.com/jacquesqiao/Paddle into cpu-for-1.1-merge
7 years ago
Qiao Longfei 72aef6b168 sum selected rows check empty
7 years ago
minqiyang fe18adfbaa Add fluid inference support
7 years ago
seiriosPlus c34610f86d Fix lookup table at CPU Reduce strategy, test=develop
7 years ago
Qiao Longfei 641369f92b Merge branch 'dist-table-do-not-init-on-trainer' of ssh://github.com/jacquesqiao/Paddle into cpu-for-1.1-merge
7 years ago
Qiao Longfei d69c820707 Merge branch 'add-flag-to-control-rpc-thread-num' of ssh://github.com/jacquesqiao/Paddle into cpu-for-1.1-merge
7 years ago
Qiao Longfei 1ed9ef6d70 Merge branch 'shape_int_to_int64' of https://github.com/seiriosPlus/Paddle into cpu-for-1.1-merge
7 years ago
Qiao Longfei f1a3fb041b Merge branch 'fix_lookuptable_in_reduce' of https://github.com/seiriosPlus/Paddle into cpu-for-1.1-merge
7 years ago
Qiao Longfei da61a5b672 Merge branch 'optimizer-prefetch' of https://github.com/seiriosPlus/Paddle into cpu-for-1.1-merge
7 years ago
tangwei12 5ce3a32e06
Merge branch 'develop' into optimizer-prefetch
7 years ago
seiriosPlus b6590b05fb submit by tangwei12, test=develop
7 years ago
Wu Yi 9da9b1926b
[1.1] fix graph num hang (#14072)
7 years ago
tangwei12 cb1ccc710b fix shape type in uniform_random_op.cu
7 years ago
Qiao Longfei 575f22711d optimize code
7 years ago
Qiao Longfei 96d5500934 optimize code
7 years ago
Qiao Longfei 748ee35c89 sum op handle empty input update selected_rows_functor.cu
7 years ago
Qiao Longfei dd78b5df93 sum op handle empty input
7 years ago
Qiao Longfei cbe128bbae Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-sum-seq-pooling-op
7 years ago
Qiao Longfei f4df0cb1a2 update the type of shape to int64, format code
7 years ago
Qiao Longfei fad42fe7cc broadcast handle not inited parameter
7 years ago
Qiao Longfei 7dcb0dc8c6 update year
7 years ago
Qiao Longfei 68aeb4e7e9 add fake init test in test_dist_transpiler
7 years ago
Tao Luo 5ed3e6f3f6
Merge pull request #14042 from luotao1/remove_unused_code
7 years ago
Qiao Longfei a13c788a04 fix a bug
7 years ago
Zeng Jinle 97d47a7d08
Merge pull request #13913 from sneaxiy/seq_reverse
7 years ago
JiabinYang 6e3615422f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_reorg_op
7 years ago
Jiabin Yang a3efba176c
Merge pull request #14085 from jerrywgz/fix_generate_proposals_op
7 years ago
dzhwinter 7141debe38 add cudnn back. staged.
7 years ago
Guo Sheng b9ae1c49f8
Merge pull request #13994 from guoshengCS/add-reshape-reuse-input
7 years ago
Zeng Jinle 60058180cb
Merge pull request #13945 from sneaxiy/unify_mixed_vector_api
7 years ago
Qiao Longfei 0328ffd3ab add fake init op
7 years ago
Xin Pan bcc9126e7b
Merge pull request #14056 from panyx0718/fix
7 years ago
Sylwester Fraczek 2098b42584 review fixes (Teamcity fails)
7 years ago
Tao Luo 961baea16c
Merge pull request #14063 from wojtuss/wojtuss/remove-unused-EnableMKLDNN
7 years ago
Hongyu Liu 379d933ae5
Merge pull request #14036 from phlrain/add_dropout_att_new
7 years ago
tangwei12 d8b697357f update height_sections to int64_t
7 years ago
minqiyang a2820b9899 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_embedding_grad
7 years ago
Xin Pan bba0c4a9f2 delete unused codes.
7 years ago
jerrywgz de2f965c9b test=develop
7 years ago
guosheng cc0e23973d Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-reshape-reuse-input
7 years ago
tangwei12 318ba99124 revert changes in protobuf.cc and type_defs
7 years ago
tangwei12 aa6dc82f4b revert changes in protobuf.cc and type_defs
7 years ago
dzhwinter 09409bad4d staged. test speed=49ms in 1080.
7 years ago
tensor-tang 64d5b4385e fix crf decode avx512
7 years ago
tensor-tang 21487d78bf add crf decode jit kernel
7 years ago
sneaxiy b1fd62f39e test=develop
7 years ago
guosheng 3cfaeac288 Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-reshape-reuse-input
7 years ago
sneaxiy 1af3fe8c35 test=develop
7 years ago
Xin Pan d5d09672c8 better fix
7 years ago
Qiao Longfei de539d72da format
7 years ago
sneaxiy 5be6f762d0 remove_lock_in_some_ops
7 years ago
buxingyuan 6c1d74bb47 Merge branch 'develop' into FasterOpDoc
7 years ago
Xin Pan 726fd438cd avoid blocking everyone
7 years ago
JiabinYang 7bcba47e41 test=develop
7 years ago
barrierye a7f94ec794 add similarity_focus op
7 years ago
Tao Luo 8ab953e37c auto insert infer_graph_clean_pass as the default first one
7 years ago
Tao Luo d70c7fb9b3 Merge branch 'develop' into remove_unused_code
7 years ago
Tao Luo ea2bdd192d Merge branch 'develop' into remove_unused_code
7 years ago
minqiyang 0de6811ee0 Change reserve to resize
7 years ago
tangwei12 b58957d9d7 Revert "fix lookuptable in reduce strategy"
7 years ago
JiabinYang 9cad409f2a test=develop
7 years ago
tangwei12 2761eafb92 shape type to int64_t, test=develop
7 years ago
tangwei12 d4a8967c1e add const in &, test=develop
7 years ago
minqiyang 5660d6a3ba Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_embedding_grad
7 years ago
guosheng 1f92c30565 Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-reshape-reuse-input
7 years ago
tensor-tang a05fce6544 Merge remote-tracking branch 'ups/develop' into fix/jit/avx
7 years ago
JiabinYang bd064c0f44 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_reorg_op
7 years ago
tangwei12 0e25e397bd shape type to int64_t, test=develop
7 years ago
Qiyang Min d0fdcb2f6d
Merge pull request #14048 from velconia/change_sequence_pool_to_cpu
7 years ago
tangwei12 d1e85e33d7 shape type to int64_t, test=develop
7 years ago
Yu Yang 8310ce6007 Fix cluster memory
7 years ago
tensor-tang d24d282a7a fix avx error
7 years ago
tensor-tang 9cb8738f54
Merge pull request #14018 from tensor-tang/refine/jit/gru
7 years ago
Xin Pan 70effddfc1 fix
7 years ago
Xin Pan 64e7688ade clean more APIs
7 years ago
Xin Pan c891bc22f5 clarify Reset
7 years ago
Qiao Longfei 6253b152e6 Merge branch 'optimize-sum-seq-pooling-op' of https://github.com/jacquesqiao/Paddle into optimize-sum-seq-pooling-op
7 years ago
Qiao Longfei 14f5a40898 fix unit test
7 years ago
minqiyang 447a680a2b Add API.spec
7 years ago
minqiyang 5de4619781 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_embedding_grad
7 years ago
minqiyang 0695c1fbe8 Add remind for code
7 years ago
minqiyang 0c5c4c4a5b Add blas header file
7 years ago
guosheng aac426444f Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-reshape-reuse-input
7 years ago
buxingyuan d0ccdf8fc1 follow comments
7 years ago
minqiyang e2a348cd10 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into change_sequence_pool_to_cpu
7 years ago
Qiao Longfei f4e6fe0786 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-sum-seq-pooling-op
7 years ago
Xin Pan 4f59690b4c clean unused codes
7 years ago
Xin Pan 784a19ecd0 fix some thread-safty issue and simplify threadpool
7 years ago
Wojciech Uss be58997443 remove unused method from naive executor
7 years ago
minqiyang 40141f749b Implement the unittest for hash op
7 years ago
Sylwester Fraczek 741cb33bd9 test multithreading
7 years ago
Brian Liu a53e8a8da6 Update MKLDNN integration framework to support Paddle multi-instances
7 years ago
minqiyang 8a0f26f45f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into continue_hash_op
7 years ago
minqiyang d4f9aa0852 Add hash op implementation
7 years ago
dzhwinter 468467f391 update real incnet tester
7 years ago
tangwei12 39b3bf24d0 shape type to int64_t, test=develop
7 years ago
tangwei12 755927d2b0 shape type to int64_t, test=develop
7 years ago
Qiao Longfei 7357d8412e add flags for control the thead num for pserver
7 years ago
phlrain a4ad286e6b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_dropout_att_new
7 years ago
phlrain 469bdb9e55 modify api.spec; test=develop
7 years ago
minqiyang 1a3b38a432 Polish code
7 years ago
dzhwinter b154e0b492 clean demo_ci
7 years ago
minqiyang 133bac2b10 Accelerate embedding op grad
7 years ago
Zhaolong Xing 2256fae45d
Merge pull request #13938 from NHZlX/ocr_attention_support
7 years ago
dzhwinter abe8e207c4 clean demo_ci
7 years ago
dzhwinter 597d92179b clean demo_ci
7 years ago
phlrain 201d4f2a85 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_dropout_att_new
7 years ago
phlrain a6e6bc45d6 modify dropout att; test=develop
7 years ago
minqiyang 2468057da6 Move code to SumSeqPoolGradFunctor
7 years ago
minqiyang 9725db0d40 Fix copy wrong pos bug
7 years ago
minqiyang 9c68709036 Accelerate sequence_pool functor
7 years ago
minqiyang 14ebc424d6 Add gpu support for unittest
7 years ago
jerrywgz e906c8e5e7
Merge pull request #14022 from jerrywgz/fix_rpn_target_assign_op
7 years ago
minqiyang bd5a82e193 Polish unit test code
7 years ago
minqiyang 047fa2f9aa Add unit-test for sequence_pooling functor
7 years ago
qingqing01 c7379a7320 Fix top_k op (#14034)
7 years ago
sneaxiy 016bf51e3f test=develop
7 years ago
Tao Luo f7bbcfa913 remove unused code in paddle_inference_api.h
7 years ago
JiabinYang c056328563 test=develop
7 years ago
nhzlx 11f189bacf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_demo_ci_trt
7 years ago
tangwei12 8b7f45a889 add longs in framework
7 years ago
JiabinYang c13f1ef3c4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_reorg_op
7 years ago
tangwei12 f3729db6e0 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into Pdv
7 years ago
Xin Pan 8837669782
Merge pull request #13982 from panyx0718/fix
7 years ago
dzhwinter dbd0075b68 Merge branch 'windows/support' into lb
7 years ago
dzhwinter c6dcffc61a lb. add debug output
7 years ago
wanghaoshuang 78cf76a1ca fix linux compile
7 years ago
tangwei12 770e2a1881 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into Pdv
7 years ago
chengduo e943f4508b
add graph number check (#14025)
7 years ago
sneaxiy 92a2817a2b test=develop
7 years ago
JiabinYang 8e8e8e66ab Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_reorg_op
7 years ago
nhzlx ae8f26072d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_demo_ci_trt
7 years ago
phlrain 049c9c7d2a Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_dropout_att_new
7 years ago
phlrain ffb24a73ec add dropout attr; test=develop
7 years ago
sneaxiy 8f07f60915 test=develop
7 years ago
wanghaoshuang 5993155d67 Merge remote-tracking branch 'dzhwinter/windows/support' into windows/support
7 years ago
wanghaoshuang f9e7cfb03c save binary file
7 years ago
tensor-tang 032c3a07e3 Merge remote-tracking branch 'ups/develop' into refine/jit/gru
7 years ago
tensor-tang 159be8cc63 optimize fusion gru kernel at size 8
7 years ago
dzhwinter 607080e888 windows static library
7 years ago
Tao Luo 23da8defc8
Merge pull request #14028 from luotao1/fix_resnet50_test
7 years ago
Yu Yang 71c846ef8a Revert buggy changes
7 years ago
JiabinYang ff07dc315e test=develop
7 years ago
chengduo a7497653d0
Refine Split op (#13967)
7 years ago
Yu Yang dbf9f6f408 Fix distribute compile
7 years ago
guosheng 3099a8f3aa Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-reshape-reuse-input
7 years ago
jerrywgz e0708e62ba refine code
7 years ago
jerrywgz 1c591c3909
Merge branch 'develop' into fix_rpn_target_assign_op
7 years ago
sneaxiy a9d7a9d720 test=develop
7 years ago
Tao Luo 316bc9bfc9 fix typo and warning in analyzer_resnet50_test
7 years ago
guosheng 6447b69aec Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-reshape-reuse-input
7 years ago
jerrywgz f06c6193d7 fix rpn target assign test=develop
7 years ago
Yu Yang 1d4d4e73ab Remove place hash
7 years ago
dongzhihong 563e7bca7f "fix op. test=develop"
7 years ago
Xin Pan 4625f83f92 better handle var type inference
7 years ago
Xin Pan 8f2116d8fa clean up after the changes have been stopped for so long.
7 years ago
tensor-tang 83dc689877 Merge remote-tracking branch 'ups/develop' into refine/jit/gru
7 years ago
tensor-tang 640e789d3d add fusion gru jit kernel
7 years ago
JiabinYang 39d39775c3 test=develop
7 years ago
JiabinYang 70351de1b5 test=develop
7 years ago
Yu Yang 461f71a90b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
qingqing01 0e24138494
Merge pull request #13991 from qingqing01/refine_generate_proposals_op
7 years ago
gongweibao 58c027cc38
Add rpc profiler flags. (#13989)
7 years ago
Xin Pan d10e54c460
Merge pull request #14003 from chengduoZH/fix_fast_parallel_exe_bug
7 years ago
Tao Luo 42aa1d409d
Merge pull request #13485 from tpatejko/tpatejko/capi-resnet-conv-elementwise-fusion
7 years ago
Yu Yang 9dcddf92f2 Polish best_fit_allocator
7 years ago
tensor-tang 664159ad42
Merge pull request #13998 from tensor-tang/fea/fusion_seqconv_add
7 years ago
Yu Yang 0c25da39a0 Refine auto_increment_allocator
7 years ago
Yu Yang ab87a88200 Polish retry allocator
7 years ago
guosheng 6d3b030bb5 Refine the api of reshape to be compatible.
7 years ago
chengduozh 82d2903b63 Fix fast ParallelExe bug
7 years ago
Tomasz Patejko aa35aaa1ab MKLDNN conv + elementwise_add fusion: fixing formatting
7 years ago
jerrywgz 765085d297
Merge pull request #13904 from jerrywgz/roialign
7 years ago
Dang Qingqing 56936b9e25 Refine doc for generate_proposals_op.
7 years ago
Tomasz Patejko ce2464fd98 MKLDNN conv + elementwise_add fusion: UT for missing bias added. UTs refactored. Some minor changes in the pass
7 years ago
Tomasz Patejko 4e72ab411e MKLDNN conv + elementwise_add fusion: fix for crash when bias is not present
7 years ago
Tomasz Patejko 415b261555 MKLDNN conv + elementwise_add fusion: fusion options added
7 years ago
Tomasz Patejko 1676094697 MKLDNN conv + elementwise_add fusion: turn on residual connection pass when CAPI is used.
7 years ago
Tomasz Patejko 0fe3079c46 MKLDNN conv + elementwise_add fusion: fix for order of parameters in elementwise_add in resnet50
7 years ago
Tomasz Patejko b73b868366 MKLDNN conv + elementwise_add fusion: bias in tests made persistent.
7 years ago
Tomasz Patejko a1fa203287 MKLDNN conv + elementwise_add fusion: name of the pass reused with name_scope_
7 years ago
Tomasz Patejko 2c43419db1 MKLDNN conv + elementwise_add fusion: comment explaining CorrectGraphEdges added
7 years ago
Tomasz Patejko 8fb29b2ca9 MKLDNN conv + elementwise_add fusion: new nodes marked as input or output
7 years ago
Tomasz Patejko cc1c8e37c1 MKLDNN conv + elementwise_add fusion: attributes in new conv op copied from old op
7 years ago
Tomasz Patejko a27a8c5da8 MKLDNN conv + elementwise_add fusion: bias in test marked as persistable
7 years ago
Tomasz Patejko af8c71317c MKLDNN conv + elementwise_add fusion: CorrectGraphEdges refactored
7 years ago
Tomasz Patejko 3e033087f1 MKLDNN conv + elementwise_add fusion: LinkNodes function removed and
7 years ago
Tomasz Patejko 4be45af1cc MKLDNN conv + elementwise_add fusion: skip connection attribute renamed. Comments about patterns added.
7 years ago
Tomasz Patejko 9a335e0277 MKLDNN conv + elementwise_add fusion: changed a name of a formal argument in ElementwiseAdd pattern
7 years ago
Tomasz Patejko fb7a50b230 MKLDNN conv + elementwise_add fusion: removed commented code. Internal functions marked as static.
7 years ago
Michal Gallus f688197182 MKLDNN conv + elementwise_add fusion: Fix output_data to point to the right tensor, also fix transpiler integration
7 years ago
Tomasz Patejko efd76614fb MKLDNN conv + elementwise_add fusion: implementation changed to conform with Paddle API
7 years ago
Tomasz Patejko 347bf90412 MKLDNN conv + elementwise_add fusion: bias is also handled
7 years ago
Tomasz Patejko bf95ac36a7 MKLDNN conv + elementwise_add fusion: further reformatting
7 years ago
Tomasz Patejko cbe122ae2e MKLDNN conv + elementwise_add fusion: correcting formatting
7 years ago
Tomasz Patejko 2a251bbf27 MKLDNN conv + elementwise_add fusion: some refactoring: consts, function calls instead of constant values
7 years ago
Tomasz Patejko b8e54ab5cc MKLDNN conv + elementwise_add fusion: parameter name changed to ResidualData
7 years ago
Tomasz Patejko 27573ece03 MKLDNN conv + elementwise_add fusion: trailing spaces removed
7 years ago
Tomasz Patejko 7f5c8a95e8 MKLDNN conv + elementwise_add fusion: arguments are replaced for many parameters in operator
7 years ago
Tomasz Patejko 5996bd39e8 MKLDNN conv + elementwise_add fusion: graph is corrected based on actual argument name, not formal argument name
7 years ago
Tomasz Patejko 41f3d78fdf MKLDNN conv + elementwise_add fusion: output and elemwise param share data in conv primitive. Output is properly allocated
7 years ago
Tomasz Patejko 07a62ddc08 MKLDNN conv + elementwise_add fusion: inputs in pass modified. Support for new conv parameter. UTs corrected
7 years ago
Tomasz Patejko 56528531ea MKLDNN conv + elementwis_add fusion: initial work on passing eltwise data to conv primitive
7 years ago
Tomasz Patejko 42f569fdfd MKLDNN conv + elementwise_add fusion: use_mkldnn attribute added
7 years ago
Tomasz Patejko 441d3a4726 MKLDNN conv + elementwise_add: added some refactoring in the pass
7 years ago
Tomasz Patejko 38b7b34b1c MKLDNN conv + elementwise_add fusion: added reachability tests, inputs and outputs in graph nodes are transformed
7 years ago
Tomasz Patejko 16eaaf3fbe MKLDNN conv + elementwise_add fusion: added one more UT, found and corrected bugs in pass
7 years ago
Tomasz Patejko 604bad08bc MKLDNN conv + elementwise_add fusion: implementation of patterns refarctored, applied to graph. UTs added
7 years ago
Tomasz Patejko 9ce343f868 MKLDNN conv + elementwise_add fusion: initial implementation of patterns
7 years ago
tensor-tang 40f8456a4f refine fuse pattern and attr
7 years ago
tensor-tang cbbacb2534 Merge remote-tracking branch 'ups/develop' into fea/fusion_seqconv_add
7 years ago
tensor-tang 603ba5e01d add seqconv eltadd relu pass
7 years ago
Dang Qingqing 4801ee8f97 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_generate_proposals_op
7 years ago
Tao Luo da722d6d9b
Merge pull request #13858 from Sand3r-/mgallus/conv-bias-pass
7 years ago
Tao Luo a4b48f70c1
Merge pull request #13997 from wojtuss/wojtuss/do-not-enable-mkldnn-twice
7 years ago
Tao Luo 252401c5d3
Merge pull request #13992 from wojtuss/wojtuss/add-ifdef-mkldnn
7 years ago
Michał Gallus f9ca31811d
Remove use mkldnn from config in resnet50 test
7 years ago
tensor-tang 23fc896bc2 Merge remote-tracking branch 'ups/develop' into fea/fusion_seqconv_add
7 years ago
tensor-tang 339e655aec refine and add seqconv elementwiseadd relu op test
7 years ago
sneaxiy 2002e71da8 fix pinned allocator
7 years ago
Michal Gallus c504a5a1b7 Adjust Conv+bias to placement pass
7 years ago
Michal Gallus d7509d63f1 Conv+Bias: Support non-null bias
7 years ago
Michal Gallus 91e8fbac2f Enable MKLDNN in Resnet50Tester
7 years ago
Michal Gallus 582f59c190 Conv+Bias fuse
7 years ago
jerrywgz a1d3db031b
Merge pull request #13844 from jerrywgz/fix_roi_pool
7 years ago
guosheng dfb841ad5a Make reshape_op reuse input.
7 years ago
Dang Qingqing 8e0b9496de Fix unit test
7 years ago
Wojciech Uss e6f480ec44 add comment on the default first pass
7 years ago
Wojciech Uss 2cf258e381 remove redundant pass list
7 years ago
Wojciech Uss 5632019f0f add MKL-DNN placement pass
7 years ago
tensor-tang 0a9f5f1790
Merge pull request #13968 from tensor-tang/fix/jit/exp
7 years ago
Wojciech Uss 5083ec3a1b do not enable MKL-DNN twice
7 years ago
Yipeng fcb2e8103e Ocr end2end dev (#13889)
7 years ago
tensor-tang e5ce965952 refine and add eltadd_relu unit test
7 years ago
sneaxiy 5a38930660 test=develop
7 years ago
Wojciech Uss c3b70aece9 Add MKL-DNN placement pass (#13958)
7 years ago
Wojciech Uss 4a368a4901 add ifdef guard for MKL-DNN placement pass
7 years ago
Xin Pan 909e1341bd
Merge pull request #13966 from panyx0718/fix4
7 years ago
chengduo 9775e50ca2
Fix add doc for bias_attr (#13937)
7 years ago
Tao Luo 7b11162ab5
Merge pull request #13949 from PaddlePaddle/wojtuss/unique-patterns-request-comment
7 years ago
tensor-tang 7cb19a5976 fuse elementwise_add and relu
7 years ago
tensor-tang 3c249283af init seqconv eltadd relu op
7 years ago
tangwei12 48982e9dc7 fix lookuptable in reduce strategy
7 years ago
Xin Pan 9a819265eb fix
7 years ago
sneaxiy ac2eba4457 test=develop
7 years ago
Tao Luo 305034f5b3
Merge pull request #13909 from luotao1/mkldnn_test
7 years ago
superjomn b77e4f4978 update
7 years ago
jerrywgz 553342624e test=develop
7 years ago
jerrywgz 9a14ca91b8 test=develop
7 years ago
tensor-tang 60ff05e312 Merge branch 'luotao1-fix_rnn2_test' into fix/jit/exp
7 years ago
Tao Luo ef09862450 fix analyzer_rnn2_test
7 years ago
tangwei12 0e722c5ea2 fix lookuptable in reduce strategy
7 years ago
Tao Luo e5b4643ad8 add profile_mkldnn test
7 years ago
Tao Luo 7d680be5a3 Merge branch 'develop' into mkldnn_test
7 years ago
buxingyuan 0bb3b099c2 generate_proposal_labels doc
7 years ago
Wojciech Uss 55fd136ab0 Added comment with request for enhancement
7 years ago
gongweibao a831ecc75d
Add grpc error context. (#13957)
7 years ago
tensor-tang b139b687de Merge remote-tracking branch 'ups/develop' into fix/jit/exp
7 years ago
qingqing01 67a2b5215d
Add affine channel op to speed and save memory for faster-rcnn model. (#13919)
7 years ago
tensor-tang 748435586a clean code exp avx
7 years ago
tensor-tang b4751a34a5 fix illegal instruction of rnn2
7 years ago
sneaxiy 21fdf8e87d add unittest for allocator_facade.cc
7 years ago
Xin Pan 6de08b5eef set default timeout to avoiding blocking CI
7 years ago
tensor-tang 30dfbdee7f
Merge pull request #13951 from tensor-tang/fix/warning
7 years ago
Tao Luo 34ed7d1379
Merge pull request #13924 from luotao1/clean_inference_lib
7 years ago
tensor-tang 36588b3365 fix illegal instruction of rnn1 and text
7 years ago
Tao Luo 6a4e9230ed Merge branch 'develop' into mkldnn_test
7 years ago
gongweibao 078223b3e3
Add rpc timeline. (#13900)
7 years ago
dzhwinter 29382db625
Merge pull request #13874 from dzhwinter/fix/momentum
7 years ago
Xin Pan 6a54c3de1f
Merge pull request #13928 from panyx0718/doc
7 years ago
qingqing01 5dbb2e9986
Small changes for sum_op to avoid zero setting. (#13923)
7 years ago
Tao Luo b819684370 add compare_mkldnn test
7 years ago
Tao Luo e47f4186ae fix some compiler warning
7 years ago
nhzlx b970c6d5d0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_demo_ci_trt
7 years ago
nhzlx 32072d31b5 fix demo ci error on manylinux
7 years ago
Jiabin Yang 6553956bd6
Merge pull request #13931 from JiabinYang/fix_dist_on_mac
7 years ago
dzhwinter 00e8791f66 fix compile in cpu error. test=develop
7 years ago
tensor-tang e69328c3bc fix warning and mac compile
7 years ago
Tao Luo 6ea9d1b595 add analysis_predictor in vis_demo
7 years ago
Tao Luo f444a7226e Merge branch 'develop' into clean_inference_lib
7 years ago
Tao Luo 3598500773
Merge pull request #13867 from Superjomn/clean/CreatePaddlePredictor
7 years ago
Tao Luo 41eeb771e8 Merge branch 'develop' into clean_inference_lib
7 years ago
sneaxiy 3419d04c3f test=develop
7 years ago
dzhwinter d239cf2e15 use binary search. test=develop
7 years ago
dzhwinter a9f5f822e6 use binary search. test=develop
7 years ago
Tao Luo b854d959a5 update with comments
7 years ago
Qiao Longfei cc7f551441 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pyreader
7 years ago
nhzlx 2b5edfbc37 Add ceil model pooling for trt (ocr attention)
7 years ago
Tao Luo 75bb0babef Merge branch 'develop' into mkldnn_test
7 years ago
tensor-tang 6447155dac
Merge pull request #13851 from tensor-tang/fea/jitkernel_peephole
7 years ago
sneaxiy 4b4af84e67 test=develop
7 years ago
jerrywgz 4c9884e713 refine unittest test=develop
7 years ago
JiabinYang 02f863400e test=develop
7 years ago
Qiao Longfei 0225957515 change elementwise_add to elementwise_add_to test=develop
7 years ago
Qiao Longfei bd2b6d7f8f sum_op support inplace
7 years ago
Xin Pan 7fb5b66ac2
Merge pull request #13916 from panyx0718/fix2
7 years ago
Yan Chunwei 6809238d97
fix analysis predictor profile (#13896)
7 years ago
Xin Pan abbfb60ca9 remove unused codes
7 years ago
Yibing Liu 6b795d424c
Merge pull request #13901 from kuke/seq_slice_py
7 years ago
nhzlx 9d98ca0424 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_googlenet_bug_with_rule
7 years ago
nhzlx 849a6874ad fix googlenet bug with relu
7 years ago
dzhwinter 3861269594 merge develop branch
7 years ago
jerrywgz 98c3294b85 Merge branch 'roialign' of https://github.com/jerrywgz/Paddle into roialign
7 years ago
Tao Luo a35e7f4bae adjust demo_ci with fluid_inference_install_dir
7 years ago
tangwei12 fa2ab3346c
fill constant add infervarshape, lookuptable clone lr var (#13830)
7 years ago
jerrywgz 8c79071d6a roi_align for gpu
7 years ago
Xin Pan 342e436158 Make Var::GetMutable robust
7 years ago
Yan Chunwei 7a751b83ac fix isfinite_op sprintf (#13850)
7 years ago
Qiyang Min e3a64fca44
Merge pull request #13835 from velconia/fix_reshape_op
7 years ago
Yibing Liu 46b0b7903c
Merge pull request #13856 from kuke/seq_unpad_op
7 years ago
tensor-tang dcfb687584
Merge pull request #13846 from tensor-tang/fix/mkldnn
7 years ago
Qiao Longfei b4a32eafdf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-sum-seq-pooling-op
7 years ago
Tao Luo bd77460182 refine mkldnn test in analyzer_tests
7 years ago
jerrywgz c9d2046f76 roi_align for gpu
7 years ago
jerrywgz 2f5a80174e add roi_align api
7 years ago
dzhwinter e41a3fcd68 fix update to develop hang problem.
7 years ago
Zeng Jinle af91d41ab8
Merge pull request #13852 from sneaxiy/feature/eager_delete_tensor
7 years ago
Zeng Jinle 93606c2c2c
Merge pull request #13689 from sneaxiy/sparse_rmsprop
7 years ago
Qiao Longfei 681226e97c
Merge pull request #13864 from jacquesqiao/py-reader-add-test-mode
7 years ago
jerrywgz 90f39b1123 Merge branch 'roialign' of https://github.com/jerrywgz/Paddle into roialign
7 years ago
Tao Luo f8874b3cb2
Merge pull request #13884 from luotao1/rename_inference_lib_dist
7 years ago
Xin Pan 288a112ffd
Revert "Revert "Revert "Make variable::GetMutable robust"""
7 years ago
sneaxiy 5cedfb60c8 test=develop
7 years ago
Yibing Liu b785798585 Expose layer's name for sequence pad & unpad
7 years ago
Yibing Liu 18e1c1e07d Update API spec for seq slice
7 years ago
jerrywgz 5e52dafda5 add roi align
7 years ago
jerrywgz c0e34eebec add roi align
7 years ago
Tao Luo c26f2b21eb
Merge pull request #13813 from sfraczek/sfraczek/conv-bn-fuse-pass-full-eigen
7 years ago
sneaxiy 64d94596ab fix allocator_facade bug
7 years ago
Yibing Liu 16b2c6dc78 Add py api for sequence_slice_op
7 years ago
superjomn 1cfd2b51a7 update
7 years ago
Xin Pan fededdda20
Merge pull request #13872 from panyx0718/fix2
7 years ago
dzhwinter 804dd7da04 merge conflict. both linux and windows pass.
7 years ago
dzhwinter 962061f0a3 windows fix
7 years ago
wanghaoshuang b12f7c2399 compile in linux.
7 years ago
Qiao Longfei b16e9cd105
a small fix for compile WITH_INFERENCE=OFF (#13869)
7 years ago
Qiao Longfei ec25a09bd5 revert unused change test=develop
7 years ago
Qiao Longfei 60030e8678 change the use of FLAGS_reader_queue_speed_test_mode
7 years ago
Tao Luo 323d67cfc1
Merge pull request #13879 from panyx0718/doc
7 years ago
Qiao Longfei 936926aadd code optimize
7 years ago
Sylwester Fraczek 50c5e9b0c6 reshape_2d used from ddim.h
7 years ago
Qiyang Min cab29828a5
Merge pull request #13829 from velconia/accelerate_sequence_pool_op
7 years ago
minqiyang aeec82acd5 Add unittest for reshape op
7 years ago
Qiao Longfei 9fd78df71c revert unused change
7 years ago
Xin Pan ddb76d0d09 Make GetMutable more robust
7 years ago
Qiao Longfei c52ccbc109 clean code
7 years ago
Qiao Longfei 6056d04361 optimize blas call
7 years ago
Qiyang Min c2842377ce
Merge pull request #13837 from velconia/add_pyramid_dnn_support
7 years ago
Qiao Longfei 5db7551317 optimize code
7 years ago
minqiyang 24c9fbdba3 Polish code
7 years ago
chengduo 2c9839c847
add cuda version display (#13885)
7 years ago
sneaxiy d3ed070e10 test=develop
7 years ago
minqiyang d9b202e717 Move tensor copy src_ptr and dst_ptr check to TensorCopy function
7 years ago
sneaxiy fb6201e93e test=develop
7 years ago
chengduo 8e2fdc54b1
Add check for opt op (#13840)
7 years ago
Qiao Longfei eb6d9e3bbe Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-sum-seq-pooling-op
7 years ago
Yibing Liu 46e61d81a7 Wrapper py api for sequence_unpad
7 years ago
Qiao Longfei 0170d36c42 fix a bug
7 years ago
superjomn 28459592cc update
7 years ago
Qiyang Min e37c9e6732
Merge pull request #13828 from velconia/accelerate_selected_rows_functor
7 years ago
Qiao Longfei 86e2e686ee fix bug
7 years ago
Qiao Longfei 333fd15204 add gpu test for mrege add
7 years ago
Tao Luo 3d976f3f18 rename inference_lib_dist to fluid_lib_dist
7 years ago
Qiao Longfei ab3e36da80 update MergeAdd for selected_rows_functor.cu
7 years ago
sneaxiy 84d9300365 test=develop
7 years ago
chengduo 2562eb92b8
Add strategy doc (#13849)
7 years ago
Xin Pan b11372a0af fix doc
7 years ago
Zhaolong Xing 7413fa458f
Merge pull request #13838 from NHZlX/add_trt_pad_op
7 years ago
superjomn e2bd40ca82 update
7 years ago
Qiao Longfei d5c64af24f change map to unordered_map
7 years ago
Qiao Longfei 005f1923a2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-sum-seq-pooling-op
7 years ago
wanghaoshuang 3ae9645084 compile in linux
7 years ago
dzhwinter 8329a1f139 add sparse update momentum. test=develop
7 years ago
superjomn 049fcbe125 update
7 years ago
Qiao Longfei ea97e83e41 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pyreader
7 years ago
superjomn f5c0221c17 clean CreatePaddlePredictor
7 years ago
Xin Pan d852be7c48
Revert "Make variable::GetMutable robust"
7 years ago
Qiao Longfei c61e16b181 add reader_queue_speed_test_mode_flag test
7 years ago
Qiao Longfei 8686f7c68e add reader_queue_speed_test_mode flag for speed test
7 years ago
Xin Pan ae8b1c32bf
Merge pull request #13821 from panyx0718/fix
7 years ago
Xin Pan 9b3e0df375
Merge pull request #13819 from panyx0718/doc
7 years ago
Zhaolong Xing 44f37d019a
Merge pull request #13848 from NHZlX/fix_demo_ci_with_trt
7 years ago
Sylwester Fraczek 55d6950a1a rewrite conv_bn fuse pass to eigen
7 years ago
chengduo 9c77b65c06
Fix layers.uniform_random (#13823)
7 years ago
nhzlx 320c78e16f fix commets
7 years ago
Yibing Liu cbe4292516 Add sequence unpad op
7 years ago
tensor-tang bcb8ea397d Merge remote-tracking branch 'ups/develop' into fea/jitkernel_peephole
7 years ago
tensor-tang 8e182170ba refine and replace lstm peephole kernel
7 years ago
nhzlx efa5bac7ad fix demo_ci bug in vis_demo.cc
7 years ago
tensor-tang dc5a7b906d fix default number of threads when inference with or without MKLDNN
7 years ago
Xin Pan 228506618b Avoid GetMutable implicitly reset Var Type.
7 years ago
jerrywgz 3c963336e4 fix roi pool register
7 years ago
Dun 5f2e837847 optimize depthwise conv by register memory (#13778)
7 years ago
sneaxiy d87569134c test=develop
7 years ago
minqiyang 3f6ec90060 Polish code
7 years ago
minqiyang 9878eedbaa Change API.spec
7 years ago
Qiao Longfei 5428cb9908
Profiler support merge data of all thread (#13811)
7 years ago
nhzlx bf7a2789a0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_trt_pad_op
7 years ago
nhzlx 0cb88c34be add op converter
7 years ago