Commit Graph

2669 Commits (af2f5fc8249bd449a3d90df6e5cce1a63f9c244c)

Author SHA1 Message Date
dengkaipeng ff6329bd5f fix some inappropriate expressions in api doc for grid_sampler. test=develop
6 years ago
dengkaipeng 8f1e398824 move param exclusive to the last in pool2d/pool3d for forward compatibility:. test=develop
6 years ago
dengkaipeng 593e1b18d7 fix some bugs and add some doc for GridSampleOp
6 years ago
dengkaipeng 0bb0e0c10f add Grid Sampler Operator for STN.
6 years ago
Yu Yang c01696f8c2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
Qiao Longfei d26ff8cb2d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpu-for-1.1-merge-with-shape
6 years ago
JiabinYang e0a89503f8 test=develop
6 years ago
Wu Yi 26200f2e42
[1.1] [project] train imagenet using large batch size (#13766)
6 years ago
barrierye 8c1e304307 merge nn.py
6 years ago
dengkaipeng c93e044ae0 add inclusive/exclusive mode in PoolOp avg pool type
6 years ago
JiabinYang 9a74c4489f test=develop
6 years ago
barrierye 9dc28179a4 add similarity_focus op
6 years ago
Qiao Longfei 7cd2417fe2 Merge branch 'develop' into cpu-for-1.1-merge-with-shape
6 years ago
dzhwinter c8adc2c6fe cudnn version. staged.
6 years ago
Yan Chunwei ee74be3a49
[1.1] Bugfix/tensorarray (#14044)
6 years ago
Qiyang Min 33b4920d2d
Merge pull request #14057 from velconia/continue_hash_op
6 years ago
Qiyang Min 209f24a241
Merge pull request #14051 from velconia/accelerate_embedding_grad
6 years ago
Qiao Longfei 7cfc3c4415 Merge branch 'optimize-sum-seq-pooling-op' of ssh://github.com/jacquesqiao/Paddle into cpu-for-1.1-merge
6 years ago
Qiao Longfei 72aef6b168 sum selected rows check empty
6 years ago
Qiao Longfei 641369f92b Merge branch 'dist-table-do-not-init-on-trainer' of ssh://github.com/jacquesqiao/Paddle into cpu-for-1.1-merge
6 years ago
Qiao Longfei d69c820707 Merge branch 'add-flag-to-control-rpc-thread-num' of ssh://github.com/jacquesqiao/Paddle into cpu-for-1.1-merge
6 years ago
Qiao Longfei 1ed9ef6d70 Merge branch 'shape_int_to_int64' of https://github.com/seiriosPlus/Paddle into cpu-for-1.1-merge
6 years ago
Qiao Longfei da61a5b672 Merge branch 'optimizer-prefetch' of https://github.com/seiriosPlus/Paddle into cpu-for-1.1-merge
6 years ago
tangwei12 5ce3a32e06
Merge branch 'develop' into optimizer-prefetch
6 years ago
seiriosPlus b6590b05fb submit by tangwei12, test=develop
6 years ago
tangwei12 cb1ccc710b fix shape type in uniform_random_op.cu
6 years ago
Qiao Longfei 575f22711d optimize code
6 years ago
Qiao Longfei 96d5500934 optimize code
6 years ago
Qiao Longfei 748ee35c89 sum op handle empty input update selected_rows_functor.cu
6 years ago
Qiao Longfei dd78b5df93 sum op handle empty input
6 years ago
Qiao Longfei cbe128bbae Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-sum-seq-pooling-op
6 years ago
Qiao Longfei f4df0cb1a2 update the type of shape to int64, format code
6 years ago
Qiao Longfei 7dcb0dc8c6 update year
6 years ago
Qiao Longfei 68aeb4e7e9 add fake init test in test_dist_transpiler
6 years ago
Qiao Longfei a13c788a04 fix a bug
6 years ago
Zeng Jinle 97d47a7d08
Merge pull request #13913 from sneaxiy/seq_reverse
6 years ago
JiabinYang 6e3615422f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_reorg_op
6 years ago
Jiabin Yang a3efba176c
Merge pull request #14085 from jerrywgz/fix_generate_proposals_op
6 years ago
dzhwinter 7141debe38 add cudnn back. staged.
6 years ago
Qiao Longfei 0328ffd3ab add fake init op
6 years ago
Hongyu Liu 379d933ae5
Merge pull request #14036 from phlrain/add_dropout_att_new
6 years ago
tangwei12 d8b697357f update height_sections to int64_t
6 years ago
jerrywgz de2f965c9b test=develop
6 years ago
dzhwinter 09409bad4d staged. test speed=49ms in 1080.
6 years ago
tensor-tang 64d5b4385e fix crf decode avx512
6 years ago
tensor-tang 21487d78bf add crf decode jit kernel
6 years ago
sneaxiy 1af3fe8c35 test=develop
6 years ago
Qiao Longfei de539d72da format
6 years ago
sneaxiy 5be6f762d0 remove_lock_in_some_ops
6 years ago
buxingyuan 6c1d74bb47 Merge branch 'develop' into FasterOpDoc
6 years ago
JiabinYang 7bcba47e41 test=develop
6 years ago
barrierye a7f94ec794 add similarity_focus op
6 years ago
minqiyang 0de6811ee0 Change reserve to resize
6 years ago
JiabinYang 9cad409f2a test=develop
6 years ago
minqiyang 5660d6a3ba Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_embedding_grad
6 years ago
tensor-tang a05fce6544 Merge remote-tracking branch 'ups/develop' into fix/jit/avx
6 years ago
JiabinYang bd064c0f44 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_reorg_op
6 years ago
Qiyang Min d0fdcb2f6d
Merge pull request #14048 from velconia/change_sequence_pool_to_cpu
6 years ago
Yu Yang 8310ce6007 Fix cluster memory
6 years ago
tensor-tang d24d282a7a fix avx error
6 years ago
tensor-tang 9cb8738f54
Merge pull request #14018 from tensor-tang/refine/jit/gru
6 years ago
Qiao Longfei 6253b152e6 Merge branch 'optimize-sum-seq-pooling-op' of https://github.com/jacquesqiao/Paddle into optimize-sum-seq-pooling-op
6 years ago
Qiao Longfei 14f5a40898 fix unit test
6 years ago
minqiyang 5de4619781 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_embedding_grad
6 years ago
minqiyang 0695c1fbe8 Add remind for code
6 years ago
minqiyang 0c5c4c4a5b Add blas header file
6 years ago
buxingyuan d0ccdf8fc1 follow comments
6 years ago
minqiyang e2a348cd10 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into change_sequence_pool_to_cpu
6 years ago
Qiao Longfei f4e6fe0786 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-sum-seq-pooling-op
6 years ago
minqiyang 40141f749b Implement the unittest for hash op
6 years ago
minqiyang 8a0f26f45f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into continue_hash_op
6 years ago
minqiyang d4f9aa0852 Add hash op implementation
6 years ago
tangwei12 755927d2b0 shape type to int64_t, test=develop
6 years ago
Qiao Longfei 7357d8412e add flags for control the thead num for pserver
6 years ago
minqiyang 1a3b38a432 Polish code
6 years ago
minqiyang 133bac2b10 Accelerate embedding op grad
6 years ago
dzhwinter 597d92179b clean demo_ci
6 years ago
phlrain 201d4f2a85 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_dropout_att_new
6 years ago
phlrain a6e6bc45d6 modify dropout att; test=develop
6 years ago
minqiyang 2468057da6 Move code to SumSeqPoolGradFunctor
6 years ago
minqiyang 9725db0d40 Fix copy wrong pos bug
6 years ago
minqiyang 9c68709036 Accelerate sequence_pool functor
6 years ago
minqiyang 14ebc424d6 Add gpu support for unittest
6 years ago
jerrywgz e906c8e5e7
Merge pull request #14022 from jerrywgz/fix_rpn_target_assign_op
6 years ago
minqiyang bd5a82e193 Polish unit test code
6 years ago
minqiyang 047fa2f9aa Add unit-test for sequence_pooling functor
6 years ago
qingqing01 c7379a7320 Fix top_k op (#14034)
6 years ago
sneaxiy 016bf51e3f test=develop
6 years ago
JiabinYang c13f1ef3c4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_reorg_op
6 years ago
Xin Pan 8837669782
Merge pull request #13982 from panyx0718/fix
6 years ago
dzhwinter dbd0075b68 Merge branch 'windows/support' into lb
6 years ago
dzhwinter c6dcffc61a lb. add debug output
6 years ago
sneaxiy 92a2817a2b test=develop
6 years ago
JiabinYang 8e8e8e66ab Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_reorg_op
6 years ago
phlrain 049c9c7d2a Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_dropout_att_new
6 years ago
phlrain ffb24a73ec add dropout attr; test=develop
6 years ago
wanghaoshuang 5993155d67 Merge remote-tracking branch 'dzhwinter/windows/support' into windows/support
6 years ago
wanghaoshuang f9e7cfb03c save binary file
6 years ago
tensor-tang 032c3a07e3 Merge remote-tracking branch 'ups/develop' into refine/jit/gru
6 years ago
tensor-tang 159be8cc63 optimize fusion gru kernel at size 8
6 years ago
Tao Luo 23da8defc8
Merge pull request #14028 from luotao1/fix_resnet50_test
6 years ago
Yu Yang 71c846ef8a Revert buggy changes
6 years ago
JiabinYang ff07dc315e test=develop
6 years ago
chengduo a7497653d0
Refine Split op (#13967)
6 years ago
Yu Yang dbf9f6f408 Fix distribute compile
6 years ago
jerrywgz e0708e62ba refine code
6 years ago
jerrywgz 1c591c3909
Merge branch 'develop' into fix_rpn_target_assign_op
6 years ago
sneaxiy a9d7a9d720 test=develop
6 years ago
Tao Luo 316bc9bfc9 fix typo and warning in analyzer_resnet50_test
6 years ago
jerrywgz f06c6193d7 fix rpn target assign test=develop
6 years ago
dongzhihong 563e7bca7f "fix op. test=develop"
6 years ago
Xin Pan 8f2116d8fa clean up after the changes have been stopped for so long.
6 years ago
tensor-tang 83dc689877 Merge remote-tracking branch 'ups/develop' into refine/jit/gru
6 years ago
tensor-tang 640e789d3d add fusion gru jit kernel
6 years ago
JiabinYang 39d39775c3 test=develop
6 years ago
JiabinYang 70351de1b5 test=develop
6 years ago
Yu Yang 461f71a90b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
qingqing01 0e24138494
Merge pull request #13991 from qingqing01/refine_generate_proposals_op
6 years ago
gongweibao 58c027cc38
Add rpc profiler flags. (#13989)
6 years ago
Tao Luo 42aa1d409d
Merge pull request #13485 from tpatejko/tpatejko/capi-resnet-conv-elementwise-fusion
6 years ago
tensor-tang 664159ad42
Merge pull request #13998 from tensor-tang/fea/fusion_seqconv_add
6 years ago
jerrywgz 765085d297
Merge pull request #13904 from jerrywgz/roialign
6 years ago
Dang Qingqing 56936b9e25 Refine doc for generate_proposals_op.
6 years ago
Tomasz Patejko 4be45af1cc MKLDNN conv + elementwise_add fusion: skip connection attribute renamed. Comments about patterns added.
6 years ago
Michal Gallus f688197182 MKLDNN conv + elementwise_add fusion: Fix output_data to point to the right tensor, also fix transpiler integration
6 years ago
Tomasz Patejko bf95ac36a7 MKLDNN conv + elementwise_add fusion: further reformatting
6 years ago
Tomasz Patejko b8e54ab5cc MKLDNN conv + elementwise_add fusion: parameter name changed to ResidualData
6 years ago
Tomasz Patejko 41f3d78fdf MKLDNN conv + elementwise_add fusion: output and elemwise param share data in conv primitive. Output is properly allocated
6 years ago
Tomasz Patejko 56528531ea MKLDNN conv + elementwis_add fusion: initial work on passing eltwise data to conv primitive
6 years ago
Dang Qingqing 4801ee8f97 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_generate_proposals_op
6 years ago
tensor-tang 23fc896bc2 Merge remote-tracking branch 'ups/develop' into fea/fusion_seqconv_add
6 years ago
tensor-tang 339e655aec refine and add seqconv elementwiseadd relu op test
6 years ago
jerrywgz a1d3db031b
Merge pull request #13844 from jerrywgz/fix_roi_pool
6 years ago
Dang Qingqing 8e0b9496de Fix unit test
6 years ago
tensor-tang 0a9f5f1790
Merge pull request #13968 from tensor-tang/fix/jit/exp
6 years ago
Yipeng fcb2e8103e Ocr end2end dev (#13889)
6 years ago
tensor-tang e5ce965952 refine and add eltadd_relu unit test
6 years ago
sneaxiy 5a38930660 test=develop
6 years ago
tensor-tang 7cb19a5976 fuse elementwise_add and relu
6 years ago
tensor-tang 3c249283af init seqconv eltadd relu op
6 years ago
sneaxiy ac2eba4457 test=develop
6 years ago
jerrywgz 553342624e test=develop
6 years ago
jerrywgz 9a14ca91b8 test=develop
6 years ago
tensor-tang 60ff05e312 Merge branch 'luotao1-fix_rnn2_test' into fix/jit/exp
6 years ago
Tao Luo 7d680be5a3 Merge branch 'develop' into mkldnn_test
6 years ago
buxingyuan 0bb3b099c2 generate_proposal_labels doc
6 years ago
gongweibao a831ecc75d
Add grpc error context. (#13957)
6 years ago
tensor-tang b139b687de Merge remote-tracking branch 'ups/develop' into fix/jit/exp
6 years ago
qingqing01 67a2b5215d
Add affine channel op to speed and save memory for faster-rcnn model. (#13919)
6 years ago
tensor-tang 748435586a clean code exp avx
6 years ago
tensor-tang b4751a34a5 fix illegal instruction of rnn2
6 years ago
tensor-tang 30dfbdee7f
Merge pull request #13951 from tensor-tang/fix/warning
6 years ago
tensor-tang 36588b3365 fix illegal instruction of rnn1 and text
6 years ago
Tao Luo 6a4e9230ed Merge branch 'develop' into mkldnn_test
6 years ago
gongweibao 078223b3e3
Add rpc timeline. (#13900)
6 years ago
dzhwinter 29382db625
Merge pull request #13874 from dzhwinter/fix/momentum
6 years ago
qingqing01 5dbb2e9986
Small changes for sum_op to avoid zero setting. (#13923)
6 years ago
Tao Luo e47f4186ae fix some compiler warning
6 years ago
dzhwinter 00e8791f66 fix compile in cpu error. test=develop
6 years ago
tensor-tang e69328c3bc fix warning and mac compile
6 years ago
dzhwinter d239cf2e15 use binary search. test=develop
6 years ago
dzhwinter a9f5f822e6 use binary search. test=develop
6 years ago
tensor-tang 6447155dac
Merge pull request #13851 from tensor-tang/fea/jitkernel_peephole
6 years ago
sneaxiy 4b4af84e67 test=develop
6 years ago
jerrywgz 4c9884e713 refine unittest test=develop
6 years ago
Qiao Longfei 0225957515 change elementwise_add to elementwise_add_to test=develop
6 years ago
Qiao Longfei bd2b6d7f8f sum_op support inplace
6 years ago
Xin Pan 7fb5b66ac2
Merge pull request #13916 from panyx0718/fix2
6 years ago
dzhwinter 3861269594 merge develop branch
6 years ago
jerrywgz 98c3294b85 Merge branch 'roialign' of https://github.com/jerrywgz/Paddle into roialign
6 years ago
tangwei12 fa2ab3346c
fill constant add infervarshape, lookuptable clone lr var (#13830)
6 years ago
jerrywgz 8c79071d6a roi_align for gpu
6 years ago
Xin Pan 342e436158 Make Var::GetMutable robust
6 years ago
Yan Chunwei 7a751b83ac fix isfinite_op sprintf (#13850)
6 years ago
Qiyang Min e3a64fca44
Merge pull request #13835 from velconia/fix_reshape_op
6 years ago
Yibing Liu 46b0b7903c
Merge pull request #13856 from kuke/seq_unpad_op
6 years ago
Qiao Longfei b4a32eafdf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-sum-seq-pooling-op
6 years ago
jerrywgz c9d2046f76 roi_align for gpu
6 years ago
jerrywgz 2f5a80174e add roi_align api
6 years ago
dzhwinter e41a3fcd68 fix update to develop hang problem.
6 years ago
Zeng Jinle 93606c2c2c
Merge pull request #13689 from sneaxiy/sparse_rmsprop
6 years ago
Qiao Longfei 681226e97c
Merge pull request #13864 from jacquesqiao/py-reader-add-test-mode
6 years ago
jerrywgz 90f39b1123 Merge branch 'roialign' of https://github.com/jerrywgz/Paddle into roialign
6 years ago
Xin Pan 288a112ffd
Revert "Revert "Revert "Make variable::GetMutable robust"""
6 years ago
sneaxiy 5cedfb60c8 test=develop
6 years ago
jerrywgz 5e52dafda5 add roi align
6 years ago
jerrywgz c0e34eebec add roi align
6 years ago
Xin Pan fededdda20
Merge pull request #13872 from panyx0718/fix2
6 years ago
Qiao Longfei 60030e8678 change the use of FLAGS_reader_queue_speed_test_mode
6 years ago
Qiao Longfei 936926aadd code optimize
6 years ago
Qiyang Min cab29828a5
Merge pull request #13829 from velconia/accelerate_sequence_pool_op
6 years ago
Xin Pan ddb76d0d09 Make GetMutable more robust
6 years ago
Qiao Longfei c52ccbc109 clean code
6 years ago
Qiao Longfei 6056d04361 optimize blas call
6 years ago
Qiao Longfei 5db7551317 optimize code
6 years ago
minqiyang d9b202e717 Move tensor copy src_ptr and dst_ptr check to TensorCopy function
6 years ago
chengduo 8e2fdc54b1
Add check for opt op (#13840)
6 years ago
Qiao Longfei eb6d9e3bbe Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-sum-seq-pooling-op
6 years ago
Qiao Longfei 0170d36c42 fix a bug
6 years ago
Qiyang Min e37c9e6732
Merge pull request #13828 from velconia/accelerate_selected_rows_functor
6 years ago
Qiao Longfei 86e2e686ee fix bug
6 years ago
Qiao Longfei 333fd15204 add gpu test for mrege add
6 years ago
Qiao Longfei ab3e36da80 update MergeAdd for selected_rows_functor.cu
6 years ago
sneaxiy 84d9300365 test=develop
6 years ago
Qiao Longfei d5c64af24f change map to unordered_map
6 years ago
Qiao Longfei 005f1923a2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-sum-seq-pooling-op
6 years ago
wanghaoshuang 3ae9645084 compile in linux
6 years ago
dzhwinter 8329a1f139 add sparse update momentum. test=develop
6 years ago
Qiao Longfei c61e16b181 add reader_queue_speed_test_mode_flag test
6 years ago
Qiao Longfei 8686f7c68e add reader_queue_speed_test_mode flag for speed test
6 years ago
chengduo 9c77b65c06
Fix layers.uniform_random (#13823)
6 years ago
Yibing Liu cbe4292516 Add sequence unpad op
6 years ago
tensor-tang bcb8ea397d Merge remote-tracking branch 'ups/develop' into fea/jitkernel_peephole
6 years ago
tensor-tang 8e182170ba refine and replace lstm peephole kernel
6 years ago
jerrywgz 3c963336e4 fix roi pool register
6 years ago
Dun 5f2e837847 optimize depthwise conv by register memory (#13778)
6 years ago
minqiyang 3f6ec90060 Polish code
6 years ago
tensor-tang 7ef2699e18 init peephole runtime kernel
6 years ago
Qiyang Min f99ea99e36
Merge pull request #13720 from velconia/fix_grad_clip
6 years ago
minqiyang f40848828d Polish code
6 years ago
minqiyang e2e82bde32 Accelerate Reshape op
6 years ago
minqiyang 0385b0a1ea Accelerate SequencePool Op on SUM mode
6 years ago
minqiyang 8ec748cfa0 Accelerate SelectedRows Functors:
6 years ago
Qiao Longfei 38568519f7 optimize code
6 years ago
whs 7e651c8641
Fix truncated norm (#13785)
6 years ago
minqiyang 1456b8ec7d Add unittest for clip_by_norm_op with SelectedRows
6 years ago
Tao Luo fd0dd07ab4
Merge pull request #13726 from jczaja/prv-fused_embedding_fc_lstm-ut
6 years ago
sneaxiy ea61e4ef15 merge develop
6 years ago
chengduo e1761709f8
Set the right shape of selected_rows (#13723)
6 years ago
tensor-tang 3ee8f2c6cf thread local jit kernels
6 years ago
tensor-tang 9131a35676 replace the lstm compute with jitkernel
6 years ago
tensor-tang b55c247678 add lstm compute unit test
6 years ago
qingqing01 6094a72308
Fix bug in reduce_op caused by PR #13534 (#13748)
6 years ago
sneaxiy 4c672ab1a2 Merge reyoung:rewrite_allocation
6 years ago
dzhwinter a46e30aa6d
enhance isinf/isnan in tensor util, avoid copy back to cpu (#12688)
6 years ago
tensor-tang 2a00969165 optimize lstm jitkernel keq8
6 years ago
tensor-tang f2adaf1c3e add vrelu and lstm kernel
6 years ago
Jacek Czaja 9f15d8817e - Cleanup as suggessted by reviewers
6 years ago
Jacek Czaja ae8b4717cc - Cleaning fused_embedding_fc_lstm op
6 years ago
Jacek Czaja fd31b54cf1 - Removed disabled code
6 years ago
Jacek Czaja f9da2d6416 - Removed disabled diagnostic code
6 years ago
Jacek Czaja 809dbc5c17 - Added file for fused_embedded_fc_lstm_op unit test
6 years ago
tensor-tang e6d8aca3bf refine code and fix
6 years ago
qingqing01 41e4f7ea0a
Optimize Topk when height is large. (#13710)
6 years ago
qiaolongfei 1a59880084 update test_sum_op
6 years ago
minqiyang bcd8c2ccc3 Add unit test
6 years ago
qiaolongfei 40d3bd4e81 selected rows merge add support multi input
6 years ago
tensor-tang ea7dc9cbf6 Merge remote-tracking branch 'ups/develop' into fea/jitkernel
6 years ago
tensor-tang 2513b2cc4e fix bug vtanh
6 years ago
chengduoZH e59ab42caa add nodes for drnn
6 years ago
minqiyang f20fc95539 Resize output ddims and rows
6 years ago
Yu Yang 15076c325e Add comments and polish code style
6 years ago
tensor-tang 5ef14dd386
Merge pull request #13715 from tensor-tang/fix/op
6 years ago
minqiyang 67308822f8 Add selected_rows merge for clip_by_norm op
6 years ago
Xin Pan c0dfd5eee8
Merge pull request #13712 from luotao1/doc_element
6 years ago
tensor-tang ea0b98e007 bugfix: fusion lstm and gru batch,seq mode switch
6 years ago
Tao Luo 69ed75e77c refine elementwise doc
6 years ago
Tao Luo 49cd43000c
Merge pull request #12981 from chenwhql/sequence_erase_op_infershape_note_polish
6 years ago
tensor-tang cf8c8e72bd add vtanh and unit test
6 years ago
Tao Luo 067b8582af
Merge pull request #13625 from wanghaoshuang/fix_warning_roi
6 years ago
Yu Yang e25240c22a Refine
6 years ago
tensor-tang b37fe30417
Merge pull request #13690 from wangguibao/fix_cpu_lstm_compute_cc
6 years ago
dzhwinter 26771f41ba
"fix compile error" (#13579)
6 years ago
Yu Yang 6ca37448ac Refine prelu_op
6 years ago
Yu Yang 186b2b138d
Revert "Merge pull request #13201 from reyoung/revert_callstack" (#13697)
6 years ago
tensor-tang d10a9df7b8 add vaddbias and unit test
6 years ago
tensor-tang 3c8b651187 add vsigmoid avx implementations and unit test
6 years ago
Yu Yang 31270e58d0 Add communication attr
6 years ago
sneaxiy 6f748a035d test=develop
6 years ago
tensor-tang 55e44761fb refine code and init vsigmoid
6 years ago
wangguibao 1940bc2d83 Avoid multiple definitions of lstm_compute_ctht when linking libpaddle_fluid.so
6 years ago
sneaxiy 584c3f048f fix sparse rmsprop
6 years ago
Yu Yang 8e3fdc6e65 Fix SetDevice on init
6 years ago
Xin Pan 425a882165
Merge pull request #13643 from panyx0718/ir2
6 years ago
Yu Yang 524f6e9b36 Refine code
6 years ago
Xin Pan 642905958a fix compile error
6 years ago
Xin Pan 33b68fdf25 fix compile error
6 years ago
tensor-tang ede4b230be
Merge pull request #13553 from jczaja/prv-fused_embedding_fc_lstm_op
6 years ago
Yu Yang 5cf395beaf Fix bug in uts
6 years ago
Xin Pan 6746b1fdf3 add missing header
6 years ago
Dun 161c3e31f7 Optimization of Kernels that related to DeepLabv3+ (#13534)
6 years ago
Xin Pan 5fb72d840a add header
6 years ago
Xin Pan ddd60581b7 clean up channel
6 years ago
tensor-tang 2d0ff6a3c2 add vexp and unit test
6 years ago
tensor-tang b3c63f40fa add vscal and unit test
6 years ago
Yu Yang a6fbf7ec28 Merge branch 'feature/refine_generate_proposals_op' into rewrite_allocation
6 years ago
tensor-tang 0987f2b4d9 add vadd unit test
6 years ago
tensor-tang 3d928d4f9d refine and seepdup
6 years ago
tensor-tang 77fc42d2d1 Merge remote-tracking branch 'ups/develop' into fea/jitkernel
6 years ago
Yu Yang 593ad763cd refactor(op): polish generate_proposals_op
6 years ago
tensor-tang 2937314d8e refine vmul and test
6 years ago
Yu Yang 58ed412f68 refactor(memory): rewrite memory allocation and make it extentable
6 years ago
tensor-tang 6c986e127a fix macro and add vmul unit test
6 years ago
chengduo 43a3af86be
refine sgd_op (#13626)
6 years ago
Xin Pan 2c01c2216a
Merge pull request #13531 from gongweibao/generator2
6 years ago
Yu Yang 0be1582df0
Merge pull request #13525 from reyoung/fix_mixed_vector
6 years ago
Jacek Czaja d5114c60b0 - Reviewers suggesstions to fused_embedding_fc_lstm_op
6 years ago
Jacek Czaja 7ab5626dee - Added initial pass for embedding-fc-lstm
6 years ago
qingqing01 fd4c4df93d
Cuda speed for generate_proposals_op. (#13596)
6 years ago
tensor-tang 8c69764d12 add vmul unit tests
6 years ago