Commit Graph

2296 Commits (2414f92f54c3b49e30f976a5ff942cc8e89c6cd4)

Author SHA1 Message Date
jerrywgz 8c79071d6a roi_align for gpu
7 years ago
Xin Pan 342e436158 Make Var::GetMutable robust
7 years ago
Yan Chunwei 7a751b83ac fix isfinite_op sprintf (#13850)
7 years ago
Qiyang Min e3a64fca44 Merge pull request #13835 from velconia/fix_reshape_op
7 years ago
Yibing Liu 46b0b7903c Merge pull request #13856 from kuke/seq_unpad_op
7 years ago
jerrywgz c9d2046f76 roi_align for gpu
7 years ago
jerrywgz 2f5a80174e add roi_align api
7 years ago
Zeng Jinle 93606c2c2c Merge pull request #13689 from sneaxiy/sparse_rmsprop
7 years ago
Qiao Longfei 681226e97c Merge pull request #13864 from jacquesqiao/py-reader-add-test-mode
7 years ago
jerrywgz 90f39b1123 Merge branch 'roialign' of https://github.com/jerrywgz/Paddle into roialign
7 years ago
Xin Pan 288a112ffd Revert "Revert "Revert "Make variable::GetMutable robust"""
7 years ago
sneaxiy 5cedfb60c8 test=develop
7 years ago
jerrywgz 5e52dafda5 add roi align
7 years ago
jerrywgz c0e34eebec add roi align
7 years ago
Xin Pan fededdda20 Merge pull request #13872 from panyx0718/fix2
7 years ago
Qiao Longfei 60030e8678 change the use of FLAGS_reader_queue_speed_test_mode
7 years ago
Qiyang Min cab29828a5 Merge pull request #13829 from velconia/accelerate_sequence_pool_op
7 years ago
Xin Pan ddb76d0d09 Make GetMutable more robust
7 years ago
minqiyang d9b202e717 Move tensor copy src_ptr and dst_ptr check to TensorCopy function
7 years ago
chengduo 8e2fdc54b1 Add check for opt op (#13840)
7 years ago
Qiyang Min e37c9e6732 Merge pull request #13828 from velconia/accelerate_selected_rows_functor
7 years ago
sneaxiy 84d9300365 test=develop
7 years ago
dzhwinter 8329a1f139 add sparse update momentum. test=develop
7 years ago
Qiao Longfei c61e16b181 add reader_queue_speed_test_mode_flag test
7 years ago
Qiao Longfei 8686f7c68e add reader_queue_speed_test_mode flag for speed test
7 years ago
chengduo 9c77b65c06 Fix layers.uniform_random (#13823)
7 years ago
Yibing Liu cbe4292516 Add sequence unpad op
7 years ago
tensor-tang bcb8ea397d Merge remote-tracking branch 'ups/develop' into fea/jitkernel_peephole
7 years ago
tensor-tang 8e182170ba refine and replace lstm peephole kernel
7 years ago
jerrywgz 3c963336e4 fix roi pool register
7 years ago
Dun 5f2e837847 optimize depthwise conv by register memory (#13778)
7 years ago
minqiyang 3f6ec90060 Polish code
7 years ago
tensor-tang 7ef2699e18 init peephole runtime kernel
7 years ago
Qiyang Min f99ea99e36 Merge pull request #13720 from velconia/fix_grad_clip
7 years ago
minqiyang f40848828d Polish code
7 years ago
minqiyang e2e82bde32 Accelerate Reshape op
7 years ago
minqiyang 0385b0a1ea Accelerate SequencePool Op on SUM mode
7 years ago
minqiyang 8ec748cfa0 Accelerate SelectedRows Functors:
7 years ago
whs 7e651c8641 Fix truncated norm (#13785)
7 years ago
minqiyang 1456b8ec7d Add unittest for clip_by_norm_op with SelectedRows
7 years ago
Tao Luo fd0dd07ab4 Merge pull request #13726 from jczaja/prv-fused_embedding_fc_lstm-ut
7 years ago
chengduo e1761709f8 Set the right shape of selected_rows (#13723)
7 years ago
tensor-tang 3ee8f2c6cf thread local jit kernels
7 years ago
tensor-tang 9131a35676 replace the lstm compute with jitkernel
7 years ago
tensor-tang b55c247678 add lstm compute unit test
7 years ago
qingqing01 6094a72308 Fix bug in reduce_op caused by PR #13534 (#13748)
7 years ago
dzhwinter a46e30aa6d enhance isinf/isnan in tensor util, avoid copy back to cpu (#12688)
7 years ago
tensor-tang 2a00969165 optimize lstm jitkernel keq8
7 years ago
tensor-tang f2adaf1c3e add vrelu and lstm kernel
7 years ago
Jacek Czaja 9f15d8817e - Cleanup as suggessted by reviewers
7 years ago
Jacek Czaja ae8b4717cc - Cleaning fused_embedding_fc_lstm op
7 years ago
Jacek Czaja fd31b54cf1 - Removed disabled code
7 years ago
Jacek Czaja f9da2d6416 - Removed disabled diagnostic code
7 years ago
Jacek Czaja 809dbc5c17 - Added file for fused_embedded_fc_lstm_op unit test
7 years ago
tensor-tang e6d8aca3bf refine code and fix
7 years ago
qingqing01 41e4f7ea0a Optimize Topk when height is large. (#13710)
7 years ago
minqiyang bcd8c2ccc3 Add unit test
7 years ago
tensor-tang ea7dc9cbf6 Merge remote-tracking branch 'ups/develop' into fea/jitkernel
7 years ago
tensor-tang 2513b2cc4e fix bug vtanh
7 years ago
chengduoZH e59ab42caa add nodes for drnn
7 years ago
minqiyang f20fc95539 Resize output ddims and rows
7 years ago
tensor-tang 5ef14dd386 Merge pull request #13715 from tensor-tang/fix/op
7 years ago
minqiyang 67308822f8 Add selected_rows merge for clip_by_norm op
7 years ago
Xin Pan c0dfd5eee8 Merge pull request #13712 from luotao1/doc_element
7 years ago
tensor-tang ea0b98e007 bugfix: fusion lstm and gru batch,seq mode switch
7 years ago
Tao Luo 69ed75e77c refine elementwise doc
7 years ago
Tao Luo 49cd43000c Merge pull request #12981 from chenwhql/sequence_erase_op_infershape_note_polish
7 years ago
tensor-tang cf8c8e72bd add vtanh and unit test
7 years ago
Tao Luo 067b8582af Merge pull request #13625 from wanghaoshuang/fix_warning_roi
7 years ago
tensor-tang b37fe30417 Merge pull request #13690 from wangguibao/fix_cpu_lstm_compute_cc
7 years ago
dzhwinter 26771f41ba "fix compile error" (#13579)
7 years ago
Yu Yang 186b2b138d Revert "Merge pull request #13201 from reyoung/revert_callstack" (#13697)
7 years ago
tensor-tang d10a9df7b8 add vaddbias and unit test
7 years ago
tensor-tang 3c8b651187 add vsigmoid avx implementations and unit test
7 years ago
sneaxiy 6f748a035d test=develop
7 years ago
tensor-tang 55e44761fb refine code and init vsigmoid
7 years ago
wangguibao 1940bc2d83 Avoid multiple definitions of lstm_compute_ctht when linking libpaddle_fluid.so
7 years ago
sneaxiy 584c3f048f fix sparse rmsprop
7 years ago
Xin Pan 425a882165 Merge pull request #13643 from panyx0718/ir2
7 years ago
Xin Pan 642905958a fix compile error
7 years ago
Xin Pan 33b68fdf25 fix compile error
7 years ago
tensor-tang ede4b230be Merge pull request #13553 from jczaja/prv-fused_embedding_fc_lstm_op
7 years ago
Xin Pan 6746b1fdf3 add missing header
7 years ago
Dun 161c3e31f7 Optimization of Kernels that related to DeepLabv3+ (#13534)
7 years ago
Xin Pan 5fb72d840a add header
7 years ago
Xin Pan ddd60581b7 clean up channel
7 years ago
tensor-tang 2d0ff6a3c2 add vexp and unit test
7 years ago
tensor-tang b3c63f40fa add vscal and unit test
7 years ago
tensor-tang 0987f2b4d9 add vadd unit test
7 years ago
tensor-tang 3d928d4f9d refine and seepdup
7 years ago
tensor-tang 77fc42d2d1 Merge remote-tracking branch 'ups/develop' into fea/jitkernel
7 years ago
Yu Yang 593ad763cd refactor(op): polish generate_proposals_op
7 years ago
tensor-tang 2937314d8e refine vmul and test
7 years ago
tensor-tang 6c986e127a fix macro and add vmul unit test
7 years ago
chengduo 43a3af86be refine sgd_op (#13626)
7 years ago
Xin Pan 2c01c2216a Merge pull request #13531 from gongweibao/generator2
7 years ago
Yu Yang 0be1582df0 Merge pull request #13525 from reyoung/fix_mixed_vector
7 years ago
Jacek Czaja d5114c60b0 - Reviewers suggesstions to fused_embedding_fc_lstm_op
7 years ago
Jacek Czaja 7ab5626dee - Added initial pass for embedding-fc-lstm
7 years ago
qingqing01 fd4c4df93d Cuda speed for generate_proposals_op. (#13596)
7 years ago
tensor-tang 8c69764d12 add vmul unit tests
7 years ago
tensor-tang 084893a9a9 add vadd kernel
7 years ago
wanghaoshuang 153d4f5d15 test=develop
7 years ago
wanghaoshuang 5d7395cd0f Fix warning of roi perspective transform op.
7 years ago
Wu Yi 10dffc68cd Merge pull request #13618 from typhoonzero/revert_13530
7 years ago
tangwei12 97cf1eb6d7 Add distributed unit tests about text_classification/simnet-bow/ctr (#12812)
7 years ago
typhoonzero a4f7696a18 Revert "Some trivial optimization (#13530)"
7 years ago
tangwei12 85362e98dd Batch AUC (#13567)
7 years ago
tensor-tang 6938e6cf06 Merge pull request #13603 from tensor-tang/refine/peephole
7 years ago
Zhaolong Xing 9b03d53543 Merge pull request #13469 from NHZlX/add_ut_for_trt
7 years ago
tensor-tang eeff268a6c clean and refine kernels
7 years ago
tensor-tang dee5d35c20 refine vmul
7 years ago
tensor-tang 209e9c3db1 refine peephole
7 years ago
chengduo 1d91a49d2f Some trivial optimization (#13530)
7 years ago
ktlichkid 5093afced1 Fix bug in sequence_slice_op
7 years ago
tensor-tang 92031968d7 init vmul kernel
7 years ago
tensor-tang b9acbcc8c5 init lstm kernel
7 years ago
tensor-tang c260bf942d init jit kernel
7 years ago
nhzlx 5c57e15044 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_ut_for_trt
7 years ago
Tao Luo c07b2a97a9 Merge pull request #13521 from Sand3r-/mgallus/fix-pooling-ceiled-size
7 years ago
dzhwinter cc20867d49 flags (#13542)
7 years ago
Michal Gallus 0e6b303f54 MKLDNN Pooling: inline functions handling ceiled mode
7 years ago
Michal Gallus f465b03ef9 Enable MKLDNN in Analysis Predictor
7 years ago
gongweibao 1113337764 merge
7 years ago
Yu Yang 7119d6c3cf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_callstack
7 years ago
Jiabin Yang efc2ac950c Merge pull request #13527 from JiabinYang/mac/fix_mac_compile
7 years ago
nhzlx baae7e4f63 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_ut_for_trt
7 years ago
nhzlx 2763321684 fix comments
7 years ago
sneaxiy 6724801884 fix conflict
7 years ago
chengduo cdf3a4c244 Fix concat_op InferShape (#13513)
7 years ago
gongweibao bee213e51f fix conflict
7 years ago
sneaxiy 3ad3635de0 fix conflict
7 years ago
sneaxiy 3ee0a6489d remove kwargs in python api
7 years ago
Zeng Jinle f855c05fff Merge pull request #13520 from sneaxiy/enhance_eager_delete
7 years ago
gongweibao 01396d91db Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into generator2
7 years ago
nhzlx 0514882bc5 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_ut_for_trt
7 years ago
gongweibao dda9c3556e fix
7 years ago
JiabinYang c324cdeff8 fix roi_perspective_transform_op.cc unused variable caused error on macos
7 years ago
Yu Yang 3043f51b3a Merge pull request #13511 from reyoung/fix_ce
7 years ago
gongweibao ff47841796 fix
7 years ago
Yu Yang f7af695801 Merge pull request #13505 from reyoung/fix_selected_rows_functor_test
7 years ago
Wu Yi 29c63d180f [Feature] dist op role and lr op role, to support memory optimize with dist training (#13220)
7 years ago
Yu Yang e1913bc5d8 Fix MixedVector
7 years ago
whs fc44087d31 Add roi perspective transform op. (#13176)
7 years ago
sneaxiy 192c49cb07 modification
7 years ago
sneaxiy 0a36ef3c18 enhance eager deletion
7 years ago
Yu Yang 6d2c6f96f1 Revert "Revert "Merge pull request #13431 from chengduoZH/refine_lod""
7 years ago
Yu Yang a6c8d6b9a2 Revert "Merge pull request #13431 from chengduoZH/refine_lod"
7 years ago
Zeng Jinle 7f1e312677 Merge pull request #13456 from sneaxiy/refine_sparse_adam
7 years ago
Yu Yang b5996fa124 Fix unstable selected_rows_functor_test.cu
7 years ago