Commit Graph

11210 Commits (b58957d9d792b8ec85ad460a02ecc1f13575e7cd)

Author SHA1 Message Date
Tao Luo 7d680be5a3 Merge branch 'develop' into mkldnn_test
7 years ago
Wojciech Uss 55fd136ab0 Added comment with request for enhancement
7 years ago
gongweibao a831ecc75d
Add grpc error context. (#13957)
7 years ago
tensor-tang b139b687de Merge remote-tracking branch 'ups/develop' into fix/jit/exp
7 years ago
qingqing01 67a2b5215d
Add affine channel op to speed and save memory for faster-rcnn model. (#13919)
7 years ago
tensor-tang 748435586a clean code exp avx
7 years ago
tensor-tang b4751a34a5 fix illegal instruction of rnn2
7 years ago
Xin Pan 6de08b5eef set default timeout to avoiding blocking CI
7 years ago
tensor-tang 30dfbdee7f
Merge pull request #13951 from tensor-tang/fix/warning
7 years ago
Tao Luo 34ed7d1379
Merge pull request #13924 from luotao1/clean_inference_lib
7 years ago
tensor-tang 36588b3365 fix illegal instruction of rnn1 and text
7 years ago
Tao Luo 6a4e9230ed Merge branch 'develop' into mkldnn_test
7 years ago
gongweibao 078223b3e3
Add rpc timeline. (#13900)
7 years ago
dzhwinter 29382db625
Merge pull request #13874 from dzhwinter/fix/momentum
7 years ago
Xin Pan 6a54c3de1f
Merge pull request #13928 from panyx0718/doc
7 years ago
qingqing01 5dbb2e9986
Small changes for sum_op to avoid zero setting. (#13923)
7 years ago
Tao Luo b819684370 add compare_mkldnn test
7 years ago
Tao Luo e47f4186ae fix some compiler warning
7 years ago
Jiabin Yang 6553956bd6
Merge pull request #13931 from JiabinYang/fix_dist_on_mac
7 years ago
dzhwinter 00e8791f66 fix compile in cpu error. test=develop
7 years ago
tensor-tang e69328c3bc fix warning and mac compile
7 years ago
Tao Luo 6ea9d1b595 add analysis_predictor in vis_demo
7 years ago
Tao Luo f444a7226e Merge branch 'develop' into clean_inference_lib
7 years ago
Tao Luo 3598500773
Merge pull request #13867 from Superjomn/clean/CreatePaddlePredictor
7 years ago
Tao Luo 41eeb771e8 Merge branch 'develop' into clean_inference_lib
7 years ago
dzhwinter d239cf2e15 use binary search. test=develop
7 years ago
dzhwinter a9f5f822e6 use binary search. test=develop
7 years ago
Tao Luo b854d959a5 update with comments
7 years ago
Tao Luo 75bb0babef Merge branch 'develop' into mkldnn_test
7 years ago
tensor-tang 6447155dac
Merge pull request #13851 from tensor-tang/fea/jitkernel_peephole
7 years ago
jerrywgz 4c9884e713 refine unittest test=develop
7 years ago
JiabinYang 02f863400e test=develop
7 years ago
Xin Pan 7fb5b66ac2
Merge pull request #13916 from panyx0718/fix2
7 years ago
Yan Chunwei 6809238d97
fix analysis predictor profile (#13896)
7 years ago
Xin Pan abbfb60ca9 remove unused codes
7 years ago
Yibing Liu 6b795d424c
Merge pull request #13901 from kuke/seq_slice_py
7 years ago
dzhwinter 3861269594 merge develop branch
7 years ago
jerrywgz 98c3294b85 Merge branch 'roialign' of https://github.com/jerrywgz/Paddle into roialign
7 years ago
Tao Luo a35e7f4bae adjust demo_ci with fluid_inference_install_dir
7 years ago
tangwei12 fa2ab3346c
fill constant add infervarshape, lookuptable clone lr var (#13830)
7 years ago
jerrywgz 8c79071d6a roi_align for gpu
7 years ago
Xin Pan 342e436158 Make Var::GetMutable robust
7 years ago
Yan Chunwei 7a751b83ac fix isfinite_op sprintf (#13850)
7 years ago
Qiyang Min e3a64fca44
Merge pull request #13835 from velconia/fix_reshape_op
7 years ago
Yibing Liu 46b0b7903c
Merge pull request #13856 from kuke/seq_unpad_op
7 years ago
tensor-tang dcfb687584
Merge pull request #13846 from tensor-tang/fix/mkldnn
7 years ago
Tao Luo bd77460182 refine mkldnn test in analyzer_tests
7 years ago
jerrywgz c9d2046f76 roi_align for gpu
7 years ago
jerrywgz 2f5a80174e add roi_align api
7 years ago
Zeng Jinle af91d41ab8
Merge pull request #13852 from sneaxiy/feature/eager_delete_tensor
7 years ago
Zeng Jinle 93606c2c2c
Merge pull request #13689 from sneaxiy/sparse_rmsprop
7 years ago
Qiao Longfei 681226e97c
Merge pull request #13864 from jacquesqiao/py-reader-add-test-mode
7 years ago
jerrywgz 90f39b1123 Merge branch 'roialign' of https://github.com/jerrywgz/Paddle into roialign
7 years ago
Tao Luo f8874b3cb2
Merge pull request #13884 from luotao1/rename_inference_lib_dist
7 years ago
Xin Pan 288a112ffd
Revert "Revert "Revert "Make variable::GetMutable robust"""
7 years ago
sneaxiy 5cedfb60c8 test=develop
7 years ago
Yibing Liu b785798585 Expose layer's name for sequence pad & unpad
7 years ago
Yibing Liu 18e1c1e07d Update API spec for seq slice
7 years ago
jerrywgz 5e52dafda5 add roi align
7 years ago
jerrywgz c0e34eebec add roi align
7 years ago
Tao Luo c26f2b21eb
Merge pull request #13813 from sfraczek/sfraczek/conv-bn-fuse-pass-full-eigen
7 years ago
Yibing Liu 16b2c6dc78 Add py api for sequence_slice_op
7 years ago
superjomn 1cfd2b51a7 update
7 years ago
Xin Pan fededdda20
Merge pull request #13872 from panyx0718/fix2
7 years ago
Qiao Longfei b16e9cd105
a small fix for compile WITH_INFERENCE=OFF (#13869)
7 years ago
Qiao Longfei ec25a09bd5 revert unused change test=develop
7 years ago
Qiao Longfei 60030e8678 change the use of FLAGS_reader_queue_speed_test_mode
7 years ago
Tao Luo 323d67cfc1
Merge pull request #13879 from panyx0718/doc
7 years ago
Sylwester Fraczek 50c5e9b0c6 reshape_2d used from ddim.h
7 years ago
Qiyang Min cab29828a5
Merge pull request #13829 from velconia/accelerate_sequence_pool_op
7 years ago
minqiyang aeec82acd5 Add unittest for reshape op
7 years ago
Xin Pan ddb76d0d09 Make GetMutable more robust
7 years ago
Qiyang Min c2842377ce
Merge pull request #13837 from velconia/add_pyramid_dnn_support
7 years ago
minqiyang 24c9fbdba3 Polish code
7 years ago
chengduo 2c9839c847
add cuda version display (#13885)
7 years ago
sneaxiy d3ed070e10 test=develop
7 years ago
minqiyang d9b202e717 Move tensor copy src_ptr and dst_ptr check to TensorCopy function
7 years ago
sneaxiy fb6201e93e test=develop
7 years ago
chengduo 8e2fdc54b1
Add check for opt op (#13840)
7 years ago
Yibing Liu 46e61d81a7 Wrapper py api for sequence_unpad
7 years ago
superjomn 28459592cc update
7 years ago
Qiyang Min e37c9e6732
Merge pull request #13828 from velconia/accelerate_selected_rows_functor
7 years ago
Tao Luo 3d976f3f18 rename inference_lib_dist to fluid_lib_dist
7 years ago
sneaxiy 84d9300365 test=develop
7 years ago
chengduo 2562eb92b8
Add strategy doc (#13849)
7 years ago
Xin Pan b11372a0af fix doc
7 years ago
Zhaolong Xing 7413fa458f
Merge pull request #13838 from NHZlX/add_trt_pad_op
7 years ago
superjomn e2bd40ca82 update
7 years ago
dzhwinter 8329a1f139 add sparse update momentum. test=develop
7 years ago
superjomn 049fcbe125 update
7 years ago
superjomn f5c0221c17 clean CreatePaddlePredictor
7 years ago
Xin Pan d852be7c48
Revert "Make variable::GetMutable robust"
7 years ago
Qiao Longfei c61e16b181 add reader_queue_speed_test_mode_flag test
7 years ago
Qiao Longfei 8686f7c68e add reader_queue_speed_test_mode flag for speed test
7 years ago
Xin Pan ae8b1c32bf
Merge pull request #13821 from panyx0718/fix
7 years ago
Xin Pan 9b3e0df375
Merge pull request #13819 from panyx0718/doc
7 years ago
Zhaolong Xing 44f37d019a
Merge pull request #13848 from NHZlX/fix_demo_ci_with_trt
7 years ago
Sylwester Fraczek 55d6950a1a rewrite conv_bn fuse pass to eigen
7 years ago
chengduo 9c77b65c06
Fix layers.uniform_random (#13823)
7 years ago
nhzlx 320c78e16f fix commets
7 years ago
Yibing Liu cbe4292516 Add sequence unpad op
7 years ago
tensor-tang bcb8ea397d Merge remote-tracking branch 'ups/develop' into fea/jitkernel_peephole
7 years ago
tensor-tang 8e182170ba refine and replace lstm peephole kernel
7 years ago
nhzlx efa5bac7ad fix demo_ci bug in vis_demo.cc
7 years ago
tensor-tang dc5a7b906d fix default number of threads when inference with or without MKLDNN
7 years ago
Xin Pan 228506618b Avoid GetMutable implicitly reset Var Type.
7 years ago
jerrywgz 3c963336e4 fix roi pool register
7 years ago
Dun 5f2e837847 optimize depthwise conv by register memory (#13778)
7 years ago
minqiyang 3f6ec90060 Polish code
7 years ago
minqiyang 9878eedbaa Change API.spec
7 years ago
Qiao Longfei 5428cb9908
Profiler support merge data of all thread (#13811)
7 years ago
nhzlx bf7a2789a0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_trt_pad_op
7 years ago
nhzlx 0cb88c34be add op converter
7 years ago
tensor-tang 7ef2699e18 init peephole runtime kernel
7 years ago
Qiyang Min f99ea99e36
Merge pull request #13720 from velconia/fix_grad_clip
7 years ago
minqiyang f40848828d Polish code
7 years ago
minqiyang e2e82bde32 Accelerate Reshape op
7 years ago
minqiyang 0385b0a1ea Accelerate SequencePool Op on SUM mode
7 years ago
minqiyang 8ec748cfa0 Accelerate SelectedRows Functors:
7 years ago
Xin Pan 63b2e98f3d Explain LoD and a few other concepts
7 years ago
Tao Luo 9b11a17502
Revert "[MKLDNN] Pass: Fuse Conv + Bias"
7 years ago
Tao Luo ce248a15d9
Merge pull request #13368 from Sand3r-/mgallus/conv-bias-pass
7 years ago
whs 7e651c8641
Fix truncated norm (#13785)
7 years ago
Tao Luo 16b1beb244
Merge pull request #13486 from sfraczek/sfraczek/conv-bn-fuse-pass
7 years ago
Zhaolong Xing 5d5587fff7
Merge pull request #13792 from NHZlX/trt_dy_lib
7 years ago
Michal Gallus 40b17be4b0 Pass: Fuse Conv + Bias
7 years ago
minqiyang 1456b8ec7d Add unittest for clip_by_norm_op with SelectedRows
7 years ago
Tao Luo fd0dd07ab4
Merge pull request #13726 from jczaja/prv-fused_embedding_fc_lstm-ut
7 years ago
Sylwester Fraczek 3fcca40909 eigen sqrt fix and change 1e-5 to epsilon
7 years ago
Qiao Longfei 5fc305220c
Merge pull request #13787 from PaddlePaddle/revert-13637-optimize-opyreader
7 years ago
nhzlx 9445502f90 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_dy_lib
7 years ago
chengduo e1904ac2c8
Add doc (#13765)
7 years ago
nhzlx d347ea689a fix comments
7 years ago
chengduo e1761709f8
Set the right shape of selected_rows (#13723)
7 years ago
tensor-tang 3ee8f2c6cf thread local jit kernels
7 years ago
tensor-tang 9131a35676 replace the lstm compute with jitkernel
7 years ago
Qiao Longfei 9d087d5139 Revert "optimize pyreader"
7 years ago
tensor-tang b55c247678 add lstm compute unit test
7 years ago
nhzlx f3af90d121 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_dy_lib
7 years ago
nhzlx f569095084 add tensorrt api lib to paddle_fluid
7 years ago
qingqing01 6094a72308
Fix bug in reduce_op caused by PR #13534 (#13748)
7 years ago
Tao Luo 59c306eb52
Merge pull request #13776 from luotao1/revert_fast_math
7 years ago
Tao Luo 84a55155ec revert with_fast_math to ON
7 years ago
Wu Yi cc36bab184
fix manylinux multi arch docker build test=develop (#13770)
7 years ago
Qiao Longfei b1d5135ffb
Merge pull request #13637 from jacquesqiao/optimize-opyreader
7 years ago
dzhwinter a46e30aa6d
enhance isinf/isnan in tensor util, avoid copy back to cpu (#12688)
7 years ago
tensor-tang 2a00969165 optimize lstm jitkernel keq8
7 years ago
tensor-tang f2adaf1c3e add vrelu and lstm kernel
7 years ago
Xin Pan 943e4deb23
Merge pull request #13750 from panyx0718/fix
7 years ago
Jacek Czaja 9f15d8817e - Cleanup as suggessted by reviewers
7 years ago