Commit Graph

6088 Commits (07dc5a1506b4c349b7771f7bec342c11ae0477b1)

Author SHA1 Message Date
Tao Luo dc0c221426 Merge pull request #14803 from mozga-intel/mozga-intel/mean_operator_ngraph
6 years ago
Xin Pan b629133375 checkpoint runnable PyLayer
6 years ago
peizhilin a6f5ceee74 add the python callstack for debug support test=develop
6 years ago
Zeng Jinle dacfaaa966 Revert "Remove op handle lock"
6 years ago
Tao Luo 6ca9a4810b Merge pull request #15196 from luotao1/serial
6 years ago
Xin Pan c4b09a713f polish
6 years ago
minqiyang b76695418a Polish log
6 years ago
minqiyang 1bfbc0d963 Polish code
6 years ago
minqiyang 7f45b9511a Polish code
6 years ago
minqiyang 68a07328fa Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_pyramid_dnn_support
6 years ago
Qiyang Min 317840d3ba Merge pull request #14277 from velconia/add_fused_emb_seq_pool_op
6 years ago
tensor-tang 2dd331cc21 Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat
6 years ago
tensor-tang 316636404f add seqpool concat unit test
6 years ago
Yan Chunwei 6ccf8685f7 refactor tensorrt node teller (#15181)
6 years ago
Tao Luo 7dc0181c46 run analyzer_tester serial in multi-thread
6 years ago
xiaolil1 c8f101e5da Conv int8 relu (#15130)
6 years ago
sneaxiy 9793a0b6a6 fix_cudnn_compatible_check
6 years ago
Zeng Jinle ccb322d6a5 merge develop
6 years ago
Xin Pan 0d0bc61248 update api
6 years ago
tensor-tang 7923d7271f add fusion seqpool concat op
6 years ago
Zeng Jinle f3a13512fc Merge pull request #15139 from sneaxiy/remove_op_handle_lock
6 years ago
Qiao Longfei 44b300556d change min_row_size_to_use_multithread to parameter of adam
6 years ago
Qiao Longfei 87b4eb1da4 change min_param_size_to_use_multithread to min_row_size_to_use_multithread
6 years ago
minqiyang 0f94c1ac14 Polish code
6 years ago
minqiyang 00e4de04bf Polish code
6 years ago
minqiyang 4bfa110fd8 Add no lock optimize pass
6 years ago
chengduo eabb2105fa Refactor MultiDevSSAGraphBuilder (#15090)
6 years ago
Yan Chunwei 875a07c32d refactor inference analysis api (#14634)
6 years ago
minqiyang c09a379015 remove const_cast
6 years ago
tensor-tang 102d93712e Merge remote-tracking branch 'ups/develop' into jit/seqpool
6 years ago
tensor-tang 123b98f417 refine heigth and codesize and support all pool
6 years ago
tensor-tang 0145f40f45 use height from params of jitcode
6 years ago
tensor-tang e0591deebc enhance seqpool jitcode
6 years ago
Zeng Jinle 99e6e8b00f Merge pull request #15179 from sneaxiy/fix_crf_grad_lod
6 years ago
minqiyang db8eb9b688 Polish code
6 years ago
minqiyang f4c990e7b8 Add fused embedding ops
6 years ago
minqiyang 39b98709b1 Move fused ops to fused dir
6 years ago
minqiyang 920d4a8b78 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_fused_emb_seq_pool_op
6 years ago
Tao Luo 5ee596cae5 Merge pull request #15175 from baojun-nervana/intel/mkldnn
6 years ago
乔龙飞 Qiao Longfei 7c891e1ecc Merge pull request #15111 from jacquesqiao/fix-adam-tmp-var
6 years ago
mozga-intel e77956c920 Enable mean operator for a ngraph
6 years ago
mozga-intel dd768714ab Enable scale operator for a ngraph
6 years ago
sneaxiy be425461a1 fix crf grad lod share
6 years ago
Qiao Longfei 3e1b914fcb update gru op forward kernel
6 years ago
Qiao Longfei 7a81ab8607 complete gru_unite_op and test
6 years ago
Qiao Longfei 72618c8da5 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gru-add-mode
6 years ago
Qiao Longfei 17b1b660fc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
6 years ago
Qiao Longfei c15270c5b2 optimize multi thread adam
6 years ago
乔龙飞 Qiao Longfei e1679b8847 Merge pull request #14893 from JiabinYang/feature/add_prefech_hs
6 years ago
baojun-nervana f0cde74564 Update ngraph with elt-wise relu test=develop
6 years ago
tensor-tang 92201d3956 support avg and sqrt pool and add mkl impl
6 years ago
tensor-tang c50060bb26 add jitcode impl and use it
6 years ago
tensor-tang 142bb41748 add seqpool jitkernel test and benchmark
6 years ago
tensor-tang e58a569c6c use seqpool jitkernel
6 years ago
tensor-tang 3e01a4048f add refer seqpool jitkernel
6 years ago
Qiao Longfei 4ecb9c93f0 update API.spec
6 years ago
xiaolil1 bbc9336878 Enable basic MKL-DNN INT8 Conv OP (#15124)
6 years ago
Xin Pan 8ae9094e07 polish and resolve conflicts
6 years ago
Xin Pan 5e928e579a try unify Executor and ParallelExecutor
6 years ago
Qiao Longfei e10af895de update gru grad op
6 years ago
Qiao Longfei 78ec7c0f99 gru add origin mode
6 years ago
peizhilin c919b2f31d Merge remote-tracking branch 'upstream/develop' into windows/fixgpuissue
6 years ago
peizhilin fd4f4d0e5f fix build issue test=develop
6 years ago
Yan Xu a1e60ab19b Merge pull request #14791 from Yancey1989/parallel_graph_mode
6 years ago
peizhilin 9ae50dd07d fix gpu buils issue on windows test=develop
6 years ago
Qiao Longfei 0e747e8d02 change the limit of thead num
6 years ago
qingqing01 c981bf0f9d Fix compling error with cuDNN v5 (#15148)
6 years ago
Yancey1989 4ad9de74dd disable sync nccl by default test=develop
6 years ago
Yancey1989 449bf58ea6 disable parallelgraph mode by default test=develop
6 years ago
Yancey1989 db603398b7 disable parallel graph executor by default
6 years ago
wopeizl 67093da398 Merge pull request #15122 from wopeizl/windows/fixhuberloss
6 years ago
sneaxiy d0a8a1e950 remove_op_handle_lock
6 years ago
Xin Pan 087af6a686 Merge pull request #15131 from panyx0718/clean
6 years ago
Yancey1989 e65436103f Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Yancey1989 94c80347b6 update by comment
6 years ago
sneaxiy 6f06e6cdac Merge remote origin
6 years ago
Qiyang Min 23761beaef Merge pull request #14971 from velconia/imperative_mnist
6 years ago
xiaolil1 8eb1f26211 Enable INT8 pool OP (#15046)
6 years ago
Wu Yi 227e0c4518 fix nccl2 mode startup test=develop (#15132)
6 years ago
Xin Pan 9186451f60 hide GetTensor
6 years ago
wopeizl 7305fc2ff9 Merge pull request #15112 from wopeizl/windows/fixsaveandloadops
6 years ago
peizhilin dba009dbbf fix script issue
6 years ago
peizhilin cd2d60b4c8 fix build issue for density prior box op on windows test=develop
6 years ago
Yancey1989 35cda13e9f fix unittest test=develop
6 years ago
peizhilin 1f423f84ac fix the huber loss compile issue on windows test=develop
6 years ago
sneaxiy d25395fc98 remove tensor core lock
6 years ago
tensor-tang 516fe301ee add comment in case of empty name
6 years ago
peizhilin b3688100ad fix unittest
6 years ago
minqiyang 2547f9d1b8 Polish code
6 years ago
tensor-tang b9c645639b workaround with third party cache
6 years ago
peizhilin 5d8f281397 restore the memory mode
6 years ago
tensor-tang c02165d23a Merge remote-tracking branch 'ups/develop' into refine/seqpool
6 years ago
tensor-tang dca68cdf97 throw error when name not find
6 years ago
peizhilin 33b7821a75 fix save and load ops on windows test=develop
6 years ago
Qiao Longfei dfe85fb358 fix build
6 years ago
Qiao Longfei f057bbd1d1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-adam-tmp-var
6 years ago
Qiao Longfei f1c973b014 adam op should not create tmp var in compute
6 years ago
Yancey1989 82b42e31f0 polish unittest test=develop
6 years ago
wopeizl 10bedbdeaa Merge pull request #15105 from wopeizl/windows/fixtimer
6 years ago
minqiyang 09e2e66236 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
6 years ago
Yancey1989 0a885ac12a Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Yancey1989 ca8c77d966 selecte execution according to strategy test=develop
6 years ago
tensor-tang 484085693e update url and num_ops
6 years ago
tensor-tang cd94df8679 fix load and refine
6 years ago
tensor-tang 8e271896ae add test data for seqpool1
6 years ago
minqiyang 858e903231 Add unittest for operator
6 years ago
gongweibao ce70229ba6 Add max_body_size flags to brpc (#15084)
6 years ago
qingqing01 6f0a1d7b47 Inception fusion operator. (#14968)
6 years ago
peizhilin 813c2ce539 fix timer test=develop
6 years ago
Qiao Longfei 25d44d40ac sum op support empty selected rows as input
6 years ago
wopeizl 7ab501264d Merge pull request #15069 from wopeizl/windows/dsosupport
6 years ago
minqiyang 6a5f604607 Support stop_gradients var in imperative backward
6 years ago
guru4elephant ff739449ab Merge pull request #15018 from guru4elephant/add_timer
6 years ago
Qiyang Min e29cbfe4f7 Merge pull request #14829 from velconia/accelerate_ddpg
6 years ago
Tao Luo 9c2cbfb89e Merge pull request #15093 from baojun-nervana/intel/cmake
6 years ago
Zeng Jinle 25b49a0896 Merge pull request #14933 from sneaxiy/rewrite_ddim
6 years ago
Wu Yi a8bc05b5ff Refactor distributed RPC (#15075)
6 years ago
baojun-nervana 555fbc10d8 upgrade ngraph to v0.10.1 test=develop
6 years ago
baojun-nervana c714c36482 simplify logic test=develop
6 years ago
minqiyang 9e3155e01d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
6 years ago
minqiyang 6bb84490af Fix imperative unit test
6 years ago
Xin Pan 3e8408429d Merge pull request #15053 from panyx0718/imperative_hold
6 years ago
sneaxiy 73896eeb94 merge develop
6 years ago
Wu Yi e26cced7cc refine batch merge pass (#14777)
6 years ago
minqiyang 336160e651 Complete imperative optimizer implementation
6 years ago
Yancey1989 4743c9cd5d Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
sneaxiy 9a3a246cb5 fix py35 compile error
6 years ago
Xin Pan f7294f8b25 register float16
6 years ago
Zhaolong Xing 4048cfa9da Merge pull request #15048 from NHZlX/add_affine_channel_fuse
6 years ago
minqiyang ef7d563db9 Add changes back
6 years ago
minqiyang a318a490ab Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
6 years ago
Zeng Jinle c0bcff00dc Merge pull request #14962 from sneaxiy/rewrite_variable_type
6 years ago
chengduo fe8495a758 [WIP] Refine MultiDevSSAGraph (#15040)
6 years ago
Qiao Longfei d161215332 optimize adam multi thread
6 years ago
dongdaxiang 82335cd88c Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
6 years ago
Tao Luo 85471533e0 Merge pull request #15079 from luotao1/analysis_test
6 years ago
minqiyang d4b9928c5a Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
6 years ago
minqiyang 5822f7f1d8 Polish code
6 years ago
wopeizl 719ebe3786 Merge pull request #15070 from wopeizl/windows/testcasefix
6 years ago
Qiao Longfei 7a58ad5c79 lazy mode have higher priority then multithread
6 years ago
Xin Pan c132c79011 address comments and resolve conflicts.
6 years ago
Xin Pan b91a7a9d30 clear operator changes
6 years ago
Xin Pan f52b514dcd call kernel
6 years ago
Xin Pan 4e80e04f23 fix
6 years ago
Xin Pan 7b6bf9ddf2 make fill_constant kernel-based
6 years ago
Xin Pan 61491ce250 clean
6 years ago
Xin Pan ce7e503cbe refactor to avoid scope.
6 years ago
Qiyang Min 0238a3bb4f Merge pull request #14972 from velconia/accelerate_lstm
6 years ago
Houjiang Chen 242d3c71a6 Merge pull request #15031 from hjchen2/develop
6 years ago
Qiao Longfei d0572bf02e add log for lazy mode test=develop
6 years ago
Xin Pan 71a4a8e981 Merge pull request #15071 from wopeizl/revert/15035
6 years ago
minqiyang 68e9b841ab Add support for optimizer
6 years ago
Qiao Longfei 1177b0bc84 update multi thread adam
6 years ago
Qiao Longfei 3b294e2e2e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
6 years ago
Zeng Jinle 988bc2b5a7 Merge pull request #15060 from dzhwinter/fix/nccl
6 years ago
sneaxiy c4ce2e7b21 merge develop, solve conflict
6 years ago
minqiyang 8ed0233924 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
6 years ago
Zeng Jinle 9c6a0203e2 Merge pull request #15073 from sneaxiy/add_scope_pool
6 years ago
tensor-tang 656c672cdd Merge pull request #15051 from tensor-tang/test/seq_pool1
6 years ago
Tao Luo ecae157edf simplify some data record in analyzer_tester
6 years ago
sneaxiy b56aca82e9 merge develop
6 years ago
Tao Luo 05f1b65da3 simplify prepere_input in analyzer_test
6 years ago
sneaxiy ee83ce75bf try to fix py35 compile error
6 years ago
sneaxiy 10a6bc9675 modify API.spec
6 years ago
sneaxiy 3e917a934a add scope_pool
6 years ago
nhzlx 02e17396c2 fix comments
6 years ago
jerrywgz ef2d292bfc Merge pull request #14956 from jerrywgz/fix_bug_in_ifelse
6 years ago
Yancey1989 1a4f79a7de fix unittest test=develop
6 years ago
peizhilin e49276e731 restore the huber_loss_op
6 years ago
Yancey1989 86bb583881 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Yancey1989 495e73d766 enable gc
6 years ago
Yancey1989 28cdfbc2b0 delete comment code
6 years ago
Yancey1989 845bfd5807 cleanup code
6 years ago
peizhilin 2388d0e7d6 Revert "cherry-pick the #12759"
6 years ago
nhzlx 71636e677d add min_subgraph_size attr to tensorrt config
6 years ago
peizhilin 01c00b07dd fix test issues on windows
6 years ago
peizhilin 1e7f83e60a add cuda dso support for windows
6 years ago
tangwei12 dc8eca826e code style fix, test=develop (#15045)
6 years ago
Yancey1989 41a64f6a2a Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
nhzlx a6aa8ea771 faster rcnn input is presistable. (fix it in paddle-trt)
6 years ago
hjchen2 956cf92145 Fix conv_elementwise_add2_act pass
6 years ago
Tao Luo 69659f4ae2 Merge pull request #15037 from jianhang-liu/fix/abnormal_stack_op_time
6 years ago
whs 2314f2ebb3 Make topk op support variable k. (#15044)
6 years ago
sneaxiy 179acc60b3 fix conflict with develop
6 years ago
wopeizl 09bd8fa67a Merge pull request #15035 from wopeizl/debug/improvement1
6 years ago
sneaxiy dde3afe7b7 Merge develop
6 years ago
dzhwinter 3ea2f415dc fix ci error. test=develop
6 years ago
dongdaxiang 2df1d80767 Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
6 years ago
Wu Yi 856f0da0fe Fp16 training (#14992)
6 years ago
Brian Liu e821b12f57 Fix issue which cause abnormal CPU usage in stack op
6 years ago
chengduo b9fb03cf54 Move GetTensor to tensor_util (#15011)
6 years ago
Yihua Xu 0b0acfaa88 Add mkldnn item for porfile and compare usage.
6 years ago
Yihua Xu dbb90a76f0 Merge remote-tracking branch 'paddle/develop' into develop_641313ea7_elementwise_mul_mkldnn_bug_fix
6 years ago
tensor-tang d46a140dd9 add seq pool inference test
6 years ago
tensor-tang d4931a2abc support more input fake data
6 years ago
nhzlx 73b47df1f4 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_affine_channel_fuse
6 years ago
nhzlx ce3782c193 add affine_channel fuse.
6 years ago
peizhilin 170e78b397 restore the top-k
6 years ago
dongdaxiang ab2abfc5b2 Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
6 years ago
Tao Luo bc16bcda49 Merge pull request #14998 from luotao1/mm_dnn
6 years ago
Qiyang Min aba1f9b06e Merge pull request #14891 from velconia/accelerate_adam
6 years ago
minqiyang 8ec3d863b0 Fix throw_on_error direct call bug
6 years ago
peizhilin e05fb128bc fix code style
6 years ago
peizhilin 7f6d8acecb cherry-pick the #12759
6 years ago
sneaxiy 3a2afbf02e polish code
6 years ago
dongdaxiang 4cb833d2de Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
6 years ago
tensor-tang 05d1121b22 Merge pull request #14802 from mozga-intel/mozga-intel/fill_constant_operator_ngraph
6 years ago
tensor-tang 9d4f1d468a Merge pull request #14804 from mozga-intel/mozga-intel/top_k_operator_ngraph
6 years ago
tensor-tang 8a6ac4dba7 Merge pull request #14973 from xiaolil1/dequantize
6 years ago
tensor-tang f0e02a65ed Merge pull request #14974 from xiaolil1/quantize
6 years ago
Tao Luo 91408e3122 fix analyzer_mm_dnn_tester fails when bs > 1
6 years ago
Tao Luo f01c966800 Merge branch 'develop' into mm_dnn
6 years ago
sneaxiy 68d91cd594 add copy ctor
6 years ago
dongdaxiang 68a2d1f3d7 Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
6 years ago
dongdaxiang 2e5ebc4594 Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
6 years ago
dongdaxiang 5dfd9c9aa9 Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
6 years ago
dongdaxiang d0a5159946 Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
6 years ago
dongdaxiang f9b8168508 Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
6 years ago
dongdaxiang 3b3cb4ea55 Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
6 years ago
minqiyang 52b4821a6e Fix Sprintf problem
6 years ago
qingqing01 51a9fca323 Async memory copy (#15013)
6 years ago
minqiyang 010f657b33 Polish code
6 years ago
JiabinYang 1a8cbb6799 test=develop, accelerate_hs_op and add prefetch with is_sparse
6 years ago
sneaxiy e02f67eff7 rewrite unsafe_cast
6 years ago
minqiyang 45acfbd011 1. Add specific condition for one or no arg in PADDLE_ENFORCE
6 years ago
minqiyang 68b86d6665 Change default value to align with the original react
6 years ago
whs 938705745e Init paddle slim (#14834)
6 years ago
dongdaxiang 2dee8f6cd5 add TrainFilesWithTimer in async_executor
6 years ago
xiaoli.liu@intel.com 869d444b92 Fix comments misunderstanding
6 years ago
xiaoli.liu@intel.com d83d0f33fd extract templated function
6 years ago
dongdaxiang d434fcbaa6 add TrainFilesWithTimer in async_executor
6 years ago
Yihua Xu d4606bcb22 Fix the exception when tensor format is x
6 years ago
minqiyang 250e893745 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
6 years ago
minqiyang 8b6b0da062 Use adam_update
6 years ago
minqiyang f4e7a47381 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_adam
6 years ago
minqiyang b1d0a14c14 Change the ut back
6 years ago
minqiyang 7d1533216d Fix syntax error in unit test
6 years ago
tensor-tang 641313ea77 Merge pull request #15000 from tensor-tang/doc/en/jit
6 years ago
minqiyang e811e06555 Avoid comma in macro
6 years ago
minqiyang 0cf1461ccc Avoid comma in macro
6 years ago
wopeizl b117a5f208 Merge pull request #14931 from wopeizl/windows/mkl
6 years ago
Xin Pan 103f08f50e Merge pull request #14910 from panyx0718/clean3
6 years ago
dongdaxiang cf6188a823 add a linux timer
6 years ago
Zeng Jinle 0021b05b19 Merge pull request #14993 from sneaxiy/fix_check_lod
6 years ago
tensor-tang 68ab16444a add eng doc of jit kernel and follow comments
6 years ago
chengduo 79bd6dfa18 [Feature] Add Temporary Allocator (#14875)
6 years ago
minqiyang e4719eb462 Fix bug in Windows VC 2010
6 years ago
sneaxiy a30c5373eb use std::is_sorted
6 years ago
minqiyang 5a5c577529 Polish code
6 years ago
minqiyang 099186cd41 Support one argument PADDLE_ENFORCE
6 years ago