Commit Graph

6272 Commits (3d0ecab41bc62585d52816251098a78b5c65d217)

Author SHA1 Message Date
tensor-tang 3759c1db8c Merge pull request #14805 from mozga-intel/mozga-intel/element_wise_operator_ngraph
7 years ago
tensor-tang 904a39239d Merge pull request #15254 from mozga-intel/mozga-intel/softmax_operator_ngraph
7 years ago
nhzlx e61a1b9514 merge develop test=develop
7 years ago
peizhilin cd562f8fb7 disable the parallel mode for adam op on windows test=develop
7 years ago
nhzlx b2ba3471fd fix analysis config bug.
7 years ago
Xin Pan 01dc15ce32 Merge pull request #15329 from panyx0718/imperative2
7 years ago
Xin Pan 16cb3ebd68 Merge pull request #15268 from xiaolil1/pool-int8
7 years ago
Xin Pan 9a4314f025 imperative gan
7 years ago
tensor-tang a7fc3d42a0 Merge pull request #15304 from tensor-tang/fuse/second_order_mul_sub
7 years ago
bingyanghuang a152a5c731 Disable conv3d mkldnn in dam (#15335)
7 years ago
Xin Pan 73093656b8 Merge pull request #15331 from panyx0718/api
7 years ago
Xin Pan 2db6e3ed2a Merge pull request #15292 from panyx0718/imperative
7 years ago
乔龙飞 Qiao Longfei b14d4cdd75 Merge pull request #14890 from jacquesqiao/multithread-sparse-adam
7 years ago
Xin Pan 6b762f6519 add doc
7 years ago
Xin Pan d7b159355c add more doc
7 years ago
mozga-intel cba729404d Enable softmax operator for a ngraph engine
7 years ago
Qiao Longfei cd31b90a46 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
7 years ago
wopeizl 0fbb76f66b Merge pull request #15204 from wopeizl/debug/support
7 years ago
Xin Pan 24bb6a6aec expose CompiledProgram
7 years ago
Xin Pan 783dbe9abb more doc
7 years ago
Xin Pan f997109bb1 polish
7 years ago
Xin Pan c1fdacd4b4 add imperative mode design
7 years ago
Qiao Longfei 8c516a24e5 remote min_row_size_to_use_multithread in adam interface test=develop
7 years ago
Tao Luo 9497d43921 Merge pull request #15307 from luotao1/trace_deps
7 years ago
tensor-tang 1a95cd227d disable seqpool test on mac or without mkl
7 years ago
Qiao Longfei 9b4fe283e1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
7 years ago
tensor-tang 0b6447a482 Merge pull request #15310 from luotao1/ZeroCopy_omp
7 years ago
peizhilin 5e450833bd test=develop
7 years ago
Qiyang Min 3f687765e6 Merge pull request #15281 from velconia/fix_expand_op_compile_time
7 years ago
peizhilin eea75a1d93 fix issue when type is invalid
7 years ago
peizhilin 9adb158e5b Merge remote-tracking branch 'upstream/develop' into debug/support
7 years ago
minqiyang 29ceb93126 Use malloc and free in JeMalloc
7 years ago
Tao Luo 2411ed4286 fix multi-threads in ZeroCopyProfile
7 years ago
minqiyang c4cf5967db Change backward op infershape
7 years ago
tensor-tang 84b0ecdcce Merge remote-tracking branch 'ups/develop' into fuse/second_order_mul_sub
7 years ago
tensor-tang 7035f051a8 adjust acc on mac
7 years ago
luotao1 346561a37f fix imperative compile when WITH_PYTHON=OFF
7 years ago
Xin Pan b29eca3b71 code style
7 years ago
Xin Pan 7bc67c31e5 polish more
7 years ago
Xin Pan 0c04cac484 polish
7 years ago
Xin Pan 47ef2df01a polish
7 years ago
Xin Pan 0d5819eb4f polish imperative codes
7 years ago
Tao Luo e33427da0d Merge pull request #15280 from luotao1/random_test
7 years ago
chengduo 46d01d798e Revert "Revert "Remove workspace_handle in conv_cudnn (#15186)"" (#15290)
7 years ago
Qiao Longfei 4d15515c40 fix gru_gpu_kernel test=develop
7 years ago
tensor-tang 93e75c5ae5 refine jitcode of vsub and vsquare
7 years ago
tensor-tang d618e48309 fix fuse square mat order and refine test
7 years ago
tensor-tang a5d2a6d1ad add fuse pass of sequared mat sub fusion
7 years ago
tensor-tang 531f4a1578 Merge branch 'fuse/repeatedfcrelu' into fuse/second_order_mul_sub
7 years ago
tensor-tang 84e023eae5 adjust the acc since the refer result is too large
7 years ago
Qiao Longfei 4feae25378 fix build problem test=develop
7 years ago
tensor-tang 38de1ff472 add fusion squared mat sub op
7 years ago
Qiao Longfei e641ffe77b change interface and api spec for dynamic_gru test=develop
7 years ago
tensor-tang 09c5786e22 add square jitkernel
7 years ago
Qiao Longfei 4c7be265d3 update avx gru grad kernel test=develop
7 years ago
tensor-tang 4461a458a5 adjust diff since abs is too large
7 years ago
Qiao Longfei 9b16e54064 update gru_grad_op
7 years ago
tensor-tang ca6fdc6e33 refine and fix test
7 years ago
tensor-tang a89296ac1f add repeated fc relu pass
7 years ago
Qiao Longfei e477d789a1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gru-add-mode
7 years ago
tensor-tang f347d6e4a1 add repeated fc relu unit test
7 years ago
tensor-tang 99010e6eae init repeated fc relu op
7 years ago
tensor-tang 266a5d2f52 implement matmul refer and mkl kernel
7 years ago
tensor-tang c5623c87a3 init jit matmul kernel
7 years ago
Xin Pan a92860a3b1 Merge pull request #15298 from panyx0718/fix
7 years ago
Xin Pan 50b4ac08b0 fix
7 years ago
Xin Pan a1bfb35dd6 try fix py2
7 years ago
tensor-tang 781cd0cf51 add multi threads test of seqpool test (#15293)
7 years ago
Xin Pan 3f65869ba6 try fix
7 years ago
Xin Pan 3e79e6544f try fix
7 years ago
Tao Luo 1d434a9de6 Merge pull request #15291 from wojtuss/wojtuss/fix-performance-drop
7 years ago
minqiyang c86b3dd6e6 Polish code
7 years ago
minqiyang ddfb9f1123 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_shared_ptr
7 years ago
Xin Pan d1220f23bb Merge pull request #15229 from panyx0718/imperative
7 years ago
colourful-tree 576c740d5d Merge pull request #14964 from colourful-tree/data_norm
7 years ago
colourful-tree d5a8909131 Merge pull request #14950 from colourful-tree/develop
7 years ago
minqiyang bc3e0d6e01 Fix expand op compile time bug
7 years ago
Tao Luo cbd1c7c01f fix CompareDeterministic error when test_all_data
7 years ago
Xin Pan 6a18c0f9ff Merge pull request #15278 from chengduoZH/revert_remove_workspace_handle_in_conv2d_cudnn
7 years ago
Zhaolong Xing 98e85f3735 add_transpose_flatten_concat_fuse (#15121)
7 years ago
chengduozh c4eced9881 fix thread safe bug
7 years ago
chengduozh 358e657f68 Revert "Remove workspace_handle in conv_cudnn (#15186)"
7 years ago
wopeizl 5d9edb4124 Merge pull request #15156 from wopeizl/windows/fixgpuissue
7 years ago
Wojciech Uss cb2ba58458 Fix performance drop when with MKL-DNN
7 years ago
tensor-tang fc9fbab6a0 Merge pull request #15271 from tensor-tang/fix/typo
7 years ago
minqiyang d0b640dca1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_shared_ptr
7 years ago
chengduo 064512aa47 Remove workspace_handle in conv_cudnn (#15186)
7 years ago
minqiyang 687171d22b Move from shared_ptr to raw pointer
7 years ago
tensor-tang c3a9f3c4b2 fix typo and refine
7 years ago
tensor-tang 146e942c65 Merge pull request #15250 from tensor-tang/refine/seqpool/feed
7 years ago
xiaolil1 8f17c714de Conv int8 residual (#15145)
7 years ago
Tao Luo 93d5c1ed5a Merge pull request #15261 from wopeizl/fixdemos
7 years ago
xiaoli.liu@intel.com f34e779f4d Enhance key generation for INT8 test.
7 years ago
peizhilin 439691f5bd adjust the shlwapi on windows
7 years ago
peizhilin 92da467c99 Merge remote-tracking branch 'upstream/develop' into windows/fixgpuissue
7 years ago
Wu Yi fd85418329 [Feature] support mix precision training for resnet (#14899)
7 years ago
tensor-tang 96786d3716 add compare_determine of seqpool1 test
7 years ago
tensor-tang ab9c4b2a9f refine seqpool concat pass and remove unused nodes
7 years ago
tensor-tang ce909664d8 Merge remote-tracking branch 'ups/develop' into refine/seqpool/feed
7 years ago
peizhilin e239558e56 remove the dismatch enclosure to avoid warning message test=develop
7 years ago
flame fb63cd89d4 Add python ir graph API (#14917)
7 years ago
tensor-tang a0a27bd240 add seqpool concat fuse pass tester
7 years ago
Tao Luo 7d13d20769 Merge pull request #15245 from luotao1/rnn1_multi_thread
7 years ago
minqiyang 80197fac26 Add missing files
7 years ago
Tao Luo 2b11c710b3 Merge pull request #15249 from NHZlX/fix_trt_demo_ci
7 years ago
乔龙飞 Qiao Longfei 5e74c4e88f Merge pull request #15100 from jacquesqiao/fix-dist-sparse-decay
7 years ago
tensor-tang 8e086a8521 follow comment and fix typo
7 years ago
minqiyang 08e2a5d611 Polish tracer code
7 years ago
minqiyang cded24768c Remove shared_ptr holder for VarBase
7 years ago
minqiyang c8d1a8e909 Change var_ and grad_ to shared_ptr
7 years ago
minqiyang 7aab39af15 Change grads to VarBase
7 years ago
tensor-tang 54afcb7ec6 add compare zerocopy test with native result
7 years ago
tensor-tang 137060135e fix zerocopy size
7 years ago
tensor-tang 7461356723 add zerocopy for seqpool test
7 years ago
tensor-tang 48410b9bfe Merge pull request #15237 from tensor-tang/fuse/seqpool_concat_2
7 years ago
nhzlx e7d83389e6 fix demo ci bug
7 years ago
Tao Luo 9b41e45584 Merge pull request #15222 from luotao1/native_config
7 years ago
Tao Luo d43983b61d reduce threads number to avoid hang in CI
7 years ago
Qiao Longfei 653cd31971 remote unused code
7 years ago
Qiao Longfei 0a79d7a404 fix merge
7 years ago
Qiao Longfei 422449a945 fix style
7 years ago
Qiao Longfei edad60e612 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
7 years ago
nhzlx c1264e99f3 fix win error
7 years ago
peizhilin c1235c935f add the enable_debug flag
7 years ago
nhzlx 4e3522e5b4 add trt int8 support
7 years ago
Xin Pan 7b73fc9e1a Merge pull request #15089 from panyx0718/api
7 years ago
Xin Pan 9597fd05e9 polish
7 years ago
Qiao Longfei d0e3b24002 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-dist-sparse-decay
7 years ago
tensor-tang f8c305b243 Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat_2
7 years ago
Tao Luo 197d0f2431 fix trt_model_tester to pass the ci
7 years ago
tensor-tang 223c61ca5e Merge pull request #15170 from tensor-tang/jit/seqpool
7 years ago
Qiao Longfei c3b9edf958 follow comment test=develop
7 years ago
Zeng Jinle e29f10d315 Merge pull request #15207 from sneaxiy/remove_op_handle_lock_and_fix_var
7 years ago
Zeng Jinle 7b638f2781 Merge pull request #15218 from sneaxiy/fix_same_name_func
7 years ago
Tao Luo feee78d9f0 Merge pull request #15214 from tensor-tang/fix/debug
7 years ago
Xin Pan 7aad6afd49 forward and backward
7 years ago
mozga-intel eff90eb941 PADDLE_WITH_NGRAPH was removed from the code
7 years ago
mozga-intel a42f8f4f6f Enable element_wise_add operator for a ngraph
7 years ago
mozga-intel e4184008a4 PADDLE_WITH_NGRAPH was removed from the code
7 years ago
Qiao Longfei 3ace486ebd fix sum_op selected rows test=develop
7 years ago
Tao Luo 71d9097a89 fix analyzer_test runs error in native_config
7 years ago
Tao Luo 9c02765158 Merge pull request #15210 from Superjomn/fix/analysis_tester_bug
7 years ago
tensor-tang 72d2a1801e add seqpool concat fuse pass
7 years ago
tensor-tang f702f8fd10 Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat
7 years ago
sneaxiy bc205ef374 fix same name func
7 years ago
tensor-tang 69fd3fdb52 fix debug build error
7 years ago
Xin Pan 2349acea48 checkpoint
7 years ago
xuezhong c0bc818688 Merge pull request #15188 from velconia/add_pyramid_dnn_support
7 years ago
Qiao Longfei b16e832d4d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-dist-sparse-decay
7 years ago
Qiao Longfei 4d169ad981 update api spec test=develop
7 years ago
superjomn 23bdd0a223 fix analysis_tester bug
7 years ago
Yan Chunwei d09d6eadc0 make inference api work with Doxygen (#15195)
7 years ago
Zeng Jinle c562be20d9 Merge pull request #15193 from sneaxiy/fix_cudnn_compatible_check
7 years ago
peizhilin 1cd95d8a0b use thread local instance test=develop
7 years ago
minqiyang 7b7d0d0caf Change hash function back
7 years ago
Xin Pan 11d4d39cd7 forward working
7 years ago
sneaxiy ed409ac9f4 Revert "Revert "Remove op handle lock""
7 years ago
sneaxiy 4a443ffc98 merge develop
7 years ago
peizhilin d54133ea85 not include the numeric under linux test=develop
7 years ago
sneaxiy 7c7342bf12 fix scope.var()
7 years ago
Tao Luo 4d9aa1745a Merge pull request #14806 from mozga-intel/mozga-intel/scale_operator_ngraph
7 years ago
Tao Luo dc0c221426 Merge pull request #14803 from mozga-intel/mozga-intel/mean_operator_ngraph
7 years ago
Xin Pan b629133375 checkpoint runnable PyLayer
7 years ago
peizhilin a6f5ceee74 add the python callstack for debug support test=develop
7 years ago
Zeng Jinle dacfaaa966 Revert "Remove op handle lock"
7 years ago
Tao Luo 6ca9a4810b Merge pull request #15196 from luotao1/serial
7 years ago
Xin Pan c4b09a713f polish
7 years ago
minqiyang b76695418a Polish log
7 years ago
minqiyang 1bfbc0d963 Polish code
7 years ago
minqiyang 7f45b9511a Polish code
7 years ago
minqiyang 68a07328fa Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_pyramid_dnn_support
7 years ago
Qiyang Min 317840d3ba Merge pull request #14277 from velconia/add_fused_emb_seq_pool_op
7 years ago
tensor-tang 2dd331cc21 Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat
7 years ago
tensor-tang 316636404f add seqpool concat unit test
7 years ago
Yan Chunwei 6ccf8685f7 refactor tensorrt node teller (#15181)
7 years ago
Tao Luo 7dc0181c46 run analyzer_tester serial in multi-thread
7 years ago
xiaolil1 c8f101e5da Conv int8 relu (#15130)
7 years ago
sneaxiy 9793a0b6a6 fix_cudnn_compatible_check
7 years ago
Zeng Jinle ccb322d6a5 merge develop
7 years ago
Xin Pan 0d0bc61248 update api
7 years ago
tensor-tang 7923d7271f add fusion seqpool concat op
7 years ago
Zeng Jinle f3a13512fc Merge pull request #15139 from sneaxiy/remove_op_handle_lock
7 years ago
Qiao Longfei 44b300556d change min_row_size_to_use_multithread to parameter of adam
7 years ago
Qiao Longfei 87b4eb1da4 change min_param_size_to_use_multithread to min_row_size_to_use_multithread
7 years ago
minqiyang 0f94c1ac14 Polish code
7 years ago
minqiyang 00e4de04bf Polish code
7 years ago
minqiyang 4bfa110fd8 Add no lock optimize pass
7 years ago
chengduo eabb2105fa Refactor MultiDevSSAGraphBuilder (#15090)
7 years ago
Yan Chunwei 875a07c32d refactor inference analysis api (#14634)
7 years ago
minqiyang c09a379015 remove const_cast
7 years ago
tensor-tang 102d93712e Merge remote-tracking branch 'ups/develop' into jit/seqpool
7 years ago
tensor-tang 123b98f417 refine heigth and codesize and support all pool
7 years ago
tensor-tang 0145f40f45 use height from params of jitcode
7 years ago
tensor-tang e0591deebc enhance seqpool jitcode
7 years ago
Zeng Jinle 99e6e8b00f Merge pull request #15179 from sneaxiy/fix_crf_grad_lod
7 years ago
minqiyang db8eb9b688 Polish code
7 years ago
minqiyang f4c990e7b8 Add fused embedding ops
7 years ago
minqiyang 39b98709b1 Move fused ops to fused dir
7 years ago
minqiyang 920d4a8b78 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_fused_emb_seq_pool_op
7 years ago
Tao Luo 5ee596cae5 Merge pull request #15175 from baojun-nervana/intel/mkldnn
7 years ago
乔龙飞 Qiao Longfei 7c891e1ecc Merge pull request #15111 from jacquesqiao/fix-adam-tmp-var
7 years ago
mozga-intel e77956c920 Enable mean operator for a ngraph
7 years ago
mozga-intel dd768714ab Enable scale operator for a ngraph
7 years ago
sneaxiy be425461a1 fix crf grad lod share
7 years ago
Qiao Longfei 3e1b914fcb update gru op forward kernel
7 years ago
Qiao Longfei 7a81ab8607 complete gru_unite_op and test
7 years ago
Qiao Longfei 72618c8da5 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gru-add-mode
7 years ago
Qiao Longfei 17b1b660fc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
7 years ago
Qiao Longfei c15270c5b2 optimize multi thread adam
7 years ago
乔龙飞 Qiao Longfei e1679b8847 Merge pull request #14893 from JiabinYang/feature/add_prefech_hs
7 years ago
baojun-nervana f0cde74564 Update ngraph with elt-wise relu test=develop
7 years ago
tensor-tang 92201d3956 support avg and sqrt pool and add mkl impl
7 years ago
tensor-tang c50060bb26 add jitcode impl and use it
7 years ago
tensor-tang 142bb41748 add seqpool jitkernel test and benchmark
7 years ago
tensor-tang e58a569c6c use seqpool jitkernel
7 years ago
tensor-tang 3e01a4048f add refer seqpool jitkernel
7 years ago
Qiao Longfei 4ecb9c93f0 update API.spec
7 years ago
xiaolil1 bbc9336878 Enable basic MKL-DNN INT8 Conv OP (#15124)
7 years ago
Xin Pan 8ae9094e07 polish and resolve conflicts
7 years ago
Xin Pan 5e928e579a try unify Executor and ParallelExecutor
7 years ago
Qiao Longfei e10af895de update gru grad op
7 years ago
Qiao Longfei 78ec7c0f99 gru add origin mode
7 years ago
peizhilin c919b2f31d Merge remote-tracking branch 'upstream/develop' into windows/fixgpuissue
7 years ago
peizhilin fd4f4d0e5f fix build issue test=develop
7 years ago
Yan Xu a1e60ab19b Merge pull request #14791 from Yancey1989/parallel_graph_mode
7 years ago
peizhilin 9ae50dd07d fix gpu buils issue on windows test=develop
7 years ago
Qiao Longfei 0e747e8d02 change the limit of thead num
7 years ago
qingqing01 c981bf0f9d Fix compling error with cuDNN v5 (#15148)
7 years ago
Yancey1989 4ad9de74dd disable sync nccl by default test=develop
7 years ago
Yancey1989 449bf58ea6 disable parallelgraph mode by default test=develop
7 years ago
Yancey1989 db603398b7 disable parallel graph executor by default
7 years ago
wopeizl 67093da398 Merge pull request #15122 from wopeizl/windows/fixhuberloss
7 years ago
sneaxiy d0a8a1e950 remove_op_handle_lock
7 years ago
Xin Pan 087af6a686 Merge pull request #15131 from panyx0718/clean
7 years ago
Yancey1989 e65436103f Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
7 years ago
Yancey1989 94c80347b6 update by comment
7 years ago
sneaxiy 6f06e6cdac Merge remote origin
7 years ago
Qiyang Min 23761beaef Merge pull request #14971 from velconia/imperative_mnist
7 years ago
xiaolil1 8eb1f26211 Enable INT8 pool OP (#15046)
7 years ago
Wu Yi 227e0c4518 fix nccl2 mode startup test=develop (#15132)
7 years ago
Xin Pan 9186451f60 hide GetTensor
7 years ago
wopeizl 7305fc2ff9 Merge pull request #15112 from wopeizl/windows/fixsaveandloadops
7 years ago
peizhilin dba009dbbf fix script issue
7 years ago
peizhilin cd2d60b4c8 fix build issue for density prior box op on windows test=develop
7 years ago
Yancey1989 35cda13e9f fix unittest test=develop
7 years ago
peizhilin 1f423f84ac fix the huber loss compile issue on windows test=develop
7 years ago
sneaxiy d25395fc98 remove tensor core lock
7 years ago
tensor-tang 516fe301ee add comment in case of empty name
7 years ago
peizhilin b3688100ad fix unittest
7 years ago
minqiyang 2547f9d1b8 Polish code
7 years ago