Commit Graph

3336 Commits (ca5d96bb3d376be0ade29db4f58700ba2c81b88a)

Author SHA1 Message Date
tensor-tang 09c5786e22 add square jitkernel
6 years ago
Qiao Longfei 4c7be265d3 update avx gru grad kernel test=develop
6 years ago
Qiao Longfei 9b16e54064 update gru_grad_op
6 years ago
Qiao Longfei e477d789a1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gru-add-mode
6 years ago
tensor-tang f347d6e4a1 add repeated fc relu unit test
6 years ago
tensor-tang 99010e6eae init repeated fc relu op
6 years ago
tensor-tang 266a5d2f52 implement matmul refer and mkl kernel
6 years ago
tensor-tang c5623c87a3 init jit matmul kernel
6 years ago
Xin Pan a1bfb35dd6 try fix py2
6 years ago
colourful-tree 576c740d5d
Merge pull request #14964 from colourful-tree/data_norm
6 years ago
colourful-tree d5a8909131
Merge pull request #14950 from colourful-tree/develop
6 years ago
minqiyang bc3e0d6e01 Fix expand op compile time bug
6 years ago
chengduozh 358e657f68 Revert "Remove workspace_handle in conv_cudnn (#15186)"
6 years ago
tensor-tang fc9fbab6a0
Merge pull request #15271 from tensor-tang/fix/typo
6 years ago
chengduo 064512aa47
Remove workspace_handle in conv_cudnn (#15186)
6 years ago
tensor-tang c3a9f3c4b2 fix typo and refine
6 years ago
tensor-tang 146e942c65
Merge pull request #15250 from tensor-tang/refine/seqpool/feed
6 years ago
xiaolil1 8f17c714de Conv int8 residual (#15145)
6 years ago
xiaoli.liu@intel.com f34e779f4d Enhance key generation for INT8 test.
6 years ago
Wu Yi fd85418329
[Feature] support mix precision training for resnet (#14899)
6 years ago
tensor-tang ce909664d8 Merge remote-tracking branch 'ups/develop' into refine/seqpool/feed
6 years ago
乔龙飞 Qiao Longfei 5e74c4e88f
Merge pull request #15100 from jacquesqiao/fix-dist-sparse-decay
6 years ago
tensor-tang 8e086a8521 follow comment and fix typo
6 years ago
Qiao Longfei d0e3b24002 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-dist-sparse-decay
6 years ago
tensor-tang f8c305b243 Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat_2
6 years ago
tensor-tang 223c61ca5e
Merge pull request #15170 from tensor-tang/jit/seqpool
6 years ago
Qiao Longfei c3b9edf958 follow comment test=develop
6 years ago
Zeng Jinle e29f10d315
Merge pull request #15207 from sneaxiy/remove_op_handle_lock_and_fix_var
6 years ago
mozga-intel eff90eb941 PADDLE_WITH_NGRAPH was removed from the code
6 years ago
mozga-intel a42f8f4f6f Enable element_wise_add operator for a ngraph
6 years ago
mozga-intel e4184008a4 PADDLE_WITH_NGRAPH was removed from the code
6 years ago
Qiao Longfei 3ace486ebd fix sum_op selected rows test=develop
6 years ago
tensor-tang f702f8fd10 Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat
6 years ago
Qiao Longfei b16e832d4d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-dist-sparse-decay
6 years ago
sneaxiy ed409ac9f4 Revert "Revert "Remove op handle lock""
6 years ago
Tao Luo 4d9aa1745a
Merge pull request #14806 from mozga-intel/mozga-intel/scale_operator_ngraph
6 years ago
Tao Luo dc0c221426
Merge pull request #14803 from mozga-intel/mozga-intel/mean_operator_ngraph
6 years ago
Zeng Jinle dacfaaa966 Revert "Remove op handle lock"
6 years ago
Qiyang Min 317840d3ba
Merge pull request #14277 from velconia/add_fused_emb_seq_pool_op
6 years ago
tensor-tang 2dd331cc21 Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat
6 years ago
tensor-tang 316636404f add seqpool concat unit test
6 years ago
xiaolil1 c8f101e5da Conv int8 relu (#15130)
6 years ago
tensor-tang 7923d7271f add fusion seqpool concat op
6 years ago
Zeng Jinle f3a13512fc
Merge pull request #15139 from sneaxiy/remove_op_handle_lock
6 years ago
Qiao Longfei 44b300556d change min_row_size_to_use_multithread to parameter of adam
6 years ago
Qiao Longfei 87b4eb1da4 change min_param_size_to_use_multithread to min_row_size_to_use_multithread
6 years ago
minqiyang 0f94c1ac14 Polish code
6 years ago
minqiyang c09a379015 remove const_cast
6 years ago
tensor-tang 102d93712e Merge remote-tracking branch 'ups/develop' into jit/seqpool
6 years ago
tensor-tang 123b98f417 refine heigth and codesize and support all pool
6 years ago
tensor-tang 0145f40f45 use height from params of jitcode
6 years ago
tensor-tang e0591deebc enhance seqpool jitcode
6 years ago
Zeng Jinle 99e6e8b00f
Merge pull request #15179 from sneaxiy/fix_crf_grad_lod
6 years ago
minqiyang db8eb9b688 Polish code
6 years ago
minqiyang 39b98709b1 Move fused ops to fused dir
6 years ago
minqiyang 920d4a8b78 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_fused_emb_seq_pool_op
6 years ago
乔龙飞 Qiao Longfei 7c891e1ecc
Merge pull request #15111 from jacquesqiao/fix-adam-tmp-var
6 years ago
mozga-intel e77956c920 Enable mean operator for a ngraph
6 years ago
mozga-intel dd768714ab Enable scale operator for a ngraph
6 years ago
sneaxiy be425461a1 fix crf grad lod share
6 years ago
Qiao Longfei 3e1b914fcb update gru op forward kernel
6 years ago
Qiao Longfei 7a81ab8607 complete gru_unite_op and test
6 years ago
Qiao Longfei 72618c8da5 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gru-add-mode
6 years ago
Qiao Longfei 17b1b660fc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
6 years ago
Qiao Longfei c15270c5b2 optimize multi thread adam
6 years ago
乔龙飞 Qiao Longfei e1679b8847
Merge pull request #14893 from JiabinYang/feature/add_prefech_hs
6 years ago
tensor-tang 92201d3956 support avg and sqrt pool and add mkl impl
6 years ago
tensor-tang c50060bb26 add jitcode impl and use it
6 years ago
tensor-tang 142bb41748 add seqpool jitkernel test and benchmark
6 years ago
tensor-tang e58a569c6c use seqpool jitkernel
6 years ago
tensor-tang 3e01a4048f add refer seqpool jitkernel
6 years ago
xiaolil1 bbc9336878 Enable basic MKL-DNN INT8 Conv OP (#15124)
6 years ago
Qiao Longfei e10af895de update gru grad op
6 years ago
Qiao Longfei 78ec7c0f99 gru add origin mode
6 years ago
Yan Xu a1e60ab19b
Merge pull request #14791 from Yancey1989/parallel_graph_mode
6 years ago
Qiao Longfei 0e747e8d02 change the limit of thead num
6 years ago
qingqing01 c981bf0f9d
Fix compling error with cuDNN v5 (#15148)
6 years ago
wopeizl 67093da398
Merge pull request #15122 from wopeizl/windows/fixhuberloss
6 years ago
sneaxiy d0a8a1e950 remove_op_handle_lock
6 years ago
Xin Pan 087af6a686
Merge pull request #15131 from panyx0718/clean
6 years ago
Yancey1989 e65436103f Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
sneaxiy 6f06e6cdac Merge remote origin
6 years ago
xiaolil1 8eb1f26211 Enable INT8 pool OP (#15046)
6 years ago
Xin Pan 9186451f60 hide GetTensor
6 years ago
peizhilin dba009dbbf fix script issue
6 years ago
peizhilin cd2d60b4c8 fix build issue for density prior box op on windows test=develop
6 years ago
peizhilin 1f423f84ac fix the huber loss compile issue on windows test=develop
6 years ago
sneaxiy d25395fc98 remove tensor core lock
6 years ago
peizhilin b3688100ad fix unittest
6 years ago
peizhilin 5d8f281397 restore the memory mode
6 years ago
peizhilin 33b7821a75 fix save and load ops on windows test=develop
6 years ago
Qiao Longfei dfe85fb358 fix build
6 years ago
Qiao Longfei f057bbd1d1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-adam-tmp-var
6 years ago
Qiao Longfei f1c973b014 adam op should not create tmp var in compute
6 years ago
Yancey1989 0a885ac12a Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
gongweibao ce70229ba6
Add max_body_size flags to brpc (#15084)
6 years ago
qingqing01 6f0a1d7b47
Inception fusion operator. (#14968)
6 years ago
Qiao Longfei 25d44d40ac sum op support empty selected rows as input
6 years ago
Zeng Jinle 25b49a0896
Merge pull request #14933 from sneaxiy/rewrite_ddim
6 years ago
Wu Yi a8bc05b5ff
Refactor distributed RPC (#15075)
6 years ago