Commit Graph

3504 Commits (16af1dbc7bb1580565fe07e71c18abf917553f30)

Author SHA1 Message Date
Dun Liang a900015c03 add async copy and pinned place
6 years ago
colourful-tree 576c740d5d Merge pull request #14964 from colourful-tree/data_norm
6 years ago
colourful-tree d5a8909131 Merge pull request #14950 from colourful-tree/develop
6 years ago
minqiyang bc3e0d6e01 Fix expand op compile time bug
6 years ago
chengduozh 358e657f68 Revert "Remove workspace_handle in conv_cudnn (#15186)"
6 years ago
tensor-tang fc9fbab6a0 Merge pull request #15271 from tensor-tang/fix/typo
6 years ago
chengduo 064512aa47 Remove workspace_handle in conv_cudnn (#15186)
6 years ago
tensor-tang c3a9f3c4b2 fix typo and refine
6 years ago
tensor-tang 146e942c65 Merge pull request #15250 from tensor-tang/refine/seqpool/feed
6 years ago
xiaolil1 8f17c714de Conv int8 residual (#15145)
6 years ago
xiaoli.liu@intel.com f34e779f4d Enhance key generation for INT8 test.
6 years ago
Wu Yi fd85418329 [Feature] support mix precision training for resnet (#14899)
6 years ago
tensor-tang ce909664d8 Merge remote-tracking branch 'ups/develop' into refine/seqpool/feed
6 years ago
乔龙飞 Qiao Longfei 5e74c4e88f Merge pull request #15100 from jacquesqiao/fix-dist-sparse-decay
6 years ago
tensor-tang 8e086a8521 follow comment and fix typo
6 years ago
Qiao Longfei 653cd31971 remote unused code
6 years ago
Qiao Longfei 0a79d7a404 fix merge
6 years ago
Qiao Longfei 422449a945 fix style
6 years ago
Qiao Longfei edad60e612 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
6 years ago
nhzlx 4e3522e5b4 add trt int8 support
6 years ago
Qiao Longfei d0e3b24002 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-dist-sparse-decay
6 years ago
tensor-tang f8c305b243 Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat_2
6 years ago
tensor-tang 223c61ca5e Merge pull request #15170 from tensor-tang/jit/seqpool
6 years ago
Qiao Longfei c3b9edf958 follow comment test=develop
6 years ago
Zeng Jinle e29f10d315 Merge pull request #15207 from sneaxiy/remove_op_handle_lock_and_fix_var
6 years ago
mozga-intel eff90eb941 PADDLE_WITH_NGRAPH was removed from the code
6 years ago
mozga-intel a42f8f4f6f Enable element_wise_add operator for a ngraph
6 years ago
mozga-intel e4184008a4 PADDLE_WITH_NGRAPH was removed from the code
6 years ago
Qiao Longfei 3ace486ebd fix sum_op selected rows test=develop
6 years ago
tensor-tang f702f8fd10 Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat
6 years ago
Qiao Longfei b16e832d4d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-dist-sparse-decay
6 years ago
sneaxiy ed409ac9f4 Revert "Revert "Remove op handle lock""
6 years ago
Tao Luo 4d9aa1745a Merge pull request #14806 from mozga-intel/mozga-intel/scale_operator_ngraph
6 years ago
Tao Luo dc0c221426 Merge pull request #14803 from mozga-intel/mozga-intel/mean_operator_ngraph
6 years ago
Zeng Jinle dacfaaa966 Revert "Remove op handle lock"
6 years ago
Qiyang Min 317840d3ba Merge pull request #14277 from velconia/add_fused_emb_seq_pool_op
6 years ago
tensor-tang 2dd331cc21 Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat
6 years ago
tensor-tang 316636404f add seqpool concat unit test
6 years ago
xiaolil1 c8f101e5da Conv int8 relu (#15130)
6 years ago
tensor-tang 7923d7271f add fusion seqpool concat op
6 years ago
Zeng Jinle f3a13512fc Merge pull request #15139 from sneaxiy/remove_op_handle_lock
6 years ago
Qiao Longfei 44b300556d change min_row_size_to_use_multithread to parameter of adam
6 years ago
Qiao Longfei 87b4eb1da4 change min_param_size_to_use_multithread to min_row_size_to_use_multithread
6 years ago
minqiyang 0f94c1ac14 Polish code
6 years ago
minqiyang c09a379015 remove const_cast
6 years ago
tensor-tang 102d93712e Merge remote-tracking branch 'ups/develop' into jit/seqpool
6 years ago
tensor-tang 123b98f417 refine heigth and codesize and support all pool
6 years ago
tensor-tang 0145f40f45 use height from params of jitcode
6 years ago
tensor-tang e0591deebc enhance seqpool jitcode
6 years ago
Zeng Jinle 99e6e8b00f Merge pull request #15179 from sneaxiy/fix_crf_grad_lod
6 years ago
minqiyang db8eb9b688 Polish code
6 years ago
minqiyang 39b98709b1 Move fused ops to fused dir
6 years ago
minqiyang 920d4a8b78 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_fused_emb_seq_pool_op
6 years ago
乔龙飞 Qiao Longfei 7c891e1ecc Merge pull request #15111 from jacquesqiao/fix-adam-tmp-var
6 years ago
mozga-intel e77956c920 Enable mean operator for a ngraph
6 years ago
mozga-intel dd768714ab Enable scale operator for a ngraph
6 years ago
sneaxiy be425461a1 fix crf grad lod share
6 years ago
Qiao Longfei 3e1b914fcb update gru op forward kernel
6 years ago
Qiao Longfei 7a81ab8607 complete gru_unite_op and test
6 years ago
Qiao Longfei 72618c8da5 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gru-add-mode
6 years ago
Qiao Longfei 17b1b660fc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
6 years ago
Qiao Longfei c15270c5b2 optimize multi thread adam
6 years ago
乔龙飞 Qiao Longfei e1679b8847 Merge pull request #14893 from JiabinYang/feature/add_prefech_hs
6 years ago
tensor-tang 92201d3956 support avg and sqrt pool and add mkl impl
6 years ago
tensor-tang c50060bb26 add jitcode impl and use it
6 years ago
tensor-tang 142bb41748 add seqpool jitkernel test and benchmark
6 years ago
tensor-tang e58a569c6c use seqpool jitkernel
6 years ago
tensor-tang 3e01a4048f add refer seqpool jitkernel
6 years ago
xiaolil1 bbc9336878 Enable basic MKL-DNN INT8 Conv OP (#15124)
6 years ago
Qiao Longfei e10af895de update gru grad op
6 years ago
Qiao Longfei 78ec7c0f99 gru add origin mode
6 years ago
Yan Xu a1e60ab19b Merge pull request #14791 from Yancey1989/parallel_graph_mode
6 years ago
Qiao Longfei 0e747e8d02 change the limit of thead num
6 years ago
qingqing01 c981bf0f9d Fix compling error with cuDNN v5 (#15148)
6 years ago
wopeizl 67093da398 Merge pull request #15122 from wopeizl/windows/fixhuberloss
6 years ago
sneaxiy d0a8a1e950 remove_op_handle_lock
6 years ago
Xin Pan 087af6a686 Merge pull request #15131 from panyx0718/clean
6 years ago
Yancey1989 e65436103f Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
sneaxiy 6f06e6cdac Merge remote origin
6 years ago
xiaolil1 8eb1f26211 Enable INT8 pool OP (#15046)
6 years ago
Xin Pan 9186451f60 hide GetTensor
6 years ago
peizhilin dba009dbbf fix script issue
6 years ago
peizhilin cd2d60b4c8 fix build issue for density prior box op on windows test=develop
6 years ago
peizhilin 1f423f84ac fix the huber loss compile issue on windows test=develop
6 years ago
sneaxiy d25395fc98 remove tensor core lock
6 years ago
peizhilin b3688100ad fix unittest
6 years ago
peizhilin 5d8f281397 restore the memory mode
6 years ago
peizhilin 33b7821a75 fix save and load ops on windows test=develop
6 years ago
Qiao Longfei dfe85fb358 fix build
6 years ago
Qiao Longfei f057bbd1d1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-adam-tmp-var
6 years ago
Qiao Longfei f1c973b014 adam op should not create tmp var in compute
6 years ago
Yancey1989 0a885ac12a Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
shippingwang 83f2e2c903 rewrite the comments, test=develop
6 years ago
gongweibao ce70229ba6 Add max_body_size flags to brpc (#15084)
6 years ago
qingqing01 6f0a1d7b47 Inception fusion operator. (#14968)
6 years ago
Qiao Longfei 25d44d40ac sum op support empty selected rows as input
6 years ago
Zeng Jinle 25b49a0896 Merge pull request #14933 from sneaxiy/rewrite_ddim
6 years ago
Wu Yi a8bc05b5ff Refactor distributed RPC (#15075)
6 years ago
Xin Pan 3e8408429d Merge pull request #15053 from panyx0718/imperative_hold
6 years ago
sneaxiy 73896eeb94 merge develop
6 years ago
Yancey1989 4743c9cd5d Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Xin Pan f7294f8b25 register float16
6 years ago
Zhaolong Xing 4048cfa9da Merge pull request #15048 from NHZlX/add_affine_channel_fuse
6 years ago
Zeng Jinle c0bcff00dc Merge pull request #14962 from sneaxiy/rewrite_variable_type
6 years ago
Qiao Longfei d161215332 optimize adam multi thread
6 years ago
wopeizl 719ebe3786 Merge pull request #15070 from wopeizl/windows/testcasefix
6 years ago
Qiao Longfei 7a58ad5c79 lazy mode have higher priority then multithread
6 years ago
Xin Pan f52b514dcd call kernel
6 years ago
Xin Pan 7b6bf9ddf2 make fill_constant kernel-based
6 years ago
Xin Pan 61491ce250 clean
6 years ago
Xin Pan ce7e503cbe refactor to avoid scope.
6 years ago
Qiyang Min 0238a3bb4f Merge pull request #14972 from velconia/accelerate_lstm
6 years ago
Houjiang Chen 242d3c71a6 Merge pull request #15031 from hjchen2/develop
6 years ago
Qiao Longfei d0572bf02e add log for lazy mode test=develop
6 years ago
Qiao Longfei 1177b0bc84 update multi thread adam
6 years ago
Qiao Longfei 3b294e2e2e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
6 years ago
Zeng Jinle 988bc2b5a7 Merge pull request #15060 from dzhwinter/fix/nccl
6 years ago
sneaxiy c4ce2e7b21 merge develop, solve conflict
6 years ago
shippingwang 9322d34032 Fix, test=develop
6 years ago
sneaxiy b56aca82e9 merge develop
6 years ago
jerrywgz ef2d292bfc Merge pull request #14956 from jerrywgz/fix_bug_in_ifelse
6 years ago
peizhilin e49276e731 restore the huber_loss_op
6 years ago
Yancey1989 86bb583881 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
peizhilin 01c00b07dd fix test issues on windows
6 years ago
tangwei12 dc8eca826e code style fix, test=develop (#15045)
6 years ago
Yancey1989 41a64f6a2a Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
hjchen2 956cf92145 Fix conv_elementwise_add2_act pass
6 years ago
whs 2314f2ebb3 Make topk op support variable k. (#15044)
6 years ago
shippingwang 5631fc0817 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel
6 years ago
sneaxiy dde3afe7b7 Merge develop
6 years ago
dzhwinter 3ea2f415dc fix ci error. test=develop
6 years ago
Wu Yi 856f0da0fe Fp16 training (#14992)
6 years ago
chengduo b9fb03cf54 Move GetTensor to tensor_util (#15011)
6 years ago
Yihua Xu dbb90a76f0 Merge remote-tracking branch 'paddle/develop' into develop_641313ea7_elementwise_mul_mkldnn_bug_fix
6 years ago
nhzlx 73b47df1f4 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_affine_channel_fuse
6 years ago
nhzlx ce3782c193 add affine_channel fuse.
6 years ago
shippingwang c2575ac72d update API.spec, test=develop
6 years ago
shippingwang ebeee9303e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel
6 years ago
Qiyang Min aba1f9b06e Merge pull request #14891 from velconia/accelerate_adam
6 years ago
shippingwang 0a0b6f4a22 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel
6 years ago
minqiyang 8ec3d863b0 Fix throw_on_error direct call bug
6 years ago
sneaxiy 3a2afbf02e polish code
6 years ago
shippingwang 5a9ea9a73d Add ShuffleChannel Op
6 years ago
tensor-tang 05d1121b22 Merge pull request #14802 from mozga-intel/mozga-intel/fill_constant_operator_ngraph
6 years ago
tensor-tang 9d4f1d468a Merge pull request #14804 from mozga-intel/mozga-intel/top_k_operator_ngraph
6 years ago
tensor-tang 8a6ac4dba7 Merge pull request #14973 from xiaolil1/dequantize
6 years ago
tensor-tang f0e02a65ed Merge pull request #14974 from xiaolil1/quantize
6 years ago
shippingwang 76c6f11584 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel
6 years ago
shippingwang 16d4e13711 Add ShuffleChannelOP
6 years ago
qingqing01 51a9fca323 Async memory copy (#15013)
6 years ago
minqiyang 010f657b33 Polish code
6 years ago
JiabinYang 1a8cbb6799 test=develop, accelerate_hs_op and add prefetch with is_sparse
6 years ago
xiaoli.liu@intel.com 869d444b92 Fix comments misunderstanding
6 years ago
xiaoli.liu@intel.com d83d0f33fd extract templated function
6 years ago
Yihua Xu d4606bcb22 Fix the exception when tensor format is x
6 years ago
minqiyang 8b6b0da062 Use adam_update
6 years ago
minqiyang f4e7a47381 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_adam
6 years ago
tensor-tang 641313ea77 Merge pull request #15000 from tensor-tang/doc/en/jit
6 years ago
minqiyang e811e06555 Avoid comma in macro
6 years ago
minqiyang 0cf1461ccc Avoid comma in macro
6 years ago
wopeizl b117a5f208 Merge pull request #14931 from wopeizl/windows/mkl
6 years ago
tensor-tang 68ab16444a add eng doc of jit kernel and follow comments
6 years ago
chengduo 79bd6dfa18 [Feature] Add Temporary Allocator (#14875)
6 years ago
minqiyang e4719eb462 Fix bug in Windows VC 2010
6 years ago
peizhilin 2e35290ffa fix build issue
6 years ago
Xin Pan 484c24b756 Merge pull request #14996 from tensor-tang/fix/deps
6 years ago
tensor-tang e99e4e994d fix deps of jit benchmark
6 years ago
Tao Luo d4f384767a Merge pull request #14987 from jczaja/prv-transpose-mkldnn-grad
6 years ago
peizhilin 201283f95c fix code style
6 years ago
peizhilin 9e60c58666 Merge remote-tracking branch 'upstream/develop' into windows/mkl
6 years ago
tensor-tang 693e5e65ce Merge pull request #14958 from tensor-tang/refine/jit
6 years ago
minqiyang 0a4b6fc056 Remove unnessesary code
6 years ago
minqiyang 53619a79b4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_lstm
6 years ago
Zeng Jinle 95cbe07c40 Merge pull request #14836 from sneaxiy/feature/py_func
6 years ago
mozga-intel 969f669493 Static specifier was removed
6 years ago
mozga-intel 7048caf9a0 Enable top_k operator for a ngraph
6 years ago
mozga-intel 394635b2ee Static specifier was removed
6 years ago
mozga-intel ecfa68ecaa Enable fill_constant operator for a ngraph
6 years ago
Jacek Czaja d05094eec3 - Added transpose/transpose2 MKLDNN grad ops
6 years ago
Tao Luo 55af116861 Merge pull request #14966 from jczaja/prv-transpose-mkldnn-reuse-PR
6 years ago
sneaxiy c1f7e54f62 merge develop
6 years ago
Xin Pan 2537ac512e Merge pull request #14983 from typhoonzero/revert_14850
6 years ago
chengduo a015a8a39d Refine merge_selected_rows Doc (#14748)
6 years ago
typhoonzero da87f7a698 Revert "[Feature] Fp16 training for resnet50 (#14850)"
6 years ago
tensor-tang 1aaec571c2 fix enum style
6 years ago
Qiao Longfei d76bda50c4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
6 years ago
sneaxiy 490eb9061f polish infer shape of py_func op
6 years ago
tensor-tang facfecbdd0 follow comment: reuse time function and change to upper case
6 years ago
tensor-tang f5532877f7 Merge remote-tracking branch 'ups/develop' into refine/jit
6 years ago
Yihua Xu 3babc80160 Fix the regression issue and add the group unitest for conv2d (#14932)
6 years ago
Xin Pan a872eb90c2 Merge pull request #14959 from panyx0718/clean2
6 years ago
minqiyang 6467eda534 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_adam
6 years ago
Wu Yi 3d750f9c5a [Feature] Fp16 training for resnet50 (#14850)
6 years ago
JiabinYang 1bec52f581 test=develop, fix cpu running error
6 years ago
minqiyang 679d1a9e0b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_lstm
6 years ago
Wang Guibao e7675dd542 Avoid calling VLOG() when program exiting (#14937)
6 years ago
heqiaozhi f18e8a7a5e remove some comments & refine doc & put template class in .h
6 years ago
sneaxiy 83ac85158a polish code
6 years ago
sneaxiy 045dc12766 merge develop
6 years ago
minqiyang 454db6662e Accelerate lstm
6 years ago
xiaoli.liu@intel.com 3146629485 Add Dequantize OP
6 years ago
xiaoli.liu@intel.com 019dbf7f5f Add Quantize OP
6 years ago
tensor-tang a369c8025b fix typo of mix impl of lstm
6 years ago
tensor-tang d651a91138 fix build on win, fix use condition of crf decoding and layer norm and
6 years ago
JiabinYang bfcb5e5235 test=develop, fix gpu compile error on prefetch, and fix hs/nce ut failed on gpu
6 years ago
tensor-tang d53c4756ad clean code and remove unused files
6 years ago
tensor-tang 95fb31285c Merge remote-tracking branch 'ups/develop' into refine/jit
6 years ago
Xin Pan cf3a07e8f8 Merge pull request #14878 from panyx0718/imperative
6 years ago
peizhilin 9f55f1ff50 use the platform api to decide the specific instruction support or not
6 years ago
tensor-tang c187a7c618 add more impls of lstm and gru and fix build on win
6 years ago
heqiaozhi 39f4e9273e data_norm
6 years ago
sneaxiy 74a8e6b032 merge develop
6 years ago
Xin Pan 1fe3ac352a move more and fix while
6 years ago
sneaxiy ae6f46a1a9 rewrite variable type
6 years ago
Jacek Czaja 709d9e3cb7 - Added reusing MKL-DNN primitives for Transpose MKL-DNN op
6 years ago
peizhilin 0b4f742e8a fix the build issue
6 years ago
peizhilin da42cf2055 fix build issue when xbyak is disabled on windows
6 years ago
tensor-tang 83d075aa79 fix lstm and gru jitcode
6 years ago
peizhilin 1cc9d59838 disable xbyak on windows
6 years ago
Xin Pan 876993887b convert more interface to avoid scope
6 years ago
tensor-tang 20392be001 Merge remote-tracking branch 'ups/develop' into refine/jit
6 years ago
tensor-tang f332f589bc add more impls of sigmoid and vtanh
6 years ago
jerrywgz dda28b0e68 fix bug in if-else op, test=develop
6 years ago
JiabinYang 4877f5d71f test=develop, fix compile error under gpu mode
6 years ago
JiabinYang 8515ee3a29 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_prefech_hs
6 years ago
JiabinYang 5ec9b37798 test=develop, fix compile error under gpu mode
6 years ago
heqiaozhi a94285869b add API
6 years ago
mozga-intel 9035bb81fe Enable mul operator for a ngraph engine (#14801)
6 years ago
tensor-tang ea259c6363 enable layer norm intrinsic code
6 years ago
gongweibao b849157e9d Add size enforce (#14919)
6 years ago
heqiaozhi 5c7a8aee07 merge upstream to my develop
6 years ago
Jacek Czaja aa6e9c30be [MKL-DNN ]Added transpose/transpose2 Op (#14872)
6 years ago
heqiaozhi 4f6e9e3ac3 teacher student sigmoid loss
6 years ago
peizhilin 07c7eaabb4 Merge remote-tracking branch 'upstream/develop' into windows/mkl
6 years ago
wopeizl 6c66b3d496 Merge pull request #14943 from wopeizl/windows/ctc
6 years ago
Xin Pan dfcf746ea1 Merge pull request #14904 from panyx0718/clean2
6 years ago
tensor-tang b1516783ea enable crf decoding intrinsic code
6 years ago
tensor-tang 4cc7707d28 add crf_decoding and layer norm intrisic code
6 years ago
tensor-tang 10c340c9a3 fix confilcts
6 years ago
tensor-tang 893957f711 Merge remote-tracking branch 'ups/develop' into refine/jit
6 years ago
tensor-tang 6648995f53 fix build
6 years ago
JiabinYang 3b7b2e1ded Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_prefech_hs
6 years ago
Xin Pan 6324032602 MLP forward backward
6 years ago
peizhilin 19ebd8b4cf add ctc support for windows
6 years ago
Xin Pan c89a1fb287 Merge pull request #14879 from panyx0718/clean
7 years ago
Qiao Longfei 3f3a84b6dc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
7 years ago
Qiao Longfei e2d56561e7 Merge pull request #14889 from jacquesqiao/optimize-adam
7 years ago
sneaxiy a500dfa579 rewrite ddim
7 years ago
JiabinYang b5fa916413 fix bug after merge reyoung optimization, test=develop
7 years ago
sneaxiy dc8847af87 add examples and comments
7 years ago