Commit Graph

5798 Commits (23bdd0a223cc3e88c62fb8f48155c83455c9fede)

Author SHA1 Message Date
dongdaxiang f9b8168508 Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
6 years ago
dongdaxiang 3b3cb4ea55 Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
6 years ago
minqiyang 52b4821a6e Fix Sprintf problem
6 years ago
qingqing01 51a9fca323 Async memory copy (#15013)
6 years ago
minqiyang 010f657b33 Polish code
6 years ago
JiabinYang 1a8cbb6799 test=develop, accelerate_hs_op and add prefetch with is_sparse
6 years ago
sneaxiy e02f67eff7 rewrite unsafe_cast
6 years ago
minqiyang 45acfbd011 1. Add specific condition for one or no arg in PADDLE_ENFORCE
6 years ago
minqiyang 68b86d6665 Change default value to align with the original react
6 years ago
whs 938705745e Init paddle slim (#14834)
6 years ago
dongdaxiang 2dee8f6cd5 add TrainFilesWithTimer in async_executor
6 years ago
xiaoli.liu@intel.com 869d444b92 Fix comments misunderstanding
6 years ago
xiaoli.liu@intel.com d83d0f33fd extract templated function
6 years ago
dongdaxiang d434fcbaa6 add TrainFilesWithTimer in async_executor
6 years ago
minqiyang 250e893745 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
6 years ago
minqiyang 8b6b0da062 Use adam_update
6 years ago
minqiyang f4e7a47381 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_adam
6 years ago
minqiyang b1d0a14c14 Change the ut back
6 years ago
minqiyang 7d1533216d Fix syntax error in unit test
6 years ago
tensor-tang 641313ea77 Merge pull request #15000 from tensor-tang/doc/en/jit
6 years ago
minqiyang e811e06555 Avoid comma in macro
6 years ago
minqiyang 0cf1461ccc Avoid comma in macro
6 years ago
wopeizl b117a5f208 Merge pull request #14931 from wopeizl/windows/mkl
6 years ago
Xin Pan 103f08f50e Merge pull request #14910 from panyx0718/clean3
6 years ago
dongdaxiang cf6188a823 add a linux timer
6 years ago
Zeng Jinle 0021b05b19 Merge pull request #14993 from sneaxiy/fix_check_lod
6 years ago
tensor-tang 68ab16444a add eng doc of jit kernel and follow comments
6 years ago
chengduo 79bd6dfa18 [Feature] Add Temporary Allocator (#14875)
6 years ago
minqiyang e4719eb462 Fix bug in Windows VC 2010
6 years ago
sneaxiy a30c5373eb use std::is_sorted
6 years ago
minqiyang 5a5c577529 Polish code
6 years ago
minqiyang 099186cd41 Support one argument PADDLE_ENFORCE
6 years ago
peizhilin 2e35290ffa fix build issue
6 years ago
Xin Pan 484c24b756 Merge pull request #14996 from tensor-tang/fix/deps
6 years ago
tensor-tang e99e4e994d fix deps of jit benchmark
6 years ago
Tao Luo 22c71398e3 add MM_DNN inference test
6 years ago
minqiyang 8149a07a41 Fix wait stream two times bug
6 years ago
Tao Luo d4f384767a Merge pull request #14987 from jczaja/prv-transpose-mkldnn-grad
6 years ago
minqiyang 4af97c6946 Polish code
6 years ago
peizhilin 201283f95c fix code style
6 years ago
minqiyang 41b81293ab Polish code
6 years ago
peizhilin 9e60c58666 Merge remote-tracking branch 'upstream/develop' into windows/mkl
6 years ago
minqiyang bc66401566 Polish code
6 years ago
tensor-tang 693e5e65ce Merge pull request #14958 from tensor-tang/refine/jit
6 years ago
minqiyang 74ead6ff35 Polish code
6 years ago
sneaxiy b8051e7927 merge develop
6 years ago
Tao Luo df1e4e2f10 fix check_lod
6 years ago
minqiyang 0a4b6fc056 Remove unnessesary code
6 years ago
minqiyang 53619a79b4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_lstm
6 years ago
minqiyang 6fabbd8fb8 Polish code and remove spin lock
6 years ago
minqiyang 3de0f612e8 Polish code
6 years ago
Zeng Jinle 95cbe07c40 Merge pull request #14836 from sneaxiy/feature/py_func
6 years ago
mozga-intel 969f669493 Static specifier was removed
6 years ago
mozga-intel 7048caf9a0 Enable top_k operator for a ngraph
6 years ago
Tao Luo d49990e447 Merge pull request #14982 from luotao1/diff_test
6 years ago
mozga-intel 394635b2ee Static specifier was removed
6 years ago
mozga-intel ecfa68ecaa Enable fill_constant operator for a ngraph
6 years ago
sneaxiy 600f6d8272 polish code
6 years ago
minqiyang 9e24fa3aeb Polish code
6 years ago
minqiyang 1c116462cc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
6 years ago
minqiyang 29697c2e25 Add stop_gradient to VarBase to support loss function
6 years ago
Jacek Czaja d05094eec3 - Added transpose/transpose2 MKLDNN grad ops
6 years ago
Tao Luo 55af116861 Merge pull request #14966 from jczaja/prv-transpose-mkldnn-reuse-PR
6 years ago
sneaxiy 7f6e513b1f fix mac ci bug
6 years ago
sneaxiy c1f7e54f62 merge develop
6 years ago
Xin Pan 2537ac512e Merge pull request #14983 from typhoonzero/revert_14850
6 years ago
chengduo a015a8a39d Refine merge_selected_rows Doc (#14748)
6 years ago
typhoonzero da87f7a698 Revert "[Feature] Fp16 training for resnet50 (#14850)"
6 years ago
peizhilin b06ce129bc some not so useful adjust
6 years ago
sneaxiy 89b9d86d9d fix windows compile bug
6 years ago
tensor-tang 1aaec571c2 fix enum style
6 years ago
sneaxiy 490eb9061f polish infer shape of py_func op
6 years ago
tensor-tang facfecbdd0 follow comment: reuse time function and change to upper case
6 years ago
tensor-tang f5532877f7 Merge remote-tracking branch 'ups/develop' into refine/jit
6 years ago
Yihua Xu 3babc80160 Fix the regression issue and add the group unitest for conv2d (#14932)
6 years ago
Xin Pan 969ad966ba all converted
6 years ago
Xin Pan a872eb90c2 Merge pull request #14959 from panyx0718/clean2
6 years ago
minqiyang 6467eda534 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_adam
6 years ago
sneaxiy 13429c3e9f clean code, remove void registration
6 years ago
chengduo 550e7e410b Code Clean parallel_executor.py (#14849)
6 years ago
Wu Yi 3d750f9c5a [Feature] Fp16 training for resnet50 (#14850)
6 years ago
luotao1 13367866cd add deterministic result unit-test
6 years ago
tangwei12 45dd3491db lookup table utils fix (#14730)
6 years ago
JiabinYang 1bec52f581 test=develop, fix cpu running error
6 years ago
minqiyang 679d1a9e0b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_lstm
6 years ago
Wang Guibao e7675dd542 Avoid calling VLOG() when program exiting (#14937)
6 years ago
sneaxiy 83ac85158a polish code
6 years ago
sneaxiy 045dc12766 merge develop
6 years ago
minqiyang 454db6662e Accelerate lstm
6 years ago
xiaoli.liu@intel.com 3146629485 Add Dequantize OP
6 years ago
xiaoli.liu@intel.com 019dbf7f5f Add Quantize OP
6 years ago
minqiyang 3cd10a7c4f Add Conv2D forward
6 years ago
sneaxiy ce4a26ddad clean code
6 years ago
tensor-tang a369c8025b fix typo of mix impl of lstm
6 years ago
tensor-tang d651a91138 fix build on win, fix use condition of crf decoding and layer norm and
6 years ago
tensor-tang 141ebcd4aa Merge remote-tracking branch 'ups/develop' into refine/jit
6 years ago
JiabinYang bfcb5e5235 test=develop, fix gpu compile error on prefetch, and fix hs/nce ut failed on gpu
6 years ago
Tomasz Patejko e9eee0de6a MKLDNN residual connection fuse: fixing accuracy problem (#14874)
6 years ago
sneaxiy 53f6c6991a polish code
6 years ago
tensor-tang d53c4756ad clean code and remove unused files
6 years ago
tensor-tang 95fb31285c Merge remote-tracking branch 'ups/develop' into refine/jit
6 years ago
Xin Pan cf3a07e8f8 Merge pull request #14878 from panyx0718/imperative
6 years ago
peizhilin 9f55f1ff50 use the platform api to decide the specific instruction support or not
6 years ago
tensor-tang c187a7c618 add more impls of lstm and gru and fix build on win
6 years ago
sneaxiy 74a8e6b032 merge develop
6 years ago
Xin Pan 1fe3ac352a move more and fix while
6 years ago
sneaxiy ae6f46a1a9 rewrite variable type
6 years ago
Jacek Czaja 709d9e3cb7 - Added reusing MKL-DNN primitives for Transpose MKL-DNN op
6 years ago
peizhilin 0b4f742e8a fix the build issue
6 years ago
peizhilin da42cf2055 fix build issue when xbyak is disabled on windows
6 years ago
tensor-tang 83d075aa79 fix lstm and gru jitcode
6 years ago
peizhilin 1cc9d59838 disable xbyak on windows
6 years ago
Xin Pan 9ef8a76873 convert more
6 years ago
Xin Pan 876993887b convert more interface to avoid scope
6 years ago
tensor-tang 20392be001 Merge remote-tracking branch 'ups/develop' into refine/jit
6 years ago
tensor-tang f332f589bc add more impls of sigmoid and vtanh
6 years ago
peizhilin 40a94a138f remove irrelevant fix for mkl
6 years ago
Xin Pan 8c19f0bfe3 fix
6 years ago
jerrywgz dda28b0e68 fix bug in if-else op, test=develop
6 years ago
minqiyang 8d88c5a87d Shameless copy
6 years ago
JiabinYang 4877f5d71f test=develop, fix compile error under gpu mode
6 years ago
JiabinYang 8515ee3a29 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_prefech_hs
6 years ago
JiabinYang 5ec9b37798 test=develop, fix compile error under gpu mode
6 years ago
mozga-intel 9035bb81fe Enable mul operator for a ngraph engine (#14801)
6 years ago
tensor-tang ea259c6363 enable layer norm intrinsic code
6 years ago
gongweibao b849157e9d Add size enforce (#14919)
6 years ago
Xin Pan 4dd61e7260 convert GetInputVarPtrs and GetOutputVarPtrs
6 years ago
Jacek Czaja aa6e9c30be [MKL-DNN ]Added transpose/transpose2 Op (#14872)
6 years ago
peizhilin 07c7eaabb4 Merge remote-tracking branch 'upstream/develop' into windows/mkl
6 years ago
wopeizl 6c66b3d496 Merge pull request #14943 from wopeizl/windows/ctc
6 years ago
Xin Pan 52d3903a12 fix
6 years ago
Xin Pan 0e0983cc1d convert more infer shape
6 years ago
Xin Pan 62eb43ba98 convert more
6 years ago
Xin Pan dfcf746ea1 Merge pull request #14904 from panyx0718/clean2
6 years ago
tensor-tang b1516783ea enable crf decoding intrinsic code
6 years ago
tensor-tang 4cc7707d28 add crf_decoding and layer norm intrisic code
6 years ago
tensor-tang 10c340c9a3 fix confilcts
6 years ago
tensor-tang 893957f711 Merge remote-tracking branch 'ups/develop' into refine/jit
6 years ago
tensor-tang 6648995f53 fix build
6 years ago
JiabinYang 3b7b2e1ded Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_prefech_hs
6 years ago
Xin Pan 6324032602 MLP forward backward
6 years ago
peizhilin ed5bd5e586 test=develop
6 years ago
peizhilin 19ebd8b4cf add ctc support for windows
6 years ago
Xin Pan c89a1fb287 Merge pull request #14879 from panyx0718/clean
6 years ago
Qiao Longfei e2d56561e7 Merge pull request #14889 from jacquesqiao/optimize-adam
6 years ago
sneaxiy a500dfa579 rewrite ddim
6 years ago
JiabinYang b5fa916413 fix bug after merge reyoung optimization, test=develop
6 years ago
sneaxiy dc8847af87 add examples and comments
6 years ago
Zeng Jinle 16c244bc3f Merge pull request #14928 from sneaxiy/fix_gc
6 years ago
Xin Pan f897bd16c0 clean
6 years ago
peizhilin fa135bbf52 Fix the mkl build script on windows
6 years ago
Xin Pan 70981f5d79 clean
6 years ago
colourful-tree 44ad2f4479 Merge pull request #14873 from colourful-tree/develop
6 years ago
minqiyang 69642000dc Hide KeyHasher
6 years ago
minqiyang a3fa3f85d7 Polish code
6 years ago
Yu Yang 2803cf5776 Merge pull request #14868 from reyoung/feature/refine_w2v
6 years ago
Tao Luo 6aa6b8cfa0 Merge pull request #14918 from luotao1/mobilenet_test
6 years ago
Zhaolong Xing a9fb34fad8 Merge pull request #14903 from NHZlX/add_conv_elementwise_pass
6 years ago
peizhilin b601f2de8d include the mkl fix only
6 years ago
dzhwinter 7cd24b1318 add ir memory optimize. (#14530)
6 years ago
Xin Pan fb8ae30331 fix
6 years ago
Qiyang Min fd1d2c897e Merge pull request #14894 from velconia/add_huber_regression_loss_op
6 years ago
Tao Luo 2f55a04ec6 add refer result comparasion
6 years ago
guru4elephant a79a3ea2f0 Merge branch 'develop' into develop
6 years ago
peizhilin 5a6d7fe2ff add mkl,ctc support for windows
6 years ago
wopeizl 0f085f0a5a Merge pull request #14892 from wopeizl/windows/port3
6 years ago
JiabinYang 656040c726 merge reyoung optimization
6 years ago
Yancey1989 06936a2ff5 fix 1gpu test=develop
6 years ago
sneaxiy c631412eab fix gc bug
6 years ago
Zeng Jinle 36a1d021a4 Merge pull request #14927 from sneaxiy/fix_cuda_stream_callback_in_cuda10
6 years ago
Xin Pan eaf8ba35b5 change input
6 years ago
Xin Pan 840e6729e2 inject context
6 years ago
Xin Pan bbff0df320 try cache variables
6 years ago
Xin Pan 52bc4ee75a delay infer scope
6 years ago
wopeizl fa78fc60be Merge pull request #14907 from wopeizl/windows/avx
6 years ago
Qiao Longfei fe3995d335 refine code test=develop
6 years ago
tensor-tang 74292f414c enable eltwise nchw16c mul nc
6 years ago
Qiao Longfei 56686d0f34 clean code test=develop
6 years ago
Qiao Longfei fd152289fa clean for range in test=develop
6 years ago
sneaxiy 2373aeb5e8 fix bug
6 years ago
nhzlx 050a68dde3 fix comments
6 years ago
tensor-tang 720b55cbcf enable crf decoding and layer norm refer code
6 years ago
tensor-tang 64a90b2f1c use vadd, vaddrelu, lstm and gru jitkernel
6 years ago
Qiao Longfei 1141db8114 update test_adam_op
6 years ago
gongweibao addded48e1 test=develop (#14898)
6 years ago
Tao Luo 1a6d2cfe39 add test_analyzer_mobilenet
6 years ago
Yancey1989 d3a4da5cf6 fix comment test=develop
6 years ago
Yancey1989 49870f507d delete unused code test=develop
6 years ago
minqiyang 27a0d6c2dc Polish code
6 years ago
minqiyang aa41ee75a1 Accelerate PADDLE_ENFORCE
6 years ago
Qiao Longfei 96604fda10 fix gpu data
6 years ago
nhzlx fcc93d96d5 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_conv_elementwise_pass
6 years ago
dongdaxiang bc4f16ca6f remove some comments
6 years ago
peizhilin 41456e1723 Remove the useless definition
6 years ago
minqiyang 24eb8f038c Fix bug
6 years ago
Yancey1989 a7d6b1f921 code cleanup test=develop
6 years ago
Yu Yang 740e1626ce Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_w2v
6 years ago
minqiyang 728e7e88fb Use xxHash as scope's hash algorithm
6 years ago
Yancey1989 a760a550b0 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Yancey1989 fd144954ed redefine api test=develop
6 years ago