Commit Graph

3789 Commits (6c641827092fb10f6eeb56477819c76f2b331969)

Author SHA1 Message Date
dengkaipeng c0fa8d2eec use L1Loss for w, h. test=develop
6 years ago
dengkaipeng 3841983aa0 fix division error in mean process. test=develop
6 years ago
dengkaipeng 192d293854 use stable Sigmoid Cross Entropy implement. test=develop
6 years ago
tink2123 909f864a9b remove unnecessary flags
6 years ago
tink2123 6961a94e94 avoid out_size less than 1
6 years ago
jerrywgz 7bc8481c62
Merge pull request #15418 from jerrywgz/refine_nms
6 years ago
tensor-tang d59f733551 refine softmax and use with cache
6 years ago
tensor-tang 7383eefd2d add softmax mix and mkl code
6 years ago
tensor-tang 50945685f2 add hmax, hsum jitcode
6 years ago
tensor-tang 8117725852 add jit kernel hsum, hmax and softmax refer code
6 years ago
Zeng Jinle bf7dedcbc7
Merge pull request #15545 from sneaxiy/fix_debug_nccl_error
6 years ago
dzhwinter ee3aae56cd merge develop branch. test=develop
6 years ago
jerrywgz cee2e1b089 refine code, test=develop
6 years ago
sneaxiy ba4f43fd62 fix compile error in distributed mode
6 years ago
tink2123 a0c63f1106 add align_flag
6 years ago
Tao Luo b919190232
Merge pull request #15531 from jczaja/prv-googlenet-fix
6 years ago
Zhaolong Xing 97b76c94c4
Merge pull request #15242 from NHZlX/trt_int8_ultimate_version
6 years ago
Kaipeng Deng aeca5c50b2 fix grid_sampler PADDLE_ENFORCE error. test=develop (#15542)
6 years ago
乔龙飞 Qiao Longfei 5f89ce7fcd
Merge pull request #15536 from jacquesqiao/fix-prefetch-one-parameter
6 years ago
Jacek Czaja 5885c5cdf6 - Added explanation to LRN MKL-DNN op on alpha modification
6 years ago
Jacek Czaja 4aa7ef3c13 - Compensation fix to LRN MKL-DNN op
6 years ago
Qiao Longfei 806658d72b add space after colon in commnet test=develop
6 years ago
nhzlx b43ea40c51 delete the usage of the const_cast
6 years ago
baojun-nervana 8e9308a51a mv ngraph_bridge to ngraph directory test=develop
6 years ago
Qiao Longfei 4d13434443 fix a little problem test=develop
6 years ago
Qiao Longfei 9c3910f390 IncreaseBatchBarrier should be in the right condition test=develop
6 years ago
ruri 88bd7e1a61
Merge pull request #15027 from shippingwang/shufflechannel
6 years ago
Jacek Czaja fa286b1052 LRN reengineering
6 years ago
nhzlx 92cf4a4c6b fix comments
6 years ago
tensor-tang e043ea9653
Merge pull request #15515 from tensor-tang/jit/benchmark
6 years ago
Qiao Longfei 5a0c6593d5 revert RequestGetHandler
6 years ago
jerrywgz 466a10dcdd refine code, test=develop
6 years ago
乔龙飞 Qiao Longfei c58555067e
Merge pull request #14731 from jacquesqiao/optimize-cpp-reader
6 years ago
jerrywgz a39240c3b6 add attr variance for box coder, test=develop
6 years ago
gongweibao d54494ba87
cleanup test=develop (#15347)
6 years ago
Qiao Longfei 84220765a7 refine code, add more log
6 years ago
Qiao Longfei c750be6d9d add some log
6 years ago
gongweibao fe8f28c957
Add GetVariableNoBarrier on brpc. (#15488)
6 years ago
tangwei12 981fc2bdba
fix bug in merge_ids (#15503)
6 years ago
baojun efce25673c Adding ngraph_engine_op (#14948)
6 years ago
chengduo f8f91fb4b3
Revert conv transpose cudnn (#15514)
6 years ago
tensor-tang b67584a6e9 jit benchmark use tensor
6 years ago
Yiqun Liu 3008fa1261
Add the CUDA kernel for beam_search op (#15020)
6 years ago
tink2123 78145c7dff modified some comments
6 years ago
nhzlx 027d24c831 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version
6 years ago
chengduo bf91d11ed5
Clean elementwise_op_function (#15502)
6 years ago
tangwei12 5cfc40dea8
nce add check sample lables, test=develop (#15463)
6 years ago
tink2123 e448bdb298 modified some comments
6 years ago
tink2123 88744e4ab8 fixed some errors
6 years ago
jerrywgz 9eb2d7b3e1 refine code, test=develop
6 years ago
jerrywgz 6dfd789bfc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_nms
6 years ago
jerrywgz 6928f8318f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_axis_for_boxcoder
6 years ago
jerrywgz e60c8438fc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_clip_op
6 years ago
tink2123 48cc484643 add align_corners and align_mode for image_resize
6 years ago
jerrywgz 11f1baa406 refine code, test=develop
6 years ago
Zhaolong Xing b7b68f2a8c
Merge pull request #15461 from NHZlX/fix_trt_stream_bug
6 years ago
tangwei12 8b50ad80ff
checkpoint at distributed training (#14854)
6 years ago
jerrywgz 57e5f61ec8 add gpu kernel, test=develop
6 years ago
jerrywgz cc53453057 add comment and refine code, test=develop
6 years ago
qingqing01 07dc5a1506
Add generate_mask_labels_op to support Mask-RCNN and refine some code. (#15371)
6 years ago
Yiqun Liu eaad3e4c3d
Add check of input in sequence_expand op. (#15466)
6 years ago
gongweibao f4dec5cdee
Check collective server's data. (#15449)
6 years ago
jerrywgz c12a969bd4 refine comment and unittest, test=develop
6 years ago
chengduo 5a8bd82c0c
Remove workspace_handle (#15376)
6 years ago
jerrywgz 1c558ad388 add gpu kernel for box clip, test=develop
6 years ago
nhzlx 5b92ddabe2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_trt_stream_bug
6 years ago
nhzlx 2f4aee361a fix comments
6 years ago
nhzlx ec213730bc fix trt stream bug.
6 years ago
wopeizl a8aa79130b
Merge pull request #15453 from wopeizl/fix15313
6 years ago
gongweibao 7f8b40f68d
Fix brpc complation error. (#15451)
6 years ago
jerrywgz 0d4b60ab8b add lod for slice op, test=develop
6 years ago
dzhwinter 8f3b252392 squash commits. test=develop
6 years ago
peizhilin e6a3a3a31a fix pr 15313
6 years ago
jerrywgz 66bb5dd760 refine infer shape, test=develop
6 years ago
tensor-tang 266e625d2e
Merge pull request #15399 from tensor-tang/refine/seqpool/fc
6 years ago
Qiao Longfei 45578c1b48 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
6 years ago
Yan Chunwei 885c4e57ab
fea/infer memory optim2 (#14953)
6 years ago
jerrywgz 0d91507859 fix share lod, test=develop
6 years ago
Tao Luo 6597ccb01f
Merge pull request #15413 from luotao1/legacy_code
6 years ago
Dun 9f8f0fc2d3 Memory optimization of depthwise conv op and group norm op (#15313)
6 years ago
jerrywgz 5246285e34 test=develop
6 years ago
jerrywgz b10d84bc5a fix bug when run on GPU, test=develop
6 years ago
whs 530869f829
Share LoD from Input(Rois). (#15420)
6 years ago
gongweibao 7ab4af2716
Fix brpc compilation. (#15417)
6 years ago
Dun Liang e5004f3c1c fix ci && test=develop
6 years ago
tensor-tang 316e44b1b7 fix unused warnings
7 years ago
Wu Yi 7e651a38dd
fix mac cmake version 3.13 build (#15386)
7 years ago
jerrywgz b62a17bbae add nms api
7 years ago
tensor-tang 579d758254 fix jitkernel tests and refine benchmark
7 years ago
jerrywgz f660553d77 enhance nms for mask rcnn, test=develop
7 years ago
shippingwang 14f2a1060d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel
7 years ago
jerrywgz 88ee56d0b2 enhance nms for mask rcnn
7 years ago
zhaozhehao e2ba9668b4 Tree conv op (#15217)
7 years ago
Tao Luo 3ede8b67e6 update CMakeLists.txt
7 years ago
Yiqun Liu f413b6892b
Revert the modification of while_op in #14764. (#15372)
7 years ago
jerrywgz ab9d6a4f39 add comments, test=develop
7 years ago
jerrywgz 10dd3b37ad add axis for box coder op
7 years ago
乔龙飞 Qiao Longfei adba4384ec
Merge pull request #15161 from jacquesqiao/gru-add-mode
7 years ago
nhzlx 8817841c73 fix unit test bug
7 years ago
jerrywgz 5fb2856584 test_develop
7 years ago
Xin Pan 3ecf6bb338
Merge pull request #15028 from yihuaxu/develop_641313ea7_elementwise_mul_mkldnn_bug_fix
7 years ago
jerrywgz af448373c7 test=develop
7 years ago
nhzlx b938324381 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into trt_int8_ultimate_version
7 years ago
nhzlx 312fe0ece1 add trt int8 calibration support
7 years ago
wopeizl 994e73f685
Merge pull request #15351 from wopeizl/fixbuildissue
7 years ago
jerrywgz 481d8bce2f add box clip op
7 years ago
Yiqun Liu 568cc2ffa8
Optimize while_op for test (#14764)
7 years ago
tensor-tang 3759c1db8c
Merge pull request #14805 from mozga-intel/mozga-intel/element_wise_operator_ngraph
7 years ago
tensor-tang 904a39239d
Merge pull request #15254 from mozga-intel/mozga-intel/softmax_operator_ngraph
7 years ago
peizhilin cd562f8fb7 disable the parallel mode for adam op on windows test=develop
7 years ago
Xin Pan 16cb3ebd68
Merge pull request #15268 from xiaolil1/pool-int8
7 years ago
tensor-tang a7fc3d42a0
Merge pull request #15304 from tensor-tang/fuse/second_order_mul_sub
7 years ago
mozga-intel cba729404d Enable softmax operator for a ngraph engine
7 years ago
Qiao Longfei cd31b90a46 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
7 years ago
Qiao Longfei 8c516a24e5 remote min_row_size_to_use_multithread in adam interface test=develop
7 years ago
Qiao Longfei 9b4fe283e1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
7 years ago
Qiyang Min 3f687765e6
Merge pull request #15281 from velconia/fix_expand_op_compile_time
7 years ago
minqiyang c4cf5967db Change backward op infershape
7 years ago
tensor-tang 84b0ecdcce Merge remote-tracking branch 'ups/develop' into fuse/second_order_mul_sub
7 years ago
chengduo 46d01d798e
Revert "Revert "Remove workspace_handle in conv_cudnn (#15186)"" (#15290)
7 years ago
Qiao Longfei 4d15515c40 fix gru_gpu_kernel test=develop
7 years ago
tensor-tang 93e75c5ae5 refine jitcode of vsub and vsquare
7 years ago
tensor-tang d618e48309 fix fuse square mat order and refine test
7 years ago
Qiao Longfei 4feae25378 fix build problem test=develop
7 years ago
tensor-tang 38de1ff472 add fusion squared mat sub op
7 years ago
Qiao Longfei e641ffe77b change interface and api spec for dynamic_gru test=develop
7 years ago
tensor-tang 09c5786e22 add square jitkernel
7 years ago
Qiao Longfei 4c7be265d3 update avx gru grad kernel test=develop
7 years ago
Qiao Longfei 9b16e54064 update gru_grad_op
7 years ago
Qiao Longfei e477d789a1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gru-add-mode
7 years ago
tensor-tang f347d6e4a1 add repeated fc relu unit test
7 years ago
tensor-tang 99010e6eae init repeated fc relu op
7 years ago
tensor-tang 266a5d2f52 implement matmul refer and mkl kernel
7 years ago
tensor-tang c5623c87a3 init jit matmul kernel
7 years ago
Xin Pan a1bfb35dd6 try fix py2
7 years ago
Dun Liang a900015c03 add async copy and pinned place
7 years ago
colourful-tree 576c740d5d
Merge pull request #14964 from colourful-tree/data_norm
7 years ago
colourful-tree d5a8909131
Merge pull request #14950 from colourful-tree/develop
7 years ago
minqiyang bc3e0d6e01 Fix expand op compile time bug
7 years ago
chengduozh 358e657f68 Revert "Remove workspace_handle in conv_cudnn (#15186)"
7 years ago
tensor-tang fc9fbab6a0
Merge pull request #15271 from tensor-tang/fix/typo
7 years ago
chengduo 064512aa47
Remove workspace_handle in conv_cudnn (#15186)
7 years ago
tensor-tang c3a9f3c4b2 fix typo and refine
7 years ago
tensor-tang 146e942c65
Merge pull request #15250 from tensor-tang/refine/seqpool/feed
7 years ago
xiaolil1 8f17c714de Conv int8 residual (#15145)
7 years ago
xiaoli.liu@intel.com f34e779f4d Enhance key generation for INT8 test.
7 years ago
Wu Yi fd85418329
[Feature] support mix precision training for resnet (#14899)
7 years ago
tensor-tang ce909664d8 Merge remote-tracking branch 'ups/develop' into refine/seqpool/feed
7 years ago
乔龙飞 Qiao Longfei 5e74c4e88f
Merge pull request #15100 from jacquesqiao/fix-dist-sparse-decay
7 years ago
tensor-tang 8e086a8521 follow comment and fix typo
7 years ago
Qiao Longfei 653cd31971 remote unused code
7 years ago
Qiao Longfei 0a79d7a404 fix merge
7 years ago
Qiao Longfei 422449a945 fix style
7 years ago
Qiao Longfei edad60e612 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
7 years ago
nhzlx 4e3522e5b4 add trt int8 support
7 years ago
Qiao Longfei d0e3b24002 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-dist-sparse-decay
7 years ago
tensor-tang f8c305b243 Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat_2
7 years ago
tensor-tang 223c61ca5e
Merge pull request #15170 from tensor-tang/jit/seqpool
7 years ago
Qiao Longfei c3b9edf958 follow comment test=develop
7 years ago
Zeng Jinle e29f10d315
Merge pull request #15207 from sneaxiy/remove_op_handle_lock_and_fix_var
7 years ago
mozga-intel eff90eb941 PADDLE_WITH_NGRAPH was removed from the code
7 years ago
mozga-intel a42f8f4f6f Enable element_wise_add operator for a ngraph
7 years ago
mozga-intel e4184008a4 PADDLE_WITH_NGRAPH was removed from the code
7 years ago
Qiao Longfei 3ace486ebd fix sum_op selected rows test=develop
7 years ago
tensor-tang f702f8fd10 Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat
7 years ago
Qiao Longfei b16e832d4d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-dist-sparse-decay
7 years ago
sneaxiy ed409ac9f4 Revert "Revert "Remove op handle lock""
7 years ago
Tao Luo 4d9aa1745a
Merge pull request #14806 from mozga-intel/mozga-intel/scale_operator_ngraph
7 years ago
Tao Luo dc0c221426
Merge pull request #14803 from mozga-intel/mozga-intel/mean_operator_ngraph
7 years ago
Zeng Jinle dacfaaa966 Revert "Remove op handle lock"
7 years ago
Qiyang Min 317840d3ba
Merge pull request #14277 from velconia/add_fused_emb_seq_pool_op
7 years ago
tensor-tang 2dd331cc21 Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat
7 years ago
tensor-tang 316636404f add seqpool concat unit test
7 years ago
xiaolil1 c8f101e5da Conv int8 relu (#15130)
7 years ago
tensor-tang 7923d7271f add fusion seqpool concat op
7 years ago
Zeng Jinle f3a13512fc
Merge pull request #15139 from sneaxiy/remove_op_handle_lock
7 years ago
Qiao Longfei 44b300556d change min_row_size_to_use_multithread to parameter of adam
7 years ago
Qiao Longfei 87b4eb1da4 change min_param_size_to_use_multithread to min_row_size_to_use_multithread
7 years ago
minqiyang 0f94c1ac14 Polish code
7 years ago
minqiyang c09a379015 remove const_cast
7 years ago
tensor-tang 102d93712e Merge remote-tracking branch 'ups/develop' into jit/seqpool
7 years ago
tensor-tang 123b98f417 refine heigth and codesize and support all pool
7 years ago
tensor-tang 0145f40f45 use height from params of jitcode
7 years ago
tensor-tang e0591deebc enhance seqpool jitcode
7 years ago
Zeng Jinle 99e6e8b00f
Merge pull request #15179 from sneaxiy/fix_crf_grad_lod
7 years ago
minqiyang db8eb9b688 Polish code
7 years ago
minqiyang 39b98709b1 Move fused ops to fused dir
7 years ago
minqiyang 920d4a8b78 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_fused_emb_seq_pool_op
7 years ago
乔龙飞 Qiao Longfei 7c891e1ecc
Merge pull request #15111 from jacquesqiao/fix-adam-tmp-var
7 years ago
mozga-intel e77956c920 Enable mean operator for a ngraph
7 years ago
mozga-intel dd768714ab Enable scale operator for a ngraph
7 years ago
sneaxiy be425461a1 fix crf grad lod share
7 years ago
Qiao Longfei 3e1b914fcb update gru op forward kernel
7 years ago
Qiao Longfei 7a81ab8607 complete gru_unite_op and test
7 years ago
Qiao Longfei 72618c8da5 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gru-add-mode
7 years ago
Qiao Longfei 17b1b660fc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
7 years ago
Qiao Longfei c15270c5b2 optimize multi thread adam
7 years ago
乔龙飞 Qiao Longfei e1679b8847
Merge pull request #14893 from JiabinYang/feature/add_prefech_hs
7 years ago
tensor-tang 92201d3956 support avg and sqrt pool and add mkl impl
7 years ago
tensor-tang c50060bb26 add jitcode impl and use it
7 years ago
tensor-tang 142bb41748 add seqpool jitkernel test and benchmark
7 years ago
tensor-tang e58a569c6c use seqpool jitkernel
7 years ago
tensor-tang 3e01a4048f add refer seqpool jitkernel
7 years ago
xiaolil1 bbc9336878 Enable basic MKL-DNN INT8 Conv OP (#15124)
7 years ago
Qiao Longfei e10af895de update gru grad op
7 years ago
Qiao Longfei 78ec7c0f99 gru add origin mode
7 years ago
Yan Xu a1e60ab19b
Merge pull request #14791 from Yancey1989/parallel_graph_mode
7 years ago
Qiao Longfei 0e747e8d02 change the limit of thead num
7 years ago
qingqing01 c981bf0f9d
Fix compling error with cuDNN v5 (#15148)
7 years ago
wopeizl 67093da398
Merge pull request #15122 from wopeizl/windows/fixhuberloss
7 years ago
sneaxiy d0a8a1e950 remove_op_handle_lock
7 years ago
Xin Pan 087af6a686
Merge pull request #15131 from panyx0718/clean
7 years ago
Yancey1989 e65436103f Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
7 years ago
sneaxiy 6f06e6cdac Merge remote origin
7 years ago
xiaolil1 8eb1f26211 Enable INT8 pool OP (#15046)
7 years ago
Xin Pan 9186451f60 hide GetTensor
7 years ago
peizhilin dba009dbbf fix script issue
7 years ago
peizhilin cd2d60b4c8 fix build issue for density prior box op on windows test=develop
7 years ago
peizhilin 1f423f84ac fix the huber loss compile issue on windows test=develop
7 years ago
sneaxiy d25395fc98 remove tensor core lock
7 years ago
peizhilin b3688100ad fix unittest
7 years ago
peizhilin 5d8f281397 restore the memory mode
7 years ago
peizhilin 33b7821a75 fix save and load ops on windows test=develop
7 years ago
Qiao Longfei dfe85fb358 fix build
7 years ago
Qiao Longfei f057bbd1d1 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-adam-tmp-var
7 years ago
Qiao Longfei f1c973b014 adam op should not create tmp var in compute
7 years ago
Yancey1989 0a885ac12a Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
7 years ago
shippingwang 83f2e2c903 rewrite the comments, test=develop
7 years ago
gongweibao ce70229ba6
Add max_body_size flags to brpc (#15084)
7 years ago
qingqing01 6f0a1d7b47
Inception fusion operator. (#14968)
7 years ago
Qiao Longfei 25d44d40ac sum op support empty selected rows as input
7 years ago
Zeng Jinle 25b49a0896
Merge pull request #14933 from sneaxiy/rewrite_ddim
7 years ago
Wu Yi a8bc05b5ff
Refactor distributed RPC (#15075)
7 years ago
Xin Pan 3e8408429d
Merge pull request #15053 from panyx0718/imperative_hold
7 years ago
sneaxiy 73896eeb94 merge develop
7 years ago
Yancey1989 4743c9cd5d Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
7 years ago
Xin Pan f7294f8b25 register float16
7 years ago
Zhaolong Xing 4048cfa9da
Merge pull request #15048 from NHZlX/add_affine_channel_fuse
7 years ago
Zeng Jinle c0bcff00dc
Merge pull request #14962 from sneaxiy/rewrite_variable_type
7 years ago
Qiao Longfei d161215332 optimize adam multi thread
7 years ago
wopeizl 719ebe3786
Merge pull request #15070 from wopeizl/windows/testcasefix
7 years ago
Qiao Longfei 7a58ad5c79 lazy mode have higher priority then multithread
7 years ago
Xin Pan f52b514dcd call kernel
7 years ago
Xin Pan 7b6bf9ddf2 make fill_constant kernel-based
7 years ago
Xin Pan 61491ce250 clean
7 years ago
Xin Pan ce7e503cbe refactor to avoid scope.
7 years ago
Qiyang Min 0238a3bb4f
Merge pull request #14972 from velconia/accelerate_lstm
7 years ago
Houjiang Chen 242d3c71a6
Merge pull request #15031 from hjchen2/develop
7 years ago
Qiao Longfei d0572bf02e add log for lazy mode test=develop
7 years ago
Qiao Longfei 1177b0bc84 update multi thread adam
7 years ago
Qiao Longfei 3b294e2e2e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
7 years ago
Zeng Jinle 988bc2b5a7
Merge pull request #15060 from dzhwinter/fix/nccl
7 years ago
sneaxiy c4ce2e7b21 merge develop, solve conflict
7 years ago
shippingwang 9322d34032 Fix, test=develop
7 years ago
sneaxiy b56aca82e9 merge develop
7 years ago
jerrywgz ef2d292bfc
Merge pull request #14956 from jerrywgz/fix_bug_in_ifelse
7 years ago
peizhilin e49276e731 restore the huber_loss_op
7 years ago
Yancey1989 86bb583881 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
7 years ago
peizhilin 01c00b07dd fix test issues on windows
7 years ago
tangwei12 dc8eca826e
code style fix, test=develop (#15045)
7 years ago
Yancey1989 41a64f6a2a Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
7 years ago
hjchen2 956cf92145 Fix conv_elementwise_add2_act pass
7 years ago
whs 2314f2ebb3
Make topk op support variable k. (#15044)
7 years ago
shippingwang 5631fc0817 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel
7 years ago
sneaxiy dde3afe7b7 Merge develop
7 years ago
dzhwinter 3ea2f415dc fix ci error. test=develop
7 years ago
Wu Yi 856f0da0fe
Fp16 training (#14992)
7 years ago
chengduo b9fb03cf54
Move GetTensor to tensor_util (#15011)
7 years ago
Yihua Xu dbb90a76f0 Merge remote-tracking branch 'paddle/develop' into develop_641313ea7_elementwise_mul_mkldnn_bug_fix
7 years ago
nhzlx 73b47df1f4 Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_affine_channel_fuse
7 years ago
nhzlx ce3782c193 add affine_channel fuse.
7 years ago
shippingwang c2575ac72d update API.spec, test=develop
7 years ago
shippingwang ebeee9303e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel
7 years ago
Qiyang Min aba1f9b06e
Merge pull request #14891 from velconia/accelerate_adam
7 years ago
shippingwang 0a0b6f4a22 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel
7 years ago
minqiyang 8ec3d863b0 Fix throw_on_error direct call bug
7 years ago
sneaxiy 3a2afbf02e polish code
7 years ago
shippingwang 5a9ea9a73d Add ShuffleChannel Op
7 years ago
tensor-tang 05d1121b22
Merge pull request #14802 from mozga-intel/mozga-intel/fill_constant_operator_ngraph
7 years ago
tensor-tang 9d4f1d468a
Merge pull request #14804 from mozga-intel/mozga-intel/top_k_operator_ngraph
7 years ago
tensor-tang 8a6ac4dba7
Merge pull request #14973 from xiaolil1/dequantize
7 years ago
tensor-tang f0e02a65ed
Merge pull request #14974 from xiaolil1/quantize
7 years ago
shippingwang 76c6f11584 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel
7 years ago
shippingwang 16d4e13711 Add ShuffleChannelOP
7 years ago
qingqing01 51a9fca323
Async memory copy (#15013)
7 years ago
minqiyang 010f657b33 Polish code
7 years ago
JiabinYang 1a8cbb6799 test=develop, accelerate_hs_op and add prefetch with is_sparse
7 years ago
xiaoli.liu@intel.com 869d444b92 Fix comments misunderstanding
7 years ago
xiaoli.liu@intel.com d83d0f33fd extract templated function
7 years ago
Yihua Xu d4606bcb22 Fix the exception when tensor format is x
7 years ago
minqiyang 8b6b0da062 Use adam_update
7 years ago
minqiyang f4e7a47381 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_adam
7 years ago
tensor-tang 641313ea77
Merge pull request #15000 from tensor-tang/doc/en/jit
7 years ago
minqiyang e811e06555 Avoid comma in macro
7 years ago
minqiyang 0cf1461ccc Avoid comma in macro
7 years ago
wopeizl b117a5f208
Merge pull request #14931 from wopeizl/windows/mkl
7 years ago
tensor-tang 68ab16444a add eng doc of jit kernel and follow comments
7 years ago
chengduo 79bd6dfa18
[Feature] Add Temporary Allocator (#14875)
7 years ago
minqiyang e4719eb462 Fix bug in Windows VC 2010
7 years ago
peizhilin 2e35290ffa fix build issue
7 years ago
Xin Pan 484c24b756
Merge pull request #14996 from tensor-tang/fix/deps
7 years ago
tensor-tang e99e4e994d fix deps of jit benchmark
7 years ago
Tao Luo d4f384767a
Merge pull request #14987 from jczaja/prv-transpose-mkldnn-grad
7 years ago
peizhilin 201283f95c fix code style
7 years ago
peizhilin 9e60c58666 Merge remote-tracking branch 'upstream/develop' into windows/mkl
7 years ago
tensor-tang 693e5e65ce
Merge pull request #14958 from tensor-tang/refine/jit
7 years ago
minqiyang 0a4b6fc056 Remove unnessesary code
7 years ago
minqiyang 53619a79b4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_lstm
7 years ago
Zeng Jinle 95cbe07c40
Merge pull request #14836 from sneaxiy/feature/py_func
7 years ago
mozga-intel 969f669493 Static specifier was removed
7 years ago
mozga-intel 7048caf9a0 Enable top_k operator for a ngraph
7 years ago
mozga-intel 394635b2ee Static specifier was removed
7 years ago
mozga-intel ecfa68ecaa Enable fill_constant operator for a ngraph
7 years ago
Jacek Czaja d05094eec3 - Added transpose/transpose2 MKLDNN grad ops
7 years ago
Tao Luo 55af116861
Merge pull request #14966 from jczaja/prv-transpose-mkldnn-reuse-PR
7 years ago
sneaxiy c1f7e54f62 merge develop
7 years ago
Xin Pan 2537ac512e
Merge pull request #14983 from typhoonzero/revert_14850
7 years ago
chengduo a015a8a39d
Refine merge_selected_rows Doc (#14748)
7 years ago
typhoonzero da87f7a698 Revert "[Feature] Fp16 training for resnet50 (#14850)"
7 years ago
tensor-tang 1aaec571c2 fix enum style
7 years ago
Qiao Longfei d76bda50c4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
7 years ago
sneaxiy 490eb9061f polish infer shape of py_func op
7 years ago
tensor-tang facfecbdd0 follow comment: reuse time function and change to upper case
7 years ago
tensor-tang f5532877f7 Merge remote-tracking branch 'ups/develop' into refine/jit
7 years ago
Yihua Xu 3babc80160 Fix the regression issue and add the group unitest for conv2d (#14932)
7 years ago
Xin Pan a872eb90c2
Merge pull request #14959 from panyx0718/clean2
7 years ago
minqiyang 6467eda534 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_adam
7 years ago
Wu Yi 3d750f9c5a
[Feature] Fp16 training for resnet50 (#14850)
7 years ago
JiabinYang 1bec52f581 test=develop, fix cpu running error
7 years ago
minqiyang 679d1a9e0b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_lstm
7 years ago
Wang Guibao e7675dd542
Avoid calling VLOG() when program exiting (#14937)
7 years ago
heqiaozhi f18e8a7a5e remove some comments & refine doc & put template class in .h
7 years ago
sneaxiy 83ac85158a polish code
7 years ago
sneaxiy 045dc12766 merge develop
7 years ago
minqiyang 454db6662e Accelerate lstm
7 years ago
xiaoli.liu@intel.com 3146629485 Add Dequantize OP
7 years ago
xiaoli.liu@intel.com 019dbf7f5f Add Quantize OP
7 years ago
tensor-tang a369c8025b fix typo of mix impl of lstm
7 years ago
tensor-tang d651a91138 fix build on win, fix use condition of crf decoding and layer norm and
7 years ago
JiabinYang bfcb5e5235 test=develop, fix gpu compile error on prefetch, and fix hs/nce ut failed on gpu
7 years ago
tensor-tang d53c4756ad clean code and remove unused files
7 years ago
tensor-tang 95fb31285c Merge remote-tracking branch 'ups/develop' into refine/jit
7 years ago
Xin Pan cf3a07e8f8
Merge pull request #14878 from panyx0718/imperative
7 years ago
peizhilin 9f55f1ff50 use the platform api to decide the specific instruction support or not
7 years ago
tensor-tang c187a7c618 add more impls of lstm and gru and fix build on win
7 years ago
heqiaozhi 39f4e9273e data_norm
7 years ago
sneaxiy 74a8e6b032 merge develop
7 years ago
Xin Pan 1fe3ac352a move more and fix while
7 years ago
sneaxiy ae6f46a1a9 rewrite variable type
7 years ago
Jacek Czaja 709d9e3cb7 - Added reusing MKL-DNN primitives for Transpose MKL-DNN op
7 years ago
peizhilin 0b4f742e8a fix the build issue
7 years ago
peizhilin da42cf2055 fix build issue when xbyak is disabled on windows
7 years ago
tensor-tang 83d075aa79 fix lstm and gru jitcode
7 years ago
peizhilin 1cc9d59838 disable xbyak on windows
7 years ago
Xin Pan 876993887b convert more interface to avoid scope
7 years ago
tensor-tang 20392be001 Merge remote-tracking branch 'ups/develop' into refine/jit
7 years ago
tensor-tang f332f589bc add more impls of sigmoid and vtanh
7 years ago
jerrywgz dda28b0e68 fix bug in if-else op, test=develop
7 years ago
JiabinYang 4877f5d71f test=develop, fix compile error under gpu mode
7 years ago
JiabinYang 8515ee3a29 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_prefech_hs
7 years ago
JiabinYang 5ec9b37798 test=develop, fix compile error under gpu mode
7 years ago
heqiaozhi a94285869b add API
7 years ago
mozga-intel 9035bb81fe Enable mul operator for a ngraph engine (#14801)
7 years ago
tensor-tang ea259c6363 enable layer norm intrinsic code
7 years ago
gongweibao b849157e9d
Add size enforce (#14919)
7 years ago
heqiaozhi 5c7a8aee07 merge upstream to my develop
7 years ago
Jacek Czaja aa6e9c30be [MKL-DNN ]Added transpose/transpose2 Op (#14872)
7 years ago
heqiaozhi 4f6e9e3ac3 teacher student sigmoid loss
7 years ago
peizhilin 07c7eaabb4 Merge remote-tracking branch 'upstream/develop' into windows/mkl
7 years ago
wopeizl 6c66b3d496
Merge pull request #14943 from wopeizl/windows/ctc
7 years ago
Xin Pan dfcf746ea1
Merge pull request #14904 from panyx0718/clean2
7 years ago
tensor-tang b1516783ea enable crf decoding intrinsic code
7 years ago
tensor-tang 4cc7707d28 add crf_decoding and layer norm intrisic code
7 years ago
tensor-tang 10c340c9a3 fix confilcts
7 years ago
tensor-tang 893957f711 Merge remote-tracking branch 'ups/develop' into refine/jit
7 years ago
tensor-tang 6648995f53 fix build
7 years ago
JiabinYang 3b7b2e1ded Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_prefech_hs
7 years ago
Xin Pan 6324032602 MLP forward backward
7 years ago
peizhilin 19ebd8b4cf add ctc support for windows
7 years ago
Xin Pan c89a1fb287
Merge pull request #14879 from panyx0718/clean
7 years ago
Qiao Longfei 3f3a84b6dc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
7 years ago
Qiao Longfei e2d56561e7
Merge pull request #14889 from jacquesqiao/optimize-adam
7 years ago
sneaxiy a500dfa579 rewrite ddim
7 years ago
JiabinYang b5fa916413 fix bug after merge reyoung optimization, test=develop
7 years ago
sneaxiy dc8847af87 add examples and comments
7 years ago
peizhilin fa135bbf52 Fix the mkl build script on windows
7 years ago
Xin Pan 70981f5d79 clean
7 years ago
Qiao Longfei e0df9f2346 merge lazy mode
7 years ago
Yu Yang 2803cf5776
Merge pull request #14868 from reyoung/feature/refine_w2v
7 years ago
Zhaolong Xing a9fb34fad8
Merge pull request #14903 from NHZlX/add_conv_elementwise_pass
7 years ago
peizhilin b601f2de8d include the mkl fix only
7 years ago
Qiyang Min fd1d2c897e
Merge pull request #14894 from velconia/add_huber_regression_loss_op
7 years ago
peizhilin 5a6d7fe2ff add mkl,ctc support for windows
7 years ago
wopeizl 0f085f0a5a
Merge pull request #14892 from wopeizl/windows/port3
7 years ago
JiabinYang 656040c726 merge reyoung optimization
7 years ago
Qiao Longfei 8936c7913b add log test=develop
7 years ago
Xin Pan eaf8ba35b5 change input
7 years ago
Xin Pan 840e6729e2 inject context
7 years ago
Qiao Longfei 59cf96ec18 add log
7 years ago
wopeizl fa78fc60be
Merge pull request #14907 from wopeizl/windows/avx
7 years ago