Zeng Jinle
c0bcff00dc
Merge pull request #14962 from sneaxiy/rewrite_variable_type
...
Rewrite variable type
7 years ago
Qiao Longfei
d161215332
optimize adam multi thread
7 years ago
wopeizl
719ebe3786
Merge pull request #15070 from wopeizl/windows/testcasefix
...
fix test issues on windows
7 years ago
Qiao Longfei
7a58ad5c79
lazy mode have higher priority then multithread
...
test=develop
7 years ago
Xin Pan
f52b514dcd
call kernel
7 years ago
Xin Pan
7b6bf9ddf2
make fill_constant kernel-based
...
test=develop
7 years ago
Xin Pan
61491ce250
clean
...
test=develop
7 years ago
Xin Pan
ce7e503cbe
refactor to avoid scope.
...
test=develop
7 years ago
Qiyang Min
0238a3bb4f
Merge pull request #14972 from velconia/accelerate_lstm
...
Accelerate PADDLE_ENFORCE
7 years ago
Houjiang Chen
242d3c71a6
Merge pull request #15031 from hjchen2/develop
...
Fix conv_elementwise_add2_act pass
7 years ago
Qiao Longfei
d0572bf02e
add log for lazy mode test=develop
7 years ago
Qiao Longfei
1177b0bc84
update multi thread adam
7 years ago
Qiao Longfei
3b294e2e2e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
7 years ago
Zeng Jinle
988bc2b5a7
Merge pull request #15060 from dzhwinter/fix/nccl
...
fix ci error. test=develop
7 years ago
sneaxiy
c4ce2e7b21
merge develop, solve conflict
...
test=develop
7 years ago
shippingwang
9322d34032
Fix, test=develop
7 years ago
sneaxiy
b56aca82e9
merge develop
...
test=develop
7 years ago
jerrywgz
ef2d292bfc
Merge pull request #14956 from jerrywgz/fix_bug_in_ifelse
...
fix bug in if-else op
7 years ago
peizhilin
e49276e731
restore the huber_loss_op
...
test=develop
7 years ago
Yancey1989
86bb583881
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
7 years ago
peizhilin
01c00b07dd
fix test issues on windows
...
test=develop
7 years ago
tangwei12
dc8eca826e
code style fix, test=develop ( #15045 )
...
* code style fix, test=develop
7 years ago
Yancey1989
41a64f6a2a
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
7 years ago
hjchen2
956cf92145
Fix conv_elementwise_add2_act pass
...
test=develop
7 years ago
whs
2314f2ebb3
Make topk op support variable k. ( #15044 )
...
* Make topk op support variable k.
test=develop
* Fix tensor type.
test=develop
7 years ago
shippingwang
5631fc0817
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel
7 years ago
sneaxiy
dde3afe7b7
Merge develop
...
test=develop
7 years ago
dzhwinter
3ea2f415dc
fix ci error. test=develop
7 years ago
Wu Yi
856f0da0fe
Fp16 training ( #14992 )
...
* wip
* wip
* wip
* wip for test
* add fp16 tests test=develop
* fix cpu build test=develop
* fix test=develop
* fix py3 tests test=develop
* fix lr_scheduler dtype test=develop
* fix test=dvelop
* test fix ci compile test=develop
* fix build and merge test=develop
* fallback momentumop change to general test=develop
* make fp16 lr schedule simple test=develop
* fix ut test=develop
* fix tests test=develop
* remove fp16 learning rate cast test=develop
7 years ago
chengduo
b9fb03cf54
Move GetTensor to tensor_util ( #15011 )
...
* refine tensor
test=develop
* refine tensor
test=develop
* fix device_context log
test=develop
7 years ago
Yihua Xu
dbb90a76f0
Merge remote-tracking branch 'paddle/develop' into develop_641313ea7_elementwise_mul_mkldnn_bug_fix
7 years ago
nhzlx
73b47df1f4
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_affine_channel_fuse
...
test=develop
7 years ago
nhzlx
ce3782c193
add affine_channel fuse.
...
fix conv+elemenwise fuse bug.
7 years ago
shippingwang
c2575ac72d
update API.spec, test=develop
7 years ago
shippingwang
ebeee9303e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel
7 years ago
Qiyang Min
aba1f9b06e
Merge pull request #14891 from velconia/accelerate_adam
...
Remove BinarySearch from Adam Op (CPU part)
7 years ago
shippingwang
0a0b6f4a22
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel
7 years ago
minqiyang
8ec3d863b0
Fix throw_on_error direct call bug
...
test=develop
7 years ago
sneaxiy
3a2afbf02e
polish code
...
test=develop
7 years ago
shippingwang
5a9ea9a73d
Add ShuffleChannel Op
7 years ago
tensor-tang
05d1121b22
Merge pull request #14802 from mozga-intel/mozga-intel/fill_constant_operator_ngraph
...
Enable fill_constant operator for a ngraph engine
7 years ago
tensor-tang
9d4f1d468a
Merge pull request #14804 from mozga-intel/mozga-intel/top_k_operator_ngraph
...
Enable top_k operator for a ngraph engine
7 years ago
tensor-tang
8a6ac4dba7
Merge pull request #14973 from xiaolil1/dequantize
...
Add Dequantize OP
7 years ago
tensor-tang
f0e02a65ed
Merge pull request #14974 from xiaolil1/quantize
...
Add Quantize OP
7 years ago
shippingwang
76c6f11584
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel
7 years ago
shippingwang
16d4e13711
Add ShuffleChannelOP
7 years ago
qingqing01
51a9fca323
Async memory copy ( #15013 )
7 years ago
minqiyang
010f657b33
Polish code
...
test=develop
7 years ago
JiabinYang
1a8cbb6799
test=develop, accelerate_hs_op and add prefetch with is_sparse
7 years ago
xiaoli.liu@intel.com
869d444b92
Fix comments misunderstanding
...
test=develop
7 years ago
xiaoli.liu@intel.com
d83d0f33fd
extract templated function
...
test=develop
7 years ago
Yihua Xu
d4606bcb22
Fix the exception when tensor format is x
...
test=develop
7 years ago
minqiyang
8b6b0da062
Use adam_update
...
test=develop
7 years ago
minqiyang
f4e7a47381
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_adam
...
test=develop
7 years ago
tensor-tang
641313ea77
Merge pull request #15000 from tensor-tang/doc/en/jit
...
add eng doc of jit kernel and follow comments
7 years ago
minqiyang
e811e06555
Avoid comma in macro
...
test=develop
7 years ago
minqiyang
0cf1461ccc
Avoid comma in macro
...
test=develop
7 years ago
wopeizl
b117a5f208
Merge pull request #14931 from wopeizl/windows/mkl
...
add mkl support for windows
7 years ago
tensor-tang
68ab16444a
add eng doc of jit kernel and follow comments
...
test=develop
7 years ago
chengduo
79bd6dfa18
[Feature] Add Temporary Allocator ( #14875 )
...
* Add Temporal Allocator
* add Temporay Allocator to DeviceContext
test=develop
* code refine
test=develop
* fix mean_iou
test=develop
* Add DeviceTemporaryAllocator
test=develop
* fix conv_op bug
test=develop
* small fix
test=develop
* code refine
test=develop
* log refine
test=develop
* fix unit test
test=develop
* move double check
* refine concat_and_split
test=develop
* add limit_of_temporary_allocation
test=develop
* fix name
test=develop
7 years ago
minqiyang
e4719eb462
Fix bug in Windows VC 2010
...
test=develop
7 years ago
peizhilin
2e35290ffa
fix build issue
...
test=develop
7 years ago
Xin Pan
484c24b756
Merge pull request #14996 from tensor-tang/fix/deps
...
fix deps of jit benchmark
7 years ago
tensor-tang
e99e4e994d
fix deps of jit benchmark
...
test=develop
7 years ago
Tao Luo
d4f384767a
Merge pull request #14987 from jczaja/prv-transpose-mkldnn-grad
...
[MKL-DNN] Added transpose/transpose2 MKLDNN grad ops
7 years ago
peizhilin
201283f95c
fix code style
...
test=develop
7 years ago
peizhilin
9e60c58666
Merge remote-tracking branch 'upstream/develop' into windows/mkl
...
test=develop
7 years ago
tensor-tang
693e5e65ce
Merge pull request #14958 from tensor-tang/refine/jit
...
enhance jit
7 years ago
minqiyang
0a4b6fc056
Remove unnessesary code
...
test=develop
7 years ago
minqiyang
53619a79b4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_lstm
7 years ago
Zeng Jinle
95cbe07c40
Merge pull request #14836 from sneaxiy/feature/py_func
...
Featue/py_func op
7 years ago
mozga-intel
969f669493
Static specifier was removed
...
test=develop
7 years ago
mozga-intel
7048caf9a0
Enable top_k operator for a ngraph
...
test=develop
7 years ago
mozga-intel
394635b2ee
Static specifier was removed
...
test=develop
7 years ago
mozga-intel
ecfa68ecaa
Enable fill_constant operator for a ngraph
...
test=develop
7 years ago
Jacek Czaja
d05094eec3
- Added transpose/transpose2 MKLDNN grad ops
...
test=develop
7 years ago
Tao Luo
55af116861
Merge pull request #14966 from jczaja/prv-transpose-mkldnn-reuse-PR
...
[MKL-DNN] Extending Transpose Op to reuse MKL-dNN primitives
7 years ago
sneaxiy
c1f7e54f62
merge develop
...
test=develop
7 years ago
Xin Pan
2537ac512e
Merge pull request #14983 from typhoonzero/revert_14850
...
Revert "[Feature] Fp16 training for resnet50 (#14850 )"
7 years ago
chengduo
a015a8a39d
Refine merge_selected_rows Doc ( #14748 )
...
* add doc for MergeSelectedRows
test=develop
* checkout selected_rows
test=develop
7 years ago
typhoonzero
da87f7a698
Revert "[Feature] Fp16 training for resnet50 ( #14850 )"
...
This reverts commit 3d750f9c5a .
7 years ago
tensor-tang
1aaec571c2
fix enum style
...
test=develop
7 years ago
Qiao Longfei
d76bda50c4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
...
test=develop
7 years ago
sneaxiy
490eb9061f
polish infer shape of py_func op
...
test=develop
7 years ago
tensor-tang
facfecbdd0
follow comment: reuse time function and change to upper case
...
test=develop
7 years ago
tensor-tang
f5532877f7
Merge remote-tracking branch 'ups/develop' into refine/jit
7 years ago
Yihua Xu
3babc80160
Fix the regression issue and add the group unitest for conv2d ( #14932 )
...
* Add test items for mkldnn conv2d
* Fix the regression issue and pass the unit test for conv2d and conv3d
test=develop
7 years ago
Xin Pan
a872eb90c2
Merge pull request #14959 from panyx0718/clean2
...
Further op RunImpl refactor
7 years ago
minqiyang
6467eda534
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_adam
...
test=develop
7 years ago
Wu Yi
3d750f9c5a
[Feature] Fp16 training for resnet50 ( #14850 )
...
* wip
* wip
* wip
* wip for test
* add fp16 tests test=develop
* fix cpu build test=develop
* fix test=develop
* fix py3 tests test=develop
* fix lr_scheduler dtype test=develop
* fix test=dvelop
* test fix ci compile test=develop
* fix build and merge test=develop
* fallback momentumop change to general test=develop
7 years ago
JiabinYang
1bec52f581
test=develop, fix cpu running error
7 years ago
minqiyang
679d1a9e0b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_lstm
7 years ago
Wang Guibao
e7675dd542
Avoid calling VLOG() when program exiting ( #14937 )
...
* Avoid calling VLOG() when program exiting
test=develop
* Avoid calling VLOG() when program exiting
test=develop
7 years ago
heqiaozhi
f18e8a7a5e
remove some comments & refine doc & put template class in .h
...
test=develop
7 years ago
sneaxiy
83ac85158a
polish code
...
test=develop
7 years ago
sneaxiy
045dc12766
merge develop
...
test=develop
7 years ago
minqiyang
454db6662e
Accelerate lstm
7 years ago
xiaoli.liu@intel.com
3146629485
Add Dequantize OP
...
test=develop
7 years ago
xiaoli.liu@intel.com
019dbf7f5f
Add Quantize OP
...
test=develop
7 years ago
tensor-tang
a369c8025b
fix typo of mix impl of lstm
...
test=develop
7 years ago
tensor-tang
d651a91138
fix build on win, fix use condition of crf decoding and layer norm and
...
enhance test precision
test=develop
7 years ago
JiabinYang
bfcb5e5235
test=develop, fix gpu compile error on prefetch, and fix hs/nce ut failed on gpu
7 years ago
tensor-tang
d53c4756ad
clean code and remove unused files
...
test=develop
7 years ago
tensor-tang
95fb31285c
Merge remote-tracking branch 'ups/develop' into refine/jit
7 years ago
Xin Pan
cf3a07e8f8
Merge pull request #14878 from panyx0718/imperative
...
MLP forward backward
7 years ago
peizhilin
9f55f1ff50
use the platform api to decide the specific instruction support or not
...
test=develop
7 years ago
tensor-tang
c187a7c618
add more impls of lstm and gru and fix build on win
...
test=develop
7 years ago
heqiaozhi
39f4e9273e
data_norm
...
test=develop
7 years ago
sneaxiy
74a8e6b032
merge develop
...
fix conflict
test=develop
7 years ago
Xin Pan
1fe3ac352a
move more and fix while
...
test=develop
7 years ago
sneaxiy
ae6f46a1a9
rewrite variable type
...
test=develop
7 years ago
Jacek Czaja
709d9e3cb7
- Added reusing MKL-DNN primitives for Transpose MKL-DNN op
...
test=develop
7 years ago
peizhilin
0b4f742e8a
fix the build issue
...
test=develop
7 years ago
peizhilin
da42cf2055
fix build issue when xbyak is disabled on windows
...
test=develop
7 years ago
tensor-tang
83d075aa79
fix lstm and gru jitcode
...
test=develop
7 years ago
peizhilin
1cc9d59838
disable xbyak on windows
...
test=develop
7 years ago
Xin Pan
876993887b
convert more interface to avoid scope
...
test=develop
7 years ago
tensor-tang
20392be001
Merge remote-tracking branch 'ups/develop' into refine/jit
...
fix conflicts
test=develop
7 years ago
tensor-tang
f332f589bc
add more impls of sigmoid and vtanh
7 years ago
jerrywgz
dda28b0e68
fix bug in if-else op, test=develop
7 years ago
JiabinYang
4877f5d71f
test=develop, fix compile error under gpu mode
7 years ago
JiabinYang
8515ee3a29
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_prefech_hs
7 years ago
JiabinYang
5ec9b37798
test=develop, fix compile error under gpu mode
7 years ago
heqiaozhi
a94285869b
add API
...
test=develop
7 years ago
mozga-intel
9035bb81fe
Enable mul operator for a ngraph engine ( #14801 )
...
* Enable mul operator for a ngraph
test=develop
* Enable activation ops test
test=develop
* Remove unused line
test=develop
7 years ago
tensor-tang
ea259c6363
enable layer norm intrinsic code
7 years ago
gongweibao
b849157e9d
Add size enforce ( #14919 )
7 years ago
heqiaozhi
5c7a8aee07
merge upstream to my develop
...
test=develop
Merge remote-tracking branch 'upstream/develop' into develop
7 years ago
Jacek Czaja
aa6e9c30be
[MKL-DNN ]Added transpose/transpose2 Op ( #14872 )
...
* - Added transpose MKLDNN Op
- Few basic UT works
- Added 1D transpose
- implementing generic mem desc for MKLDNN transpose
- Modified trnaspose op to support more dimensional data eg. 5,6..10
- Added is_test attribute to transpose op
test=develop
* - Added support for MKLDNN::memory::format::any for Transpose MKLDNN op
test=develop
* - Additional transpose mkldnn op correction to mkldnn layout
test=develop
* Cosmetic fixes
test=develop
* - Removed const_cast to obey coding standard
test=develop
7 years ago
heqiaozhi
4f6e9e3ac3
teacher student sigmoid loss
7 years ago
peizhilin
07c7eaabb4
Merge remote-tracking branch 'upstream/develop' into windows/mkl
...
test=develop
7 years ago
wopeizl
6c66b3d496
Merge pull request #14943 from wopeizl/windows/ctc
...
add ctc support for windows
7 years ago
Xin Pan
dfcf746ea1
Merge pull request #14904 from panyx0718/clean2
...
refactor RunImpl
7 years ago
tensor-tang
b1516783ea
enable crf decoding intrinsic code
7 years ago
tensor-tang
4cc7707d28
add crf_decoding and layer norm intrisic code
7 years ago
tensor-tang
10c340c9a3
fix confilcts
7 years ago
tensor-tang
893957f711
Merge remote-tracking branch 'ups/develop' into refine/jit
7 years ago
tensor-tang
6648995f53
fix build
7 years ago
JiabinYang
3b7b2e1ded
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_prefech_hs
7 years ago
Xin Pan
6324032602
MLP forward backward
...
test=develop
7 years ago
peizhilin
19ebd8b4cf
add ctc support for windows
7 years ago
Xin Pan
c89a1fb287
Merge pull request #14879 from panyx0718/clean
...
clean parallel do
7 years ago
Qiao Longfei
3f3a84b6dc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
...
test=develop
7 years ago
Qiao Longfei
e2d56561e7
Merge pull request #14889 from jacquesqiao/optimize-adam
...
adam optimizer support lazy mode
7 years ago
sneaxiy
a500dfa579
rewrite ddim
...
test=develop
7 years ago
JiabinYang
b5fa916413
fix bug after merge reyoung optimization, test=develop
7 years ago
sneaxiy
dc8847af87
add examples and comments
...
test=develop
7 years ago
peizhilin
fa135bbf52
Fix the mkl build script on windows
...
test=develop
7 years ago
Xin Pan
70981f5d79
clean
...
test=develop
7 years ago
Qiao Longfei
e0df9f2346
merge lazy mode
7 years ago
Yu Yang
2803cf5776
Merge pull request #14868 from reyoung/feature/refine_w2v
...
Feature/refine w2v
7 years ago
Zhaolong Xing
a9fb34fad8
Merge pull request #14903 from NHZlX/add_conv_elementwise_pass
...
Add conv + elementwiseAdd pass
7 years ago
peizhilin
b601f2de8d
include the mkl fix only
...
test=develop
7 years ago
Qiyang Min
fd1d2c897e
Merge pull request #14894 from velconia/add_huber_regression_loss_op
...
Add python interface for huber loss
7 years ago
peizhilin
5a6d7fe2ff
add mkl,ctc support for windows
7 years ago
wopeizl
0f085f0a5a
Merge pull request #14892 from wopeizl/windows/port3
...
fix script issue
7 years ago
JiabinYang
656040c726
merge reyoung optimization
7 years ago
Qiao Longfei
8936c7913b
add log test=develop
7 years ago
Xin Pan
eaf8ba35b5
change input
...
test=develop
7 years ago
Xin Pan
840e6729e2
inject context
...
test=develop
7 years ago
Qiao Longfei
59cf96ec18
add log
7 years ago
wopeizl
fa78fc60be
Merge pull request #14907 from wopeizl/windows/avx
...
add avx support for windows
7 years ago
Qiao Longfei
fe3995d335
refine code test=develop
7 years ago
tensor-tang
74292f414c
enable eltwise nchw16c mul nc
7 years ago
Qiao Longfei
fd152289fa
clean for range in test=develop
7 years ago
nhzlx
050a68dde3
fix comments
...
test=develop
7 years ago
shippingwang
7f73c16e42
Add
7 years ago
shippingwang
2dd55b873f
Add shuffle_channel_op
7 years ago
tensor-tang
720b55cbcf
enable crf decoding and layer norm refer code
7 years ago
tensor-tang
64a90b2f1c
use vadd, vaddrelu, lstm and gru jitkernel
7 years ago
Qiao Longfei
1141db8114
update test_adam_op
...
test=develop
7 years ago
gongweibao
addded48e1
test=develop ( #14898 )
7 years ago
Qiao Longfei
3bd54ed769
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
7 years ago
Qiao Longfei
96604fda10
fix gpu data
...
test=develop
7 years ago
nhzlx
fcc93d96d5
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_conv_elementwise_pass
...
fix conflicts
test=develop
7 years ago
minqiyang
24eb8f038c
Fix bug
...
test=develop
7 years ago
Yu Yang
740e1626ce
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_w2v
...
test=develop
7 years ago
Yancey1989
a760a550b0
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
7 years ago
minqiyang
bd0067b26c
Polish code
...
test=develop
7 years ago
Yu Yang
bacf1d2399
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
7 years ago
Qiao Longfei
238b24bfa2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-adam
7 years ago
peizhilin
01dd9061a0
add avx support for windows
...
test=develop
7 years ago
Qiao Longfei
fcde2b2725
add ForRangeIn
7 years ago
tensor-tang
3713d08d40
enable jitcode gru
7 years ago
tensor-tang
7c1f3ad6eb
enable jitcode lstm
7 years ago
Xin Pan
363bf8a4d8
Merge pull request #14897 from panyx0718/clean2
...
In most times, const_cast is bad and break interface contract and
7 years ago
nhzlx
388953027e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_conv_elementwise_pass
...
test=develop
7 years ago
nhzlx
514648665a
fix trt_op test test=develop
7 years ago
Kaipeng Deng
dc76e4b0f1
Merge pull request #14701 from heavengate/adaptive_pool
...
add adaptive pool2d and pool3d
7 years ago
tensor-tang
80766bcb82
enable act jitcode vexp, vrelu, vsigmoid and vtanh
7 years ago
nhzlx
050e118f3c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_trt_thread_bug
...
test=develop
7 years ago
nhzlx
96216052d5
1. fix trt multi thread bug
7 years ago
gongweibao
0b1c7d838c
Add brpc serialization support. ( #11430 )
7 years ago
tensor-tang
fd0a954fbf
enable blas jitcode vmul, vadd, vaddrelu, vscal and vaddbias
7 years ago
Yan Chunwei
a985949be9
Fea/fuse conv elementwise add fuse ( #14669 )
7 years ago
tensor-tang
5e97be7ba7
enable jitkernel mkl vexp, vsigmoid and vtanh
7 years ago
minqiyang
5fea8cd478
Add sorted_result parameter to SelectedRows Functor
...
test=develop
7 years ago
tensor-tang
ae17926987
enable jitkernel mkl vmul, vadd and vscal
7 years ago
tensor-tang
77907a3502
refine benchmark template
7 years ago
Xin Pan
e90b2f104c
In most times, const_cast is bad and break interface contract and
...
make the code unreadable and make the program unstable.
test=develop
7 years ago
Yancey1989
4a4ccac1d0
update by comment test=develop
7 years ago
tensor-tang
8e785fec8d
clean code and refine tests template
7 years ago
minqiyang
65d355a72c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_huber_regression_loss_op
...
test=develop
7 years ago
minqiyang
c550e0ce06
Add python interface for huber regression loss
...
test=develop
7 years ago
peizhilin
23dec78772
fix script issue
...
test=develop
7 years ago
minqiyang
da796dfe05
Remove BinarySearch from Adam Op
...
test=develop
7 years ago
Yu Yang
b17444c84c
Fix merge bug
...
test=develop
7 years ago
Qiao Longfei
c624417c6f
change sparse mode to lazy mode
7 years ago
Qiao Longfei
4035e4bab2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-adam
7 years ago
Qiao Longfei
fac8702269
adam support multithread
7 years ago
tensor-tang
00d3afbcc9
add gru refer functions, test and benchmark
7 years ago
Qiao Longfei
3dc29b3905
change sparse_update to adam_update
7 years ago
tensor-tang
6eec461725
add lstm peephole benchmark
7 years ago
tensor-tang
bf9302f950
add lstm, peephole refer and test
7 years ago
sneaxiy
f0df62f136
add more unittest case
...
test=develop
7 years ago
Qiao Longfei
fc6ec6bd14
add sparse mode adam
7 years ago
Yancey1989
c722b1dcb6
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
...
test=develop
7 years ago
Yu Yang
4ecdb6f486
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
...
test=develop
7 years ago
Xin Pan
47ea2534fb
clean parallel do
...
test=develop
7 years ago
sneaxiy
f6741df462
merge develop
...
fix bug
test=develop
7 years ago
Yu Yang
7b10bf0e60
Use mkl
7 years ago
Zeng Jinle
1b564bc49a
Merge pull request #14670 from sneaxiy/refactor_eager_deletion
...
Rewrite eager deletion
7 years ago
SunGaofeng
e3c4b0dace
this is for psroi_pool op, test=develop ( #14796 )
...
* Add psroi_pool operator.
7 years ago
tensor-tang
bf951fa737
add refer vrelu, videntity, vexp, vsigmoid, vtanh and test and benchmark
7 years ago
Yu Yang
15550a2753
Polish code
7 years ago
sneaxiy
deb0d41cea
fix cmake
...
fix cmake again
test=develop
7 years ago
Yancey1989
23eb8c4299
fix ci test=develop
7 years ago
Yu Yang
9e0b33d7ad
Merge branch 'feature/tensor_type' into feature/refine_w2v
7 years ago
JiabinYang
50fce87905
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_prefech_hs
7 years ago
Yu Yang
194e66f785
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
7 years ago
tensor-tang
e9216e82f9
add refer vscal, vaddbias and test and benchmark
7 years ago
sneaxiy
db2daefe50
merge develop
...
test=develop
7 years ago
sneaxiy
8b9d33fa1e
add unittest and fix bug
...
add API.spec
test=develop
7 years ago
tensor-tang
a37038880e
fix unit test with double type
7 years ago
tensor-tang
417d031f90
add refer vadd, vaddrelu, vsub and tests and benchmark
7 years ago
Yancey1989
106e285236
add unittest for parllelgraph mode test=develop
7 years ago
JiabinYang
c2e851f7b2
test=develop, remove sparse bias and add prefetch and related tests
7 years ago
Yu Yang
be11375661
Refine code
7 years ago
Yu Yang
8d9401152e
Refine w2v
7 years ago
Tao Luo
66b6e473d0
Merge pull request #14732 from Sand3r-/mgallus/mkldnn-concat
...
[MKL-DNN] Concat Layer
7 years ago
sneaxiy
0c554a59fa
merge develop
...
test=develop
7 years ago
tensor-tang
f3250097bc
fix bug and mac compile
7 years ago
tensor-tang
bc0df6a948
make typename tuples
7 years ago
tensor-tang
194ce2e92c
add benchmark
7 years ago
Yibing Liu
6951ef9a55
Fix the gelu backward to avoid nan ( #14857 )
...
* Fix the gelu backward to avoid nan
test=develop
* Remove unnecessary calls
test=develop
7 years ago
Yu Yang
c00e07cda0
Fix distribute compile
...
test=develop
7 years ago
Qiao Longfei
3668f07965
Merge pull request #14844 from jacquesqiao/pserver-should-crash
...
pserver should crash early whe has problem
7 years ago
sneaxiy
ca84c2ca8f
merge develop
...
test=develop
7 years ago
sneaxiy
e240ba2918
implement backward
...
test=develop
7 years ago
sneaxiy
06f8aa5b97
remove while_op support temporarily
...
test=develop
7 years ago
Yu Yang
81520a24cf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_eigen_tensor
7 years ago
Yu Yang
9bd70a1e04
Change tensor uses proto::VarType::type
...
test=develop
7 years ago
Yu Yang
eeca721a99
Merge pull request #14842 from reyoung/feature/refine_eigen_tensor
...
Fix Eigen macro when using GPU
7 years ago
tensor-tang
d538513fce
fix the compile error on mac
7 years ago
tensor-tang
28eb7d840c
test all impls and all inplace cases
7 years ago
Yihua Xu
acc6ae49b1
Fix the issue to run on AVX2 and AVX512F machines ( #14851 )
...
test=develop
7 years ago
Michal Gallus
92daace55c
MKL-DNN Concat: Fix segfault related to referencing deleter memory primitive
...
test=develop
7 years ago
tensor-tang
d4cab7d948
use jitkernel in one file
7 years ago
tensor-tang
adc7ba2edd
Merge remote-tracking branch 'ups/develop' into refine/jit
7 years ago
tensor-tang
900c789a35
use jitcode and use vmul
7 years ago
Qiao Longfei
1870262ba9
pserver should crash early whe has problem
...
test=develop
7 years ago
dengkaipeng
a81fabd327
fix doc errors. test=develop
7 years ago
dengkaipeng
cf06e50f1d
add doc for adaptive pool. test=develop
7 years ago
dengkaipeng
266c6856c9
add adaptive pool 2d & 3d. test=develop
7 years ago
dengkaipeng
eab4745965
add adaptive mode for pool.
7 years ago
Yu Yang
7604b1ad51
Fix Eigen macro when using GPU
...
The macro should be defined by compiler rather than by source.
test=develop
7 years ago
Qiao Longfei
1213e2838f
Merge pull request #14820 from jacquesqiao/fix-split-selected-rows
...
split selected rows op should always init output selected rows
7 years ago
JiabinYang
c35fdf1581
Merge branch 'add_prefetch_in_nce' of https://github.com/seiriosPlus/Paddle into feature/add_prefech_hs
7 years ago
sneaxiy
7923042365
merge develop
...
test=develop
7 years ago
tangwei12
59cbf06e2e
fix numel nce and prefetch
...
test=develop
7 years ago
sneaxiy
8760d23c7d
featue/py_func
7 years ago
tangwei12
33a004a779
fix numel nce and prefetch
7 years ago
zhang wenhui
c4c5f0b8ca
Merge pull request #14771 from frankwhzhang/bpr
...
add bpr_loss operator
7 years ago
Yancey1989
79082c9459
fix pyreader failed
7 years ago
tensor-tang
53709e7e61
refine names
7 years ago
Qiao Longfei
abf140289f
split selected rows op should always init output selected rows
...
test=develop
7 years ago
Yancey1989
2dda19f756
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
7 years ago
frankwhzhang
c9a653820b
fix label_pos ,add test_layers.py, test=develop
7 years ago
tangwei12
57557f6774
fix scope in nce and prefetch
7 years ago
frankwhzhang
a672b291e5
fix code style, test=develop
7 years ago
frankwhzhang
ea95f9c335
fix style bug, test=develop
7 years ago
tangwei12
bb2e7f0bbe
add scope in prefetch
7 years ago
gongweibao
f1fb64b17f
Add reduce sparse tensor feature. ( #14757 )
7 years ago
tangwei12
527946df49
add scope in prefetch
7 years ago
Yancey1989
73edf13767
update
7 years ago
Yancey1989
220db4f334
clean code
7 years ago
frankwhzhang
f4cc5881b0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into bpr
7 years ago
frankwhzhang
97de98cd0a
update bpr_loss op code, test=develop
7 years ago
Yihua Xu
3821fc3950
Merge branch 'develop' into develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
...
test=develop
7 years ago
Yihua Xu
240d974ac5
Clean Code
...
test=develop
7 years ago
Tao Luo
54fcafb5f6
Merge pull request #14707 from yihuaxu/develop_4f71a6ee2_conv3d_mkldnn_opt
...
Implement conv3d with mkldnn library
7 years ago
tangwei12
b653ed0516
add prefetch and remvoe selectedrows of bias
7 years ago
sneaxiy
387bac46b5
refine code
...
test=develop
7 years ago
Yihua Xu
155328a488
Clean Code
...
test=develop
7 years ago
Tao Luo
743cb840f1
update with comments
...
test=develop
7 years ago
Yancey1989
c9de6f1b05
init parallel graph mode
7 years ago
tensor-tang
ce674b685f
add readme doc and complete TODOs
7 years ago
tangwei12
7fa2e821e4
add local scope in nce
7 years ago
Tao Luo
42359e88a4
clean code
...
test=develop
7 years ago
Tao Luo
923b18877e
Merge branch 'develop' into memory_load
...
test=develop
7 years ago
Tao Luo
405b2486db
support loading from memory
...
test=develop
7 years ago
tangwei12
627a6b8bac
add prefetch in nce
7 years ago
frankwhzhang
272f3d3111
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into bpr
7 years ago
tangwei12
4cb0100c8e
add prefetch in nce
7 years ago
frankwhzhang
570d89ec84
add bpr_loss operator , test=develop
7 years ago
Qiao Longfei
05208e1f2b
optimize code
...
test=develop
7 years ago
qingqing01
549f165b59
Speed conv_fusion_op for identity activation. ( #14744 )
...
* Refine conv_fusion_op for identity activation.
* Fix unit testing.
test=develop
7 years ago
tensor-tang
fab0ee8757
Merge remote-tracking branch 'ups/develop' into refine/jitkernel
7 years ago
Houjiang Chen
c6b39a0099
Merge pull request #14714 from NHZlX/add_prelu_gpu
...
add prelu cuda kernel for inference.
7 years ago
tensor-tang
dbe451976b
Merge pull request #14753 from tensor-tang/refine/namespace
...
remove jit namespace
7 years ago
Jiabin Yang
d9bb55a1f9
Merge pull request #14756 from JiabinYang/fix_hs_op
...
fix bug in dist train on hs, test=develop
7 years ago
Yihua Xu
65dbc7cca4
Merge branch 'develop' into develop_4f71a6ee2_conv3d_mkldnn_opt
7 years ago
JiabinYang
e05e1d7d88
fix bug in dist train on hs, test=develop
7 years ago
tensor-tang
a1eb21e704
refine names
7 years ago
tensor-tang
b523787f9f
remove jit namespace
...
test=develop
7 years ago
tensor-tang
191948c933
enable jitcode
7 years ago
tensor-tang
4a93db9288
remove jit namespace
...
test=develop
7 years ago
Hongyu Liu
8cda28f345
Merge pull request #14733 from phlrain/add_cudnn_5_support
...
Add cudnn 5 support
7 years ago
Xin Pan
73b4d1aa72
Merge pull request #14742 from panyx0718/infer2
...
support customized kernel selection
7 years ago
Qiao Longfei
9af76ade4c
fix unused var
7 years ago
Jiabin Yang
21c0f8749e
Merge pull request #14728 from JiabinYang/optimize_hs_op
...
Optimize hs op
7 years ago
tensor-tang
45bfa70cb8
complete vmul jit kernel
7 years ago
tensor-tang
77236e33fc
init jitkernel
7 years ago
Xin Pan
82d68281c0
follow comments
...
test=develop
7 years ago
liuhongyu
8b2898e201
fix bug of formate; test=develop
7 years ago
Xin Pan
41c28d54c6
allow customize kernel selection
...
test=develop
7 years ago
liuhongyu
773dc73fbf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_5_support
7 years ago
liuhongyu
8daf67f90f
fix bugs; test=develop
7 years ago
chengduo
04539d4c5d
Fix clip.py ( #14718 )
...
* expose square
test=develop
* fix activation
test=develop
* Add square API
test=develop
* add necessary op
* code refine
* fix API.spec
test=develop
* fix unit test
test=develop
* add unit test sparse_grad_clip
test=develop
* fix API.spec
test=develop
* remove mac test for test_gradient_clip
test=develop
* remove selectedrows_mul_tensor
test=develop
7 years ago
Michal Gallus
6fdbb365ce
Include MKL-DNN header to concat op only when flag is set
...
test=develop
7 years ago
Michal Gallus
f2a880421e
Fix style @ concat integration and tests
...
test=develop
7 years ago
Michal Gallus
738069e491
Refactor MKL-DNN Concat
...
test=develop
7 years ago
Michal Gallus
208f912512
Implement MKL-DNN Concat
...
test=develop
7 years ago
liuhongyu
968dd3c078
add cudnn 5 support; test=develop
7 years ago
sneaxiy
e694d0c2e4
fix while_op eager deletion bug
...
add unittest
test=develop
7 years ago
Qiao Longfei
7b7fe01cae
optimize code
7 years ago
Qiao Longfei
daba57f752
complete ctr_reader
7 years ago
JiabinYang
8c75705984
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize_hs_op
...
, test=develop
7 years ago
JiabinYang
b387a19410
optimize op with blas
7 years ago
Zeng Jinle
ff4237309a
Merge pull request #14720 from sneaxiy/fix_seq_mask_op_infershape
...
Fix sequence_mask_op InferShape
7 years ago
Kaipeng Deng
934f13a70a
Merge pull request #14371 from heavengate/yolo_loss
...
Add YOLOv3 loss operator for YOLOv3 model
7 years ago
sneaxiy
65867d8989
test=develop
7 years ago
sneaxiy
64ad051b9a
merge develop
...
test=develop
7 years ago
sneaxiy
c47c451a00
fix bug
7 years ago
nhzlx
e7abe6b654
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_prelu_gpu
...
test=develop
7 years ago
nhzlx
f75815b78c
add prelu gpu inference
7 years ago
Yihua Xu
ea00270fe8
Remove the dims checking when the dim is 3 (test=develop)
7 years ago
jerrywgz
96dc3d8326
Merge pull request #14511 from jerrywgz/ignore_index_for_sigmoid_cross_entropy
...
add ignore index for sigmoid cross entropy with logits op, test=develop
7 years ago
Yihua Xu
669191c9cc
Implement conv3d with mkldnn library (test=develop)
7 years ago
Hongyu Liu
4f71a6ee2c
Merge pull request #14622 from PaddlePaddle/add_cudnn_lstm
...
Add cudnn lstm
7 years ago