Qiao Longfei
4feae25378
fix build problem test=develop
7 years ago
tensor-tang
38de1ff472
add fusion squared mat sub op
7 years ago
Qiao Longfei
e641ffe77b
change interface and api spec for dynamic_gru test=develop
7 years ago
tensor-tang
09c5786e22
add square jitkernel
7 years ago
Qiao Longfei
4c7be265d3
update avx gru grad kernel test=develop
7 years ago
tensor-tang
4461a458a5
adjust diff since abs is too large
...
test=develop
7 years ago
Qiao Longfei
9b16e54064
update gru_grad_op
...
test=develop
7 years ago
tensor-tang
ca6fdc6e33
refine and fix test
...
test=develop
7 years ago
tensor-tang
a89296ac1f
add repeated fc relu pass
7 years ago
Qiao Longfei
e477d789a1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gru-add-mode
7 years ago
tensor-tang
f347d6e4a1
add repeated fc relu unit test
...
test=develop
7 years ago
tensor-tang
99010e6eae
init repeated fc relu op
7 years ago
tensor-tang
266a5d2f52
implement matmul refer and mkl kernel
7 years ago
tensor-tang
c5623c87a3
init jit matmul kernel
7 years ago
Xin Pan
a92860a3b1
Merge pull request #15298 from panyx0718/fix
...
Fix python2 bug cause CE to fail
7 years ago
Xin Pan
50b4ac08b0
fix
...
test=develop
7 years ago
Xin Pan
a1bfb35dd6
try fix py2
...
test=develop
7 years ago
tensor-tang
781cd0cf51
add multi threads test of seqpool test ( #15293 )
7 years ago
Xin Pan
3f65869ba6
try fix
...
test=develop
7 years ago
Xin Pan
3e79e6544f
try fix
...
test=develop
7 years ago
Tao Luo
1d434a9de6
Merge pull request #15291 from wojtuss/wojtuss/fix-performance-drop
...
Fix performance drop when with MKL-DNN
7 years ago
minqiyang
c86b3dd6e6
Polish code
...
test=develop
7 years ago
minqiyang
ddfb9f1123
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_shared_ptr
...
test=develop
7 years ago
Xin Pan
d1220f23bb
Merge pull request #15229 from panyx0718/imperative
...
support python codes in the imperative model
7 years ago
colourful-tree
576c740d5d
Merge pull request #14964 from colourful-tree/data_norm
...
add data norm op
7 years ago
colourful-tree
d5a8909131
Merge pull request #14950 from colourful-tree/develop
...
add teacher student sigmoid loss
7 years ago
minqiyang
bc3e0d6e01
Fix expand op compile time bug
...
test=develop
7 years ago
Tao Luo
cbd1c7c01f
fix CompareDeterministic error when test_all_data
...
test=develop
7 years ago
Xin Pan
6a18c0f9ff
Merge pull request #15278 from chengduoZH/revert_remove_workspace_handle_in_conv2d_cudnn
...
Revert "Remove workspace_handle in conv_cudnn (#15186 )"
7 years ago
Zhaolong Xing
98e85f3735
add_transpose_flatten_concat_fuse ( #15121 )
7 years ago
chengduozh
c4eced9881
fix thread safe bug
...
test=develop
7 years ago
chengduozh
358e657f68
Revert "Remove workspace_handle in conv_cudnn ( #15186 )"
...
test=develop
This reverts commit 064512aa47
.
7 years ago
wopeizl
5d9edb4124
Merge pull request #15156 from wopeizl/windows/fixgpuissue
...
fix gpu buils issue on windows test=develop
7 years ago
Wojciech Uss
cb2ba58458
Fix performance drop when with MKL-DNN
...
test=develop
7 years ago
tensor-tang
fc9fbab6a0
Merge pull request #15271 from tensor-tang/fix/typo
...
fix typo and refine
7 years ago
minqiyang
d0b640dca1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_shared_ptr
...
test=develop
7 years ago
chengduo
064512aa47
Remove workspace_handle in conv_cudnn ( #15186 )
...
* remove workspace_handle in conv2d_cudnn
test=develop
* remove workspace_handle
test=develop
* fix bug
test=develop
* make test_conv2d_op SERIAL
test=develop
* save memory in conv_cudnn
test=develop
* enhance thread safety
test=develop
* enhance temporary allocator
test=develop
* Add excess fraction
test=develop
* follow comments
test=develop
* fix bug and code refine
test=develop
* fix memory size check
test=develop
* rename reuse_tmp_allocation_excess_fraction
test=develop
7 years ago
minqiyang
687171d22b
Move from shared_ptr to raw pointer
...
test=develop
7 years ago
tensor-tang
c3a9f3c4b2
fix typo and refine
...
test=develop
7 years ago
tensor-tang
146e942c65
Merge pull request #15250 from tensor-tang/refine/seqpool/feed
...
Refine/seqpool/feed with infer zerocopytensor
7 years ago
xiaolil1
8f17c714de
Conv int8 residual ( #15145 )
...
* Enable basic MKL-DNN INT8 Conv OP
test=develop
* Modify test case
test=develop
* Clean unittest code
test=develop
* Fix test
test=develop
* Modify test
test=develop
* Enable MKL-DNN INT8 Conv with Relu Fusion OP
test=develop
* Enable INT8 Conv with residual fusion OP
test=develop
* Modify code.
test=develop
* Modify basic INT8 Conv
test=develop
* Modify Conv.
test=develop
* fix style
test=develop
* Fix style
test=develop
* Fix test
test=develop
* Modify code.
test=develop
* Fix test
test=develop
7 years ago
Tao Luo
93d5c1ed5a
Merge pull request #15261 from wopeizl/fixdemos
...
remove the dismatch enclosure to avoid warning message test=develop
7 years ago
xiaoli.liu@intel.com
f34e779f4d
Enhance key generation for INT8 test.
...
test=develop
7 years ago
peizhilin
439691f5bd
adjust the shlwapi on windows
...
test=develop
7 years ago
peizhilin
92da467c99
Merge remote-tracking branch 'upstream/develop' into windows/fixgpuissue
7 years ago
Wu Yi
fd85418329
[Feature] support mix precision training for resnet ( #14899 )
...
* clip softmax for fp16
* updates
* fuse xent support fp16 test=develop
* wip
* wip
* add simple row reduce
* wip fp16 accurate softmax
* add accurate softmax kernel for fp16 test=develop
* update test=develop
* fix cpu build test=develop
* update api.spec test=develop
* follow comments test=develop
* fix build test=develop
* fix trt build test=develop
* fix inference build test=develop
* fix merge test=develop
* update test=develop
* try fix build test=develop
* fix build test=develop
* rename real_exp test=develop
* fortest
* remove hacky kernels test=develop
* clean up test=develop
7 years ago
tensor-tang
96786d3716
add compare_determine of seqpool1 test
...
test=develop
7 years ago
tensor-tang
ab9c4b2a9f
refine seqpool concat pass and remove unused nodes
...
test=develop
7 years ago
tensor-tang
ce909664d8
Merge remote-tracking branch 'ups/develop' into refine/seqpool/feed
7 years ago
peizhilin
e239558e56
remove the dismatch enclosure to avoid warning message test=develop
7 years ago
flame
fb63cd89d4
Add python ir graph API ( #14917 )
7 years ago
tensor-tang
a0a27bd240
add seqpool concat fuse pass tester
...
test=develop
7 years ago
Tao Luo
7d13d20769
Merge pull request #15245 from luotao1/rnn1_multi_thread
...
reduce threads number to avoid analyzer_rnn1_tester hang in CI
7 years ago
minqiyang
80197fac26
Add missing files
...
test=develop
7 years ago
Tao Luo
2b11c710b3
Merge pull request #15249 from NHZlX/fix_trt_demo_ci
...
fix demo ci bug
7 years ago
乔龙飞 Qiao Longfei
5e74c4e88f
Merge pull request #15100 from jacquesqiao/fix-dist-sparse-decay
...
fix dist sparse l2 decay
7 years ago
tensor-tang
8e086a8521
follow comment and fix typo
...
test=develop
7 years ago
minqiyang
08e2a5d611
Polish tracer code
...
test=develop
7 years ago
minqiyang
cded24768c
Remove shared_ptr holder for VarBase
...
test=develop
7 years ago
minqiyang
c8d1a8e909
Change var_ and grad_ to shared_ptr
7 years ago
minqiyang
7aab39af15
Change grads to VarBase
7 years ago
tensor-tang
54afcb7ec6
add compare zerocopy test with native result
...
test=develop
7 years ago
tensor-tang
137060135e
fix zerocopy size
7 years ago
tensor-tang
7461356723
add zerocopy for seqpool test
7 years ago
tensor-tang
48410b9bfe
Merge pull request #15237 from tensor-tang/fuse/seqpool_concat_2
...
Fuse/seqpool concat 2
7 years ago
nhzlx
e7d83389e6
fix demo ci bug
...
1. trt_demo bug
2. trigger exit when exists a bug
test=develop
7 years ago
Tao Luo
9b41e45584
Merge pull request #15222 from luotao1/native_config
...
fix analyzer_test runs error in native_config
7 years ago
Tao Luo
d43983b61d
reduce threads number to avoid hang in CI
...
test=develop
7 years ago
Qiao Longfei
653cd31971
remote unused code
7 years ago
Qiao Longfei
0a79d7a404
fix merge
7 years ago
Qiao Longfei
422449a945
fix style
7 years ago
Qiao Longfei
edad60e612
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-cpp-reader
7 years ago
nhzlx
c1264e99f3
fix win error
...
test=develop
7 years ago
peizhilin
c1235c935f
add the enable_debug flag
...
test=develop
7 years ago
nhzlx
4e3522e5b4
add trt int8 support
...
test=develop
7 years ago
Xin Pan
7b73fc9e1a
Merge pull request #15089 from panyx0718/api
...
try unify Executor and ParallelExecutor
7 years ago
Xin Pan
9597fd05e9
polish
...
test=develop
7 years ago
Qiao Longfei
d0e3b24002
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-dist-sparse-decay
...
test=develop
7 years ago
tensor-tang
f8c305b243
Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat_2
...
test=develop
7 years ago
Tao Luo
197d0f2431
fix trt_model_tester to pass the ci
...
test=develop
7 years ago
tensor-tang
223c61ca5e
Merge pull request #15170 from tensor-tang/jit/seqpool
...
refine seqpool op
7 years ago
Qiao Longfei
c3b9edf958
follow comment test=develop
7 years ago
Zeng Jinle
e29f10d315
Merge pull request #15207 from sneaxiy/remove_op_handle_lock_and_fix_var
...
Remove op handle lock and fix var
7 years ago
Zeng Jinle
7b638f2781
Merge pull request #15218 from sneaxiy/fix_same_name_func
...
Fix same name func framework::ToTypeIndex
7 years ago
Tao Luo
feee78d9f0
Merge pull request #15214 from tensor-tang/fix/debug
...
fix debug build error
7 years ago
Xin Pan
7aad6afd49
forward and backward
...
test=develop
7 years ago
mozga-intel
eff90eb941
PADDLE_WITH_NGRAPH was removed from the code
...
test=develop
7 years ago
mozga-intel
a42f8f4f6f
Enable element_wise_add operator for a ngraph
...
test=develop
7 years ago
mozga-intel
e4184008a4
PADDLE_WITH_NGRAPH was removed from the code
...
test=develop
7 years ago
Qiao Longfei
3ace486ebd
fix sum_op selected rows test=develop
7 years ago
Tao Luo
71d9097a89
fix analyzer_test runs error in native_config
...
test=develop
7 years ago
Tao Luo
9c02765158
Merge pull request #15210 from Superjomn/fix/analysis_tester_bug
...
fix analysis_tester bug
7 years ago
tensor-tang
72d2a1801e
add seqpool concat fuse pass
...
test=develop
7 years ago
tensor-tang
f702f8fd10
Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat
7 years ago
sneaxiy
bc205ef374
fix same name func
...
test=develop
7 years ago
tensor-tang
69fd3fdb52
fix debug build error
...
test=develop
7 years ago
Xin Pan
2349acea48
checkpoint
...
test=develop
7 years ago
xuezhong
c0bc818688
Merge pull request #15188 from velconia/add_pyramid_dnn_support
...
Add no lock optimization pass
7 years ago
Qiao Longfei
b16e832d4d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-dist-sparse-decay
7 years ago
Qiao Longfei
4d169ad981
update api spec test=develop
7 years ago
superjomn
23bdd0a223
fix analysis_tester bug
...
test=develop
7 years ago
Yan Chunwei
d09d6eadc0
make inference api work with Doxygen ( #15195 )
7 years ago
Zeng Jinle
c562be20d9
Merge pull request #15193 from sneaxiy/fix_cudnn_compatible_check
...
Fix cudnn compatible check
7 years ago
peizhilin
1cd95d8a0b
use thread local instance test=develop
7 years ago
minqiyang
7b7d0d0caf
Change hash function back
...
test=develop
7 years ago
Xin Pan
11d4d39cd7
forward working
...
test=develop
7 years ago
sneaxiy
ed409ac9f4
Revert "Revert "Remove op handle lock""
...
test=develop
7 years ago
sneaxiy
4a443ffc98
merge develop
...
test=develop
7 years ago
peizhilin
d54133ea85
not include the numeric under linux test=develop
7 years ago
sneaxiy
7c7342bf12
fix scope.var()
...
test=develop
7 years ago
Tao Luo
4d9aa1745a
Merge pull request #14806 from mozga-intel/mozga-intel/scale_operator_ngraph
...
Enable scale operator for a ngraph engine
7 years ago
Tao Luo
dc0c221426
Merge pull request #14803 from mozga-intel/mozga-intel/mean_operator_ngraph
...
Enable mean operator for a ngraph engine
7 years ago
Xin Pan
b629133375
checkpoint runnable PyLayer
...
test=develop
7 years ago
peizhilin
a6f5ceee74
add the python callstack for debug support test=develop
7 years ago
Zeng Jinle
dacfaaa966
Revert "Remove op handle lock"
...
test=develop
7 years ago
Tao Luo
6ca9a4810b
Merge pull request #15196 from luotao1/serial
...
run analyzer_tester serial in multi-thread
7 years ago
Xin Pan
c4b09a713f
polish
...
test=develop
7 years ago
minqiyang
b76695418a
Polish log
...
test=develop
7 years ago
minqiyang
1bfbc0d963
Polish code
...
test=develop
7 years ago
minqiyang
7f45b9511a
Polish code
7 years ago
minqiyang
68a07328fa
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_pyramid_dnn_support
...
test=develop
7 years ago
Qiyang Min
317840d3ba
Merge pull request #14277 from velconia/add_fused_emb_seq_pool_op
...
Add fused emb seq pool op
7 years ago
tensor-tang
2dd331cc21
Merge remote-tracking branch 'ups/develop' into fuse/seqpool_concat
...
test=develop
7 years ago
tensor-tang
316636404f
add seqpool concat unit test
7 years ago
Yan Chunwei
6ccf8685f7
refactor tensorrt node teller ( #15181 )
7 years ago
Tao Luo
7dc0181c46
run analyzer_tester serial in multi-thread
...
test=develop
7 years ago
xiaolil1
c8f101e5da
Conv int8 relu ( #15130 )
...
* Enable basic MKL-DNN INT8 Conv OP
test=develop
* Modify test case
test=develop
* Clean unittest code
test=develop
* Fix test
test=develop
* Modify test
test=develop
* Enable MKL-DNN INT8 Conv with Relu Fusion OP
test=develop
* Modify basic INT8 Conv
test=develop
* fix type
test=develop
* Modify test
test=develop
7 years ago
sneaxiy
9793a0b6a6
fix_cudnn_compatible_check
7 years ago
Zeng Jinle
ccb322d6a5
merge develop
7 years ago
Xin Pan
0d0bc61248
update api
...
test=develop
7 years ago
tensor-tang
7923d7271f
add fusion seqpool concat op
7 years ago
Zeng Jinle
f3a13512fc
Merge pull request #15139 from sneaxiy/remove_op_handle_lock
...
Remove op handle lock
7 years ago
Qiao Longfei
44b300556d
change min_row_size_to_use_multithread to parameter of adam
...
test=develop
7 years ago
Qiao Longfei
87b4eb1da4
change min_param_size_to_use_multithread to min_row_size_to_use_multithread
7 years ago
minqiyang
0f94c1ac14
Polish code
...
test=develop
7 years ago
minqiyang
00e4de04bf
Polish code
7 years ago
minqiyang
4bfa110fd8
Add no lock optimize pass
...
test=develop
7 years ago
chengduo
eabb2105fa
Refactor MultiDevSSAGraphBuilder ( #15090 )
...
* Refactor ParallelExecutor
test=develop
* extract Reduce and AllReduce mode from MultiDevSSAGraphBuilder
test=develop
* Refactor MultiDevSSAGraphBuilder
test=developt
* Remove enable_data_balance
test=develop
* code refine
test=develop
* remove data balance
test=develop
* refine ScaleLossGradOp
test=develop
* remove uncessary file
test=develop
* code refine
test=develop
* modify function name
test=develop
* follow comments
test=develop
* add is_distribution field
test=develop
* set is_distribution
test=develop
* fix DistSSAGraphBuilder
test=develop
7 years ago
Yan Chunwei
875a07c32d
refactor inference analysis api ( #14634 )
7 years ago
minqiyang
c09a379015
remove const_cast
...
test=develop
7 years ago
tensor-tang
102d93712e
Merge remote-tracking branch 'ups/develop' into jit/seqpool
...
test=develop
7 years ago
tensor-tang
123b98f417
refine heigth and codesize and support all pool
...
test=develop
7 years ago
tensor-tang
0145f40f45
use height from params of jitcode
7 years ago
tensor-tang
e0591deebc
enhance seqpool jitcode
7 years ago
Zeng Jinle
99e6e8b00f
Merge pull request #15179 from sneaxiy/fix_crf_grad_lod
...
Fix crf grad lod share
7 years ago
minqiyang
db8eb9b688
Polish code
...
test=develop
7 years ago
minqiyang
f4c990e7b8
Add fused embedding ops
7 years ago
minqiyang
39b98709b1
Move fused ops to fused dir
...
test=develop
7 years ago
minqiyang
920d4a8b78
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_fused_emb_seq_pool_op
...
test=develop
7 years ago
Tao Luo
5ee596cae5
Merge pull request #15175 from baojun-nervana/intel/mkldnn
...
Update ngraph to resolve issue with mkldnn upgrade
7 years ago
乔龙飞 Qiao Longfei
7c891e1ecc
Merge pull request #15111 from jacquesqiao/fix-adam-tmp-var
...
Fix adam tmp var on cpu
7 years ago
mozga-intel
e77956c920
Enable mean operator for a ngraph
...
test=develop
7 years ago
mozga-intel
dd768714ab
Enable scale operator for a ngraph
...
test=develop
7 years ago
sneaxiy
be425461a1
fix crf grad lod share
...
test=develop
7 years ago
Qiao Longfei
3e1b914fcb
update gru op forward kernel
7 years ago
Qiao Longfei
7a81ab8607
complete gru_unite_op and test
7 years ago
Qiao Longfei
72618c8da5
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into gru-add-mode
7 years ago
Qiao Longfei
17b1b660fc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
...
test=develop
7 years ago
Qiao Longfei
c15270c5b2
optimize multi thread adam
7 years ago
乔龙飞 Qiao Longfei
e1679b8847
Merge pull request #14893 from JiabinYang/feature/add_prefech_hs
...
Feature/add prefech hs
7 years ago
baojun-nervana
f0cde74564
Update ngraph with elt-wise relu test=develop
7 years ago
tensor-tang
92201d3956
support avg and sqrt pool and add mkl impl
...
test=develop
7 years ago
tensor-tang
c50060bb26
add jitcode impl and use it
7 years ago
tensor-tang
142bb41748
add seqpool jitkernel test and benchmark
7 years ago
tensor-tang
e58a569c6c
use seqpool jitkernel
7 years ago
tensor-tang
3e01a4048f
add refer seqpool jitkernel
7 years ago
Qiao Longfei
4ecb9c93f0
update API.spec
...
test=develop
7 years ago
xiaolil1
bbc9336878
Enable basic MKL-DNN INT8 Conv OP ( #15124 )
...
* Enable basic MKL-DNN INT8 Conv OP
test=develop
* Modify test case
test=develop
* Clean unittest code
test=develop
* Fix test
test=develop
* Modify test
test=develop
* Modify basic INT8 Conv
test=develop
7 years ago
Xin Pan
8ae9094e07
polish and resolve conflicts
...
test=develop
7 years ago
Xin Pan
5e928e579a
try unify Executor and ParallelExecutor
...
test=develop
7 years ago
Qiao Longfei
e10af895de
update gru grad op
...
test=develop
7 years ago
Qiao Longfei
78ec7c0f99
gru add origin mode
...
test=develop
7 years ago
peizhilin
c919b2f31d
Merge remote-tracking branch 'upstream/develop' into windows/fixgpuissue
7 years ago
peizhilin
fd4f4d0e5f
fix build issue test=develop
7 years ago
Yan Xu
a1e60ab19b
Merge pull request #14791 from Yancey1989/parallel_graph_mode
...
[Feature] Add ParallelGraph executor mode in parallelexecutor to improve performance
7 years ago
peizhilin
9ae50dd07d
fix gpu buils issue on windows test=develop
7 years ago
Qiao Longfei
0e747e8d02
change the limit of thead num
7 years ago
qingqing01
c981bf0f9d
Fix compling error with cuDNN v5 ( #15148 )
...
test=develop
7 years ago
Yancey1989
4ad9de74dd
disable sync nccl by default test=develop
7 years ago
Yancey1989
449bf58ea6
disable parallelgraph mode by default test=develop
7 years ago
Yancey1989
db603398b7
disable parallel graph executor by default
7 years ago
wopeizl
67093da398
Merge pull request #15122 from wopeizl/windows/fixhuberloss
...
fix the huber loss compile issue on windows test=develop
7 years ago
sneaxiy
d0a8a1e950
remove_op_handle_lock
...
test=develop
7 years ago
Xin Pan
087af6a686
Merge pull request #15131 from panyx0718/clean
...
hide temp tensor allocation
7 years ago
Yancey1989
e65436103f
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
...
test=develop
7 years ago
Yancey1989
94c80347b6
update by comment
7 years ago
sneaxiy
6f06e6cdac
Merge remote origin
...
test=develop
7 years ago
Qiyang Min
23761beaef
Merge pull request #14971 from velconia/imperative_mnist
...
Imperative Optimizer
7 years ago
xiaolil1
8eb1f26211
Enable INT8 pool OP ( #15046 )
...
* Enable INT8 pool OP
test=develop
* fix unittest
test=develop
* Clean unittest code.
test=develop
7 years ago
Wu Yi
227e0c4518
fix nccl2 mode startup test=develop ( #15132 )
7 years ago
Xin Pan
9186451f60
hide GetTensor
...
test=develop
7 years ago
wopeizl
7305fc2ff9
Merge pull request #15112 from wopeizl/windows/fixsaveandloadops
...
fix save and load ops on windows test=develop
7 years ago
peizhilin
dba009dbbf
fix script issue
...
test=develop
7 years ago
peizhilin
cd2d60b4c8
fix build issue for density prior box op on windows test=develop
7 years ago
Yancey1989
35cda13e9f
fix unittest test=develop
7 years ago
peizhilin
1f423f84ac
fix the huber loss compile issue on windows test=develop
7 years ago
sneaxiy
d25395fc98
remove tensor core lock
...
test=develop
7 years ago
tensor-tang
516fe301ee
add comment in case of empty name
...
test=develop
7 years ago
peizhilin
b3688100ad
fix unittest
...
test=develop
7 years ago
minqiyang
2547f9d1b8
Polish code
...
test=develop
7 years ago
tensor-tang
b9c645639b
workaround with third party cache
...
test=develop
7 years ago
peizhilin
5d8f281397
restore the memory mode
...
test=develop
7 years ago
tensor-tang
c02165d23a
Merge remote-tracking branch 'ups/develop' into refine/seqpool
...
test=develop
7 years ago
tensor-tang
dca68cdf97
throw error when name not find
...
test=develop
7 years ago
peizhilin
33b7821a75
fix save and load ops on windows test=develop
7 years ago
Qiao Longfei
dfe85fb358
fix build
7 years ago
Qiao Longfei
f057bbd1d1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-adam-tmp-var
7 years ago
Qiao Longfei
f1c973b014
adam op should not create tmp var in compute
7 years ago
Yancey1989
82b42e31f0
polish unittest test=develop
7 years ago
wopeizl
10bedbdeaa
Merge pull request #15105 from wopeizl/windows/fixtimer
...
fix timer test=develop
7 years ago
minqiyang
09e2e66236
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
7 years ago
Yancey1989
0a885ac12a
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
...
test=develop
7 years ago
Yancey1989
ca8c77d966
selecte execution according to strategy test=develop
7 years ago
tensor-tang
484085693e
update url and num_ops
...
test=develop
7 years ago
tensor-tang
cd94df8679
fix load and refine
7 years ago
tensor-tang
8e271896ae
add test data for seqpool1
7 years ago
minqiyang
858e903231
Add unittest for operator
...
test=develop
7 years ago
shippingwang
83f2e2c903
rewrite the comments, test=develop
7 years ago
gongweibao
ce70229ba6
Add max_body_size flags to brpc ( #15084 )
7 years ago
qingqing01
6f0a1d7b47
Inception fusion operator. ( #14968 )
...
* Inception fusion operator.
* Support horizontal layer fusion in conv_fusion_op.
* Search conv algo strategy for variable-length input.
search N times and cache the searched algos. For other input, choose the algo of input whose area is closest to this input.
7 years ago
peizhilin
813c2ce539
fix timer test=develop
7 years ago
Qiao Longfei
25d44d40ac
sum op support empty selected rows as input
7 years ago
wopeizl
7ab501264d
Merge pull request #15069 from wopeizl/windows/dsosupport
...
add cuda dso support for windows
7 years ago
minqiyang
6a5f604607
Support stop_gradients var in imperative backward
...
test=develop
7 years ago
guru4elephant
ff739449ab
Merge pull request #15018 from guru4elephant/add_timer
...
Add debug thread function for async executor
7 years ago
Qiyang Min
e29cbfe4f7
Merge pull request #14829 from velconia/accelerate_ddpg
...
Accelerate little models
7 years ago
Tao Luo
9c2cbfb89e
Merge pull request #15093 from baojun-nervana/intel/cmake
...
Upgrade ngraph & clean up cmake
7 years ago
Zeng Jinle
25b49a0896
Merge pull request #14933 from sneaxiy/rewrite_ddim
...
Rewrite ddim
7 years ago
Wu Yi
a8bc05b5ff
Refactor distributed RPC ( #15075 )
...
* wip
* wip
* refactor no.1 dir structure test=develop
* fix linking test=develop
* fix includes test=develop
* fix build test=develop
* fix build test=develop
7 years ago
baojun-nervana
555fbc10d8
upgrade ngraph to v0.10.1 test=develop
7 years ago
baojun-nervana
c714c36482
simplify logic test=develop
7 years ago
minqiyang
9e3155e01d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
7 years ago
minqiyang
6bb84490af
Fix imperative unit test
...
test=develop
7 years ago
Xin Pan
3e8408429d
Merge pull request #15053 from panyx0718/imperative_hold
...
refactor to avoid scope.
7 years ago
sneaxiy
73896eeb94
merge develop
...
test=develop
7 years ago
Wu Yi
e26cced7cc
refine batch merge pass ( #14777 )
...
* refine batch merge pass
* refine batch merge pass test=develop
7 years ago
minqiyang
336160e651
Complete imperative optimizer implementation
...
test=develop
7 years ago
Yancey1989
4743c9cd5d
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
7 years ago
sneaxiy
9a3a246cb5
fix py35 compile error
...
test=develop
7 years ago
Xin Pan
f7294f8b25
register float16
...
test=develop
7 years ago
Zhaolong Xing
4048cfa9da
Merge pull request #15048 from NHZlX/add_affine_channel_fuse
...
Add conv+ affine channel fuse pass
7 years ago
minqiyang
ef7d563db9
Add changes back
...
test=develop
7 years ago
minqiyang
a318a490ab
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
...
test=develop
7 years ago
Zeng Jinle
c0bcff00dc
Merge pull request #14962 from sneaxiy/rewrite_variable_type
...
Rewrite variable type
7 years ago
chengduo
fe8495a758
[WIP] Refine MultiDevSSAGraph ( #15040 )
...
* refine parallel_exe
test=develop
* rename shared_var_device
* code refine
* add test_weight_decay
* remove Sort
test=develop
* Add SortForReduce
test=develop
* code refine
test=develop
* follow comment
test=develop
7 years ago
Qiao Longfei
d161215332
optimize adam multi thread
7 years ago
dongdaxiang
82335cd88c
Merge branch 'add_timer' of https://github.com/guru4elephant/Paddle into add_timer
...
test=develop
7 years ago
Tao Luo
85471533e0
Merge pull request #15079 from luotao1/analysis_test
...
simplify analysis tests
7 years ago
minqiyang
d4b9928c5a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into imperative_mnist
...
test=develop
7 years ago
minqiyang
5822f7f1d8
Polish code
...
test=develop
7 years ago
wopeizl
719ebe3786
Merge pull request #15070 from wopeizl/windows/testcasefix
...
fix test issues on windows
7 years ago
Qiao Longfei
7a58ad5c79
lazy mode have higher priority then multithread
...
test=develop
7 years ago
Xin Pan
c132c79011
address comments and resolve conflicts.
...
test=develop
7 years ago
Xin Pan
b91a7a9d30
clear operator changes
...
test=develop
7 years ago
Xin Pan
f52b514dcd
call kernel
7 years ago
Xin Pan
4e80e04f23
fix
...
test=develop
7 years ago
Xin Pan
7b6bf9ddf2
make fill_constant kernel-based
...
test=develop
7 years ago
Xin Pan
61491ce250
clean
...
test=develop
7 years ago
Xin Pan
ce7e503cbe
refactor to avoid scope.
...
test=develop
7 years ago
Qiyang Min
0238a3bb4f
Merge pull request #14972 from velconia/accelerate_lstm
...
Accelerate PADDLE_ENFORCE
7 years ago
Houjiang Chen
242d3c71a6
Merge pull request #15031 from hjchen2/develop
...
Fix conv_elementwise_add2_act pass
7 years ago
Qiao Longfei
d0572bf02e
add log for lazy mode test=develop
7 years ago
Xin Pan
71a4a8e981
Merge pull request #15071 from wopeizl/revert/15035
...
Revert "cherry-pick the #12759"
7 years ago
minqiyang
68e9b841ab
Add support for optimizer
7 years ago
Qiao Longfei
1177b0bc84
update multi thread adam
7 years ago
Qiao Longfei
3b294e2e2e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multithread-sparse-adam
7 years ago
Zeng Jinle
988bc2b5a7
Merge pull request #15060 from dzhwinter/fix/nccl
...
fix ci error. test=develop
7 years ago
sneaxiy
c4ce2e7b21
merge develop, solve conflict
...
test=develop
7 years ago
minqiyang
8ed0233924
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
...
test=develop
7 years ago
Zeng Jinle
9c6a0203e2
Merge pull request #15073 from sneaxiy/add_scope_pool
...
Add scope_pool
7 years ago
shippingwang
9322d34032
Fix, test=develop
7 years ago
tensor-tang
656c672cdd
Merge pull request #15051 from tensor-tang/test/seq_pool1
...
Add seq pool1 inference test
7 years ago
Tao Luo
ecae157edf
simplify some data record in analyzer_tester
...
test=develop
7 years ago
sneaxiy
b56aca82e9
merge develop
...
test=develop
7 years ago
Tao Luo
05f1b65da3
simplify prepere_input in analyzer_test
...
test=develop
7 years ago
sneaxiy
ee83ce75bf
try to fix py35 compile error
...
test=develop
7 years ago
sneaxiy
10a6bc9675
modify API.spec
...
test=develop
7 years ago
sneaxiy
3e917a934a
add scope_pool
...
add module cleanup
test=develop
7 years ago
nhzlx
02e17396c2
fix comments
...
test=develop
7 years ago
jerrywgz
ef2d292bfc
Merge pull request #14956 from jerrywgz/fix_bug_in_ifelse
...
fix bug in if-else op
7 years ago
Yancey1989
1a4f79a7de
fix unittest test=develop
7 years ago
peizhilin
e49276e731
restore the huber_loss_op
...
test=develop
7 years ago
Yancey1989
86bb583881
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
7 years ago
Yancey1989
495e73d766
enable gc
7 years ago
Yancey1989
28cdfbc2b0
delete comment code
7 years ago
Yancey1989
845bfd5807
cleanup code
7 years ago
peizhilin
2388d0e7d6
Revert "cherry-pick the #12759"
...
test=develop
This reverts commit 7f6d8acecb
.
7 years ago
nhzlx
71636e677d
add min_subgraph_size attr to tensorrt config
...
test=develop
7 years ago
peizhilin
01c00b07dd
fix test issues on windows
...
test=develop
7 years ago
peizhilin
1e7f83e60a
add cuda dso support for windows
...
test=develop
7 years ago
tangwei12
dc8eca826e
code style fix, test=develop ( #15045 )
...
* code style fix, test=develop
7 years ago
Yancey1989
41a64f6a2a
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
7 years ago
nhzlx
a6aa8ea771
faster rcnn input is presistable. (fix it in paddle-trt)
...
test=develop
7 years ago
hjchen2
956cf92145
Fix conv_elementwise_add2_act pass
...
test=develop
7 years ago
Tao Luo
69659f4ae2
Merge pull request #15037 from jianhang-liu/fix/abnormal_stack_op_time
...
Fix/abnormal stack op time
7 years ago
whs
2314f2ebb3
Make topk op support variable k. ( #15044 )
...
* Make topk op support variable k.
test=develop
* Fix tensor type.
test=develop
7 years ago
shippingwang
5631fc0817
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into shufflechannel
7 years ago
sneaxiy
179acc60b3
fix conflict with develop
...
test=develop
7 years ago
wopeizl
09bd8fa67a
Merge pull request #15035 from wopeizl/debug/improvement1
...
cherry-pick the #12759
7 years ago
sneaxiy
dde3afe7b7
Merge develop
...
test=develop
7 years ago