Xin Pan
5839e3236b
add program check
...
test=develop
7 years ago
gmcather
ba22624d7e
position encoding && log loss
...
test=develop
7 years ago
Tao Luo
3a96d41d72
remove with_inference option
...
test=develop
7 years ago
sneaxiy
2494ca83ab
test=develop
7 years ago
dzhwinter
ebfe5a02b3
merge develop branch
7 years ago
JiabinYang
7c45e77c41
test=develop
7 years ago
barrierye
b5f78ce42d
update paddle/fluid/API.spec
...
test=develop
7 years ago
qingqing01
cb27a9219d
Merge pull request #13971 from sefira/FasterOpDoc
...
generate proposal labels doc
7 years ago
sneaxiy
5e5d2223a1
test=develop
7 years ago
tensor-tang
3c957af139
Merge pull request #14080 from tensor-tang/refine/jit/crf2
...
Refine/jit/crf decoding
7 years ago
Xin Pan
aa87a989ec
Merge pull request #14119 from Superjomn/fix/api-impl-tester
...
disable some tests
7 years ago
barrierye
5f3acac9b3
update paddle/fluid/API.spec
...
test=develop
7 years ago
Xin Pan
9ef19d4919
Merge pull request #14106 from luotao1/fix_cmake_warning
...
[1.1] fix cmake warning when ON_INFER=false
7 years ago
sneaxiy
f2eed667c0
test=develop
7 years ago
Xin Pan
16dfedb8b8
Merge pull request #14103 from jacquesqiao/cpu-for-1.1-merge-with-shape
...
[1.1] Cpu for 1.1 merge with shape
7 years ago
sneaxiy
cef8cc81db
merge develop
7 years ago
Jacek Czaja
458b16f42a
Rebase of seqpool-max optimization
...
test=develop
- Added rough profiling
- Profiled maxpool itself
- First draft of max seqpool optimization (is_test added)
- Added unit tests to seqpool
- Cosmetic fixes
- Fix to UT of Seq pool
Disabled grad checking for sequence max pool when is_test is set to True
-Cosmetic fix to comment
test=develop
- Fix to GPU build
test=develop
- yet another GPU fix for sequence max pool
- Fix to comment
test=develop
- Change to API of sequence_pool
test=develop
- Yet another API spec change
test=develop
7 years ago
superjomn
5f7fda0b07
disable some tests
...
test=develop
7 years ago
dengkaipeng
ff6329bd5f
fix some inappropriate expressions in api doc for grid_sampler. test=develop
7 years ago
Tao Luo
d3534d2b14
refine warning message
...
test=develop
7 years ago
Xin Pan
177720a737
Merge pull request #14116 from chengduoZH/release/1.1.0
...
[1.1]Fix op_role value
7 years ago
chengduozh
acec4cb8ca
[1.1]fix op_role value
...
test=release/1.1
7 years ago
barrierye
73671379cd
update paddle/fluid/API.spec
...
test=develop
7 years ago
dengkaipeng
8f1e398824
move param exclusive to the last in pool2d/pool3d for forward compatibility:. test=develop
7 years ago
dengkaipeng
593e1b18d7
fix some bugs and add some doc for GridSampleOp
7 years ago
dengkaipeng
0bb0e0c10f
add Grid Sampler Operator for STN.
7 years ago
Qiao Longfei
3d4e050802
fix compile, optimize code test=develop
7 years ago
Yu Yang
c01696f8c2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
...
test=develop
7 years ago
Qiao Longfei
d26ff8cb2d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpu-for-1.1-merge-with-shape
7 years ago
JiabinYang
e0a89503f8
test=develop
7 years ago
JiabinYang
0e3038680b
test=develop
7 years ago
Tao Luo
79da263b11
Merge pull request #14032 from sfraczek/sfraczek/fix-test-multithreading-mkldnn
...
fix test resnet50 multi-threading on mkldnn
7 years ago
Wu Yi
26200f2e42
[1.1] [project] train imagenet using large batch size ( #13766 )
...
* fix nccl2 lars dist support
* put lars in momentum op
* add tests lars
* fix ci
* fix cpu kernel
* soft warning
* remove lars in test_recognize_digits.py
* move to another op
* add file
* update api.spec test=develop
* update test=develop
* fix api.spec test=develop
* wip
* wip, finish grad merge ops
* wip, finish graph build
* wip test running
* work on 1 gpu
* workable version
* update
* fix tests
* fuse broadcast op
* fix compile failed
* refine
* add batch merge test mnist
* fix CI test=develop
* fix build
* use independent bn params for batch merge test=develop
* update api.spec
* follow comments and for test
* wip
* refine tests test=develop
* follow comments test=develop
* remove startup bn modify test=develop
* follow comments test=develop
* fix merge test=develop
7 years ago
sneaxiy
2414f92f54
test=develop
7 years ago
barrierye
8c1e304307
merge nn.py
7 years ago
sneaxiy
45559d042c
move to pass
...
test=develop
7 years ago
dengkaipeng
c93e044ae0
add inclusive/exclusive mode in PoolOp avg pool type
7 years ago
JiabinYang
9a74c4489f
test=develop
7 years ago
barrierye
9dc28179a4
add similarity_focus op
7 years ago
Qiao Longfei
7cd2417fe2
Merge branch 'develop' into cpu-for-1.1-merge-with-shape
...
test=develop
7 years ago
Xin Pan
0a80f06ec4
Merge pull request #14086 from panyx0718/fix6
...
delete unused codes.
7 years ago
sneaxiy
a314a80cdb
merge develop
7 years ago
Tao Luo
4928ff32a9
fix cmake warning when ON_INFER=false
...
test=develop
7 years ago
dzhwinter
c8adc2c6fe
cudnn version. staged.
7 years ago
Qiao Longfei
06ffbc4f28
Merge branch 'shape_int_to_int64' of https://github.com/seiriosPlus/Paddle into cpu-for-1.1-merge-with-shape
...
test=develop
7 years ago
seiriosPlus
06de824ba8
fix shape in floats
7 years ago
Yan Chunwei
ee74be3a49
[1.1] Bugfix/tensorarray ( #14044 )
7 years ago
Qiyang Min
33b4920d2d
Merge pull request #14057 from velconia/continue_hash_op
...
[1.1] Add hash_op implementation
7 years ago
Qiyang Min
209f24a241
Merge pull request #14051 from velconia/accelerate_embedding_grad
...
[1.1] Accelerate sparse embedding grad op in CPU device
7 years ago
minqiyang
2fec8c5d9a
Polish code
...
test=develop
7 years ago
minqiyang
7f7af5d412
Add xxhash deps to inference demo and trainer demo
...
test=develop
7 years ago
Qiao Longfei
7cfc3c4415
Merge branch 'optimize-sum-seq-pooling-op' of ssh://github.com/jacquesqiao/Paddle into cpu-for-1.1-merge
7 years ago
Qiao Longfei
72aef6b168
sum selected rows check empty
7 years ago
minqiyang
fe18adfbaa
Add fluid inference support
...
test=develop
7 years ago
seiriosPlus
c34610f86d
Fix lookup table at CPU Reduce strategy, test=develop
7 years ago
Qiao Longfei
641369f92b
Merge branch 'dist-table-do-not-init-on-trainer' of ssh://github.com/jacquesqiao/Paddle into cpu-for-1.1-merge
7 years ago
Qiao Longfei
d69c820707
Merge branch 'add-flag-to-control-rpc-thread-num' of ssh://github.com/jacquesqiao/Paddle into cpu-for-1.1-merge
7 years ago
Qiao Longfei
1ed9ef6d70
Merge branch 'shape_int_to_int64' of https://github.com/seiriosPlus/Paddle into cpu-for-1.1-merge
7 years ago
Qiao Longfei
f1a3fb041b
Merge branch 'fix_lookuptable_in_reduce' of https://github.com/seiriosPlus/Paddle into cpu-for-1.1-merge
7 years ago
Qiao Longfei
da61a5b672
Merge branch 'optimizer-prefetch' of https://github.com/seiriosPlus/Paddle into cpu-for-1.1-merge
7 years ago
tangwei12
5ce3a32e06
Merge branch 'develop' into optimizer-prefetch
7 years ago
seiriosPlus
b6590b05fb
submit by tangwei12, test=develop
7 years ago
Wu Yi
9da9b1926b
[1.1] fix graph num hang ( #14072 )
...
* fix graph num hang test=develop
* re-enable tests test=develop
* re-enable graph num check test=develop
* fix multi device pass role check test=develop
7 years ago
tangwei12
cb1ccc710b
fix shape type in uniform_random_op.cu
7 years ago
Qiao Longfei
575f22711d
optimize code
...
test=develop
7 years ago
Qiao Longfei
96d5500934
optimize code
7 years ago
Qiao Longfei
748ee35c89
sum op handle empty input update selected_rows_functor.cu
7 years ago
Qiao Longfei
dd78b5df93
sum op handle empty input
7 years ago
Qiao Longfei
cbe128bbae
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-sum-seq-pooling-op
7 years ago
Qiao Longfei
f4df0cb1a2
update the type of shape to int64, format code
7 years ago
Qiao Longfei
fad42fe7cc
broadcast handle not inited parameter
7 years ago
Qiao Longfei
7dcb0dc8c6
update year
7 years ago
Qiao Longfei
68aeb4e7e9
add fake init test in test_dist_transpiler
7 years ago
Tao Luo
5ed3e6f3f6
Merge pull request #14042 from luotao1/remove_unused_code
...
[1.1] remove unused code in paddle_inference_api.h
7 years ago
Qiao Longfei
a13c788a04
fix a bug
7 years ago
Zeng Jinle
97d47a7d08
Merge pull request #13913 from sneaxiy/seq_reverse
...
Add sequence_reverse_op
7 years ago
JiabinYang
6e3615422f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_reorg_op
7 years ago
Jiabin Yang
a3efba176c
Merge pull request #14085 from jerrywgz/fix_generate_proposals_op
...
[1.1] fix erase end in generate proposals op
7 years ago
dzhwinter
7141debe38
add cudnn back. staged.
7 years ago
Guo Sheng
b9ae1c49f8
Merge pull request #13994 from guoshengCS/add-reshape-reuse-input
...
[1.1] Make reshape_op reuse input.
7 years ago
Zeng Jinle
60058180cb
Merge pull request #13945 from sneaxiy/unify_mixed_vector_api
...
Unify API of mixed_vector in GPU and CPU
7 years ago
Qiao Longfei
0328ffd3ab
add fake init op
7 years ago
Xin Pan
bcc9126e7b
Merge pull request #14056 from panyx0718/fix
...
Fix threadpool
7 years ago
Sylwester Fraczek
2098b42584
review fixes (Teamcity fails)
...
test=develop
7 years ago
Tao Luo
961baea16c
Merge pull request #14063 from wojtuss/wojtuss/remove-unused-EnableMKLDNN
...
remove unused method from naive executor
7 years ago
Hongyu Liu
379d933ae5
Merge pull request #14036 from phlrain/add_dropout_att_new
...
Add dropout att new 1.1 merge
7 years ago
tangwei12
d8b697357f
update height_sections to int64_t
7 years ago
minqiyang
a2820b9899
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_embedding_grad
7 years ago
Xin Pan
bba0c4a9f2
delete unused codes.
...
test=develop
7 years ago
jerrywgz
de2f965c9b
test=develop
7 years ago
guosheng
cc0e23973d
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-reshape-reuse-input
...
test=develop
7 years ago
tangwei12
318ba99124
revert changes in protobuf.cc and type_defs
7 years ago
tangwei12
aa6dc82f4b
revert changes in protobuf.cc and type_defs
7 years ago
dzhwinter
09409bad4d
staged. test speed=49ms in 1080.
7 years ago
tensor-tang
64d5b4385e
fix crf decode avx512
7 years ago
tensor-tang
21487d78bf
add crf decode jit kernel
7 years ago
sneaxiy
b1fd62f39e
test=develop
7 years ago
guosheng
3cfaeac288
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-reshape-reuse-input
...
test=develop
7 years ago
sneaxiy
1af3fe8c35
test=develop
7 years ago
Xin Pan
d5d09672c8
better fix
...
test=develop
7 years ago
Qiao Longfei
de539d72da
format
...
test=develop
7 years ago
sneaxiy
5be6f762d0
remove_lock_in_some_ops
...
test=develop
7 years ago
buxingyuan
6c1d74bb47
Merge branch 'develop' into FasterOpDoc
...
test=develop
7 years ago
Xin Pan
726fd438cd
avoid blocking everyone
...
please fix offline
7 years ago
JiabinYang
7bcba47e41
test=develop
7 years ago
barrierye
a7f94ec794
add similarity_focus op
7 years ago
Tao Luo
8ab953e37c
auto insert infer_graph_clean_pass as the default first one
...
test=develop
7 years ago
Tao Luo
d70c7fb9b3
Merge branch 'develop' into remove_unused_code
7 years ago
Tao Luo
ea2bdd192d
Merge branch 'develop' into remove_unused_code
7 years ago
minqiyang
0de6811ee0
Change reserve to resize
...
test=develop
7 years ago
tangwei12
b58957d9d7
Revert "fix lookuptable in reduce strategy"
...
This reverts commit 0e722c5
7 years ago
JiabinYang
9cad409f2a
test=develop
7 years ago
tangwei12
2761eafb92
shape type to int64_t, test=develop
7 years ago
tangwei12
d4a8967c1e
add const in &, test=develop
7 years ago
minqiyang
5660d6a3ba
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_embedding_grad
7 years ago
guosheng
1f92c30565
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-reshape-reuse-input
...
test=develop
7 years ago
tensor-tang
a05fce6544
Merge remote-tracking branch 'ups/develop' into fix/jit/avx
...
test=develop
7 years ago
JiabinYang
bd064c0f44
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_reorg_op
7 years ago
tangwei12
0e25e397bd
shape type to int64_t, test=develop
7 years ago
Qiyang Min
d0fdcb2f6d
Merge pull request #14048 from velconia/change_sequence_pool_to_cpu
...
Accelerate Sequence Pool Grad Op
7 years ago
tangwei12
d1e85e33d7
shape type to int64_t, test=develop
7 years ago
Yu Yang
8310ce6007
Fix cluster memory
...
test=develop
7 years ago
tensor-tang
d24d282a7a
fix avx error
...
test=develop
7 years ago
tensor-tang
9cb8738f54
Merge pull request #14018 from tensor-tang/refine/jit/gru
...
Refine/jit/gru
7 years ago
Xin Pan
70effddfc1
fix
...
test=develop
7 years ago
Xin Pan
64e7688ade
clean more APIs
...
test=develop
7 years ago
Xin Pan
c891bc22f5
clarify Reset
...
test=develop
7 years ago
Qiao Longfei
6253b152e6
Merge branch 'optimize-sum-seq-pooling-op' of https://github.com/jacquesqiao/Paddle into optimize-sum-seq-pooling-op
7 years ago
Qiao Longfei
14f5a40898
fix unit test
7 years ago
minqiyang
447a680a2b
Add API.spec
...
test=develop
7 years ago
minqiyang
5de4619781
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_embedding_grad
7 years ago
minqiyang
0695c1fbe8
Add remind for code
...
test=develop
7 years ago
minqiyang
0c5c4c4a5b
Add blas header file
...
test=develop
7 years ago
guosheng
aac426444f
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-reshape-reuse-input
...
test=develop
7 years ago
buxingyuan
d0ccdf8fc1
follow comments
...
test=develop
7 years ago
minqiyang
e2a348cd10
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into change_sequence_pool_to_cpu
7 years ago
Qiao Longfei
f4e6fe0786
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-sum-seq-pooling-op
7 years ago
Xin Pan
4f59690b4c
clean unused codes
...
test=develop
7 years ago
Xin Pan
784a19ecd0
fix some thread-safty issue and simplify threadpool
...
test=develop
7 years ago
Wojciech Uss
be58997443
remove unused method from naive executor
...
test=develop
7 years ago
minqiyang
40141f749b
Implement the unittest for hash op
...
test=develop
7 years ago
Sylwester Fraczek
741cb33bd9
test multithreading
7 years ago
Brian Liu
a53e8a8da6
Update MKLDNN integration framework to support Paddle multi-instances
...
Make all blob info saved in global device context to be thread based.
Meanwhile save thread id in thread local storage in ParallelDo
7 years ago
minqiyang
8a0f26f45f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into continue_hash_op
7 years ago
minqiyang
d4f9aa0852
Add hash op implementation
7 years ago
dzhwinter
468467f391
update real incnet tester
7 years ago
tangwei12
39b3bf24d0
shape type to int64_t, test=develop
7 years ago
tangwei12
755927d2b0
shape type to int64_t, test=develop
7 years ago
Qiao Longfei
7357d8412e
add flags for control the thead num for pserver
7 years ago
phlrain
a4ad286e6b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_dropout_att_new
7 years ago
phlrain
469bdb9e55
modify api.spec; test=develop
7 years ago
minqiyang
1a3b38a432
Polish code
...
test=develop
7 years ago
dzhwinter
b154e0b492
clean demo_ci
7 years ago
minqiyang
133bac2b10
Accelerate embedding op grad
...
test=develop
7 years ago
Zhaolong Xing
2256fae45d
Merge pull request #13938 from NHZlX/ocr_attention_support
...
ceil pool mode support for ocr attention model.
7 years ago
dzhwinter
abe8e207c4
clean demo_ci
7 years ago
dzhwinter
597d92179b
clean demo_ci
7 years ago
phlrain
201d4f2a85
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_dropout_att_new
7 years ago
phlrain
a6e6bc45d6
modify dropout att; test=develop
7 years ago
minqiyang
2468057da6
Move code to SumSeqPoolGradFunctor
...
test=develop
7 years ago
minqiyang
9725db0d40
Fix copy wrong pos bug
...
test=develop
7 years ago
minqiyang
9c68709036
Accelerate sequence_pool functor
7 years ago
minqiyang
14ebc424d6
Add gpu support for unittest
7 years ago
jerrywgz
e906c8e5e7
Merge pull request #14022 from jerrywgz/fix_rpn_target_assign_op
...
fix random fail in rpn target assign
7 years ago
minqiyang
bd5a82e193
Polish unit test code
7 years ago
minqiyang
047fa2f9aa
Add unit-test for sequence_pooling functor
7 years ago
qingqing01
c7379a7320
Fix top_k op ( #14034 )
...
1. Fix CUDA kernel when height is large than 2048.
2. Support input with more than 2D.
3. Fix unit test when k is large than 1.
4. Enhence unit testing.
test=develop
7 years ago
sneaxiy
016bf51e3f
test=develop
7 years ago
Tao Luo
f7bbcfa913
remove unused code in paddle_inference_api.h
...
test=develop
7 years ago
JiabinYang
c056328563
test=develop
7 years ago
nhzlx
11f189bacf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_demo_ci_trt
...
test=develop
7 years ago
tangwei12
8b7f45a889
add longs in framework
7 years ago
JiabinYang
c13f1ef3c4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_reorg_op
7 years ago
tangwei12
f3729db6e0
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into Pdv
7 years ago
Xin Pan
8837669782
Merge pull request #13982 from panyx0718/fix
...
Clean up Reuse
7 years ago
dzhwinter
dbd0075b68
Merge branch 'windows/support' into lb
7 years ago
dzhwinter
c6dcffc61a
lb. add debug output
7 years ago
wanghaoshuang
78cf76a1ca
fix linux compile
7 years ago
tangwei12
770e2a1881
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into Pdv
7 years ago
chengduo
e943f4508b
add graph number check ( #14025 )
...
test=develop
7 years ago
sneaxiy
92a2817a2b
test=develop
7 years ago
JiabinYang
8e8e8e66ab
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_reorg_op
7 years ago
nhzlx
ae8f26072d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_demo_ci_trt
...
test=develop
7 years ago
phlrain
049c9c7d2a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_dropout_att_new
7 years ago
phlrain
ffb24a73ec
add dropout attr; test=develop
7 years ago
sneaxiy
8f07f60915
test=develop
7 years ago
wanghaoshuang
5993155d67
Merge remote-tracking branch 'dzhwinter/windows/support' into windows/support
7 years ago
wanghaoshuang
f9e7cfb03c
save binary file
7 years ago
tensor-tang
032c3a07e3
Merge remote-tracking branch 'ups/develop' into refine/jit/gru
...
test=develop
7 years ago
tensor-tang
159be8cc63
optimize fusion gru kernel at size 8
7 years ago
dzhwinter
607080e888
windows static library
7 years ago
Tao Luo
23da8defc8
Merge pull request #14028 from luotao1/fix_resnet50_test
...
fix typo and warning in analyzer_resnet50_test
7 years ago
Yu Yang
71c846ef8a
Revert buggy changes
...
test=develop
7 years ago
JiabinYang
ff07dc315e
test=develop
7 years ago
chengduo
a7497653d0
Refine Split op ( #13967 )
...
* speedup split_op
test=develop
* speedup split_op
test=develop
* rename ConcatGrad to Split
* refine concat and split
test=develop
* fix compile error
7 years ago
Yu Yang
dbf9f6f408
Fix distribute compile
...
test=develop
7 years ago
guosheng
3099a8f3aa
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-reshape-reuse-input
...
test=develop
7 years ago
jerrywgz
e0708e62ba
refine code
7 years ago
jerrywgz
1c591c3909
Merge branch 'develop' into fix_rpn_target_assign_op
7 years ago
sneaxiy
a9d7a9d720
test=develop
7 years ago
Tao Luo
316bc9bfc9
fix typo and warning in analyzer_resnet50_test
...
test=develop
7 years ago
guosheng
6447b69aec
Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-reshape-reuse-input
...
test=develop
7 years ago
jerrywgz
f06c6193d7
fix rpn target assign test=develop
7 years ago
Yu Yang
1d4d4e73ab
Remove place hash
...
test=develop
7 years ago
dongzhihong
563e7bca7f
"fix op. test=develop"
7 years ago
Xin Pan
4625f83f92
better handle var type inference
...
avoid the default one that usually overwrites manually set ones
test=develop
7 years ago
Xin Pan
8f2116d8fa
clean up after the changes have been stopped for so long.
...
test=develop
7 years ago
tensor-tang
83dc689877
Merge remote-tracking branch 'ups/develop' into refine/jit/gru
...
test=develop
7 years ago
tensor-tang
640e789d3d
add fusion gru jit kernel
7 years ago
JiabinYang
39d39775c3
test=develop
7 years ago
JiabinYang
70351de1b5
test=develop
7 years ago
Yu Yang
461f71a90b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
qingqing01
0e24138494
Merge pull request #13991 from qingqing01/refine_generate_proposals_op
...
Refine generate proposals op
7 years ago
gongweibao
58c027cc38
Add rpc profiler flags. ( #13989 )
...
Add rpc profiler flags
7 years ago
Xin Pan
d10e54c460
Merge pull request #14003 from chengduoZH/fix_fast_parallel_exe_bug
...
Fix test_parallel_executor_mnist.py randomly hang.
7 years ago
Tao Luo
42aa1d409d
Merge pull request #13485 from tpatejko/tpatejko/capi-resnet-conv-elementwise-fusion
...
MKLDNN conv+elementwise_add fusion for residual connections in Resnet
7 years ago
Yu Yang
9dcddf92f2
Polish best_fit_allocator
7 years ago
tensor-tang
664159ad42
Merge pull request #13998 from tensor-tang/fea/fusion_seqconv_add
...
Fea/fusion seqconv eltadd relu
7 years ago
Yu Yang
0c25da39a0
Refine auto_increment_allocator
7 years ago
Yu Yang
ab87a88200
Polish retry allocator
7 years ago
guosheng
6d3b030bb5
Refine the api of reshape to be compatible.
...
test=develop
7 years ago
chengduozh
82d2903b63
Fix fast ParallelExe bug
...
test=develop
7 years ago
Tomasz Patejko
aa35aaa1ab
MKLDNN conv + elementwise_add fusion: fixing formatting
...
test=develop
7 years ago
jerrywgz
765085d297
Merge pull request #13904 from jerrywgz/roialign
...
Add RoI align operator.
7 years ago
Dang Qingqing
56936b9e25
Refine doc for generate_proposals_op.
...
test=develop
7 years ago
Tomasz Patejko
ce2464fd98
MKLDNN conv + elementwise_add fusion: UT for missing bias added. UTs refactored. Some minor changes in the pass
7 years ago
Tomasz Patejko
4e72ab411e
MKLDNN conv + elementwise_add fusion: fix for crash when bias is not present
7 years ago
Tomasz Patejko
415b261555
MKLDNN conv + elementwise_add fusion: fusion options added
7 years ago
Tomasz Patejko
1676094697
MKLDNN conv + elementwise_add fusion: turn on residual connection pass when CAPI is used.
...
test=develop
7 years ago
Tomasz Patejko
0fe3079c46
MKLDNN conv + elementwise_add fusion: fix for order of parameters in elementwise_add in resnet50
...
test=develop
7 years ago
Tomasz Patejko
b73b868366
MKLDNN conv + elementwise_add fusion: bias in tests made persistent.
...
test=develop
7 years ago
Tomasz Patejko
a1fa203287
MKLDNN conv + elementwise_add fusion: name of the pass reused with name_scope_
7 years ago
Tomasz Patejko
2c43419db1
MKLDNN conv + elementwise_add fusion: comment explaining CorrectGraphEdges added
7 years ago
Tomasz Patejko
8fb29b2ca9
MKLDNN conv + elementwise_add fusion: new nodes marked as input or output
...
test=develop
7 years ago
Tomasz Patejko
cc1c8e37c1
MKLDNN conv + elementwise_add fusion: attributes in new conv op copied from old op
7 years ago
Tomasz Patejko
a27a8c5da8
MKLDNN conv + elementwise_add fusion: bias in test marked as persistable
7 years ago
Tomasz Patejko
af8c71317c
MKLDNN conv + elementwise_add fusion: CorrectGraphEdges refactored
7 years ago
Tomasz Patejko
3e033087f1
MKLDNN conv + elementwise_add fusion: LinkNodes function removed and
...
macro used.
test=develop
7 years ago
Tomasz Patejko
4be45af1cc
MKLDNN conv + elementwise_add fusion: skip connection attribute renamed. Comments about patterns added.
...
test=develop
7 years ago
Tomasz Patejko
9a335e0277
MKLDNN conv + elementwise_add fusion: changed a name of a formal argument in ElementwiseAdd pattern
7 years ago
Tomasz Patejko
fb7a50b230
MKLDNN conv + elementwise_add fusion: removed commented code. Internal functions marked as static.
...
test=develop
7 years ago
Michal Gallus
f688197182
MKLDNN conv + elementwise_add fusion: Fix output_data to point to the right tensor, also fix transpiler integration
7 years ago
Tomasz Patejko
efd76614fb
MKLDNN conv + elementwise_add fusion: implementation changed to conform with Paddle API
7 years ago
Tomasz Patejko
347bf90412
MKLDNN conv + elementwise_add fusion: bias is also handled
7 years ago
Tomasz Patejko
bf95ac36a7
MKLDNN conv + elementwise_add fusion: further reformatting
7 years ago
Tomasz Patejko
cbe122ae2e
MKLDNN conv + elementwise_add fusion: correcting formatting
7 years ago
Tomasz Patejko
2a251bbf27
MKLDNN conv + elementwise_add fusion: some refactoring: consts, function calls instead of constant values
7 years ago
Tomasz Patejko
b8e54ab5cc
MKLDNN conv + elementwise_add fusion: parameter name changed to ResidualData
7 years ago
Tomasz Patejko
27573ece03
MKLDNN conv + elementwise_add fusion: trailing spaces removed
7 years ago
Tomasz Patejko
7f5c8a95e8
MKLDNN conv + elementwise_add fusion: arguments are replaced for many parameters in operator
7 years ago
Tomasz Patejko
5996bd39e8
MKLDNN conv + elementwise_add fusion: graph is corrected based on actual argument name, not formal argument name
7 years ago
Tomasz Patejko
41f3d78fdf
MKLDNN conv + elementwise_add fusion: output and elemwise param share data in conv primitive. Output is properly allocated
7 years ago
Tomasz Patejko
07a62ddc08
MKLDNN conv + elementwise_add fusion: inputs in pass modified. Support for new conv parameter. UTs corrected
7 years ago
Tomasz Patejko
56528531ea
MKLDNN conv + elementwis_add fusion: initial work on passing eltwise data to conv primitive
7 years ago
Tomasz Patejko
42f569fdfd
MKLDNN conv + elementwise_add fusion: use_mkldnn attribute added
7 years ago
Tomasz Patejko
441d3a4726
MKLDNN conv + elementwise_add: added some refactoring in the pass
7 years ago
Tomasz Patejko
38b7b34b1c
MKLDNN conv + elementwise_add fusion: added reachability tests, inputs and outputs in graph nodes are transformed
7 years ago
Tomasz Patejko
16eaaf3fbe
MKLDNN conv + elementwise_add fusion: added one more UT, found and corrected bugs in pass
7 years ago
Tomasz Patejko
604bad08bc
MKLDNN conv + elementwise_add fusion: implementation of patterns refarctored, applied to graph. UTs added
7 years ago
Tomasz Patejko
9ce343f868
MKLDNN conv + elementwise_add fusion: initial implementation of patterns
7 years ago
tensor-tang
40f8456a4f
refine fuse pattern and attr
...
test=develop
7 years ago
tensor-tang
cbbacb2534
Merge remote-tracking branch 'ups/develop' into fea/fusion_seqconv_add
...
test=develop
7 years ago
tensor-tang
603ba5e01d
add seqconv eltadd relu pass
7 years ago
Dang Qingqing
4801ee8f97
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine_generate_proposals_op
7 years ago
Tao Luo
da722d6d9b
Merge pull request #13858 from Sand3r-/mgallus/conv-bias-pass
...
[MKLDNN] Fuse Conv + Bias using Pass
7 years ago
Tao Luo
a4b48f70c1
Merge pull request #13997 from wojtuss/wojtuss/do-not-enable-mkldnn-twice
...
do not enable MKL-DNN twice
7 years ago
Tao Luo
252401c5d3
Merge pull request #13992 from wojtuss/wojtuss/add-ifdef-mkldnn
...
Add ifdef guard for MKL-DNN placement pass
7 years ago
Michał Gallus
f9ca31811d
Remove use mkldnn from config in resnet50 test
...
test=develop
7 years ago
tensor-tang
23fc896bc2
Merge remote-tracking branch 'ups/develop' into fea/fusion_seqconv_add
...
test=develop
7 years ago
tensor-tang
339e655aec
refine and add seqconv elementwiseadd relu op test
7 years ago
sneaxiy
2002e71da8
fix pinned allocator
7 years ago
Michal Gallus
c504a5a1b7
Adjust Conv+bias to placement pass
...
test=develop
7 years ago
Michal Gallus
d7509d63f1
Conv+Bias: Support non-null bias
...
test=develop
7 years ago
Michal Gallus
91e8fbac2f
Enable MKLDNN in Resnet50Tester
...
test=develop
7 years ago
Michal Gallus
582f59c190
Conv+Bias fuse
7 years ago
jerrywgz
a1d3db031b
Merge pull request #13844 from jerrywgz/fix_roi_pool
...
fix roi pool register
7 years ago
guosheng
dfb841ad5a
Make reshape_op reuse input.
...
test=develop
7 years ago
Dang Qingqing
8e0b9496de
Fix unit test
...
test=develop
7 years ago
Wojciech Uss
e6f480ec44
add comment on the default first pass
7 years ago
Wojciech Uss
2cf258e381
remove redundant pass list
7 years ago
Wojciech Uss
5632019f0f
add MKL-DNN placement pass
...
This patch also refactors conv+bn (includes changes from PR
https://github.com/PaddlePaddle/Paddle/pull/13926 )
updated to use the mkldnn-placement-pass.
test=develop
7 years ago
tensor-tang
0a9f5f1790
Merge pull request #13968 from tensor-tang/fix/jit/exp
...
Fix jit exp
7 years ago
Wojciech Uss
5083ec3a1b
do not enable MKL-DNN twice
...
After the MKL-DNN placement pass there is no need to enable MKL-DNN
in operators via executor
test=develop
7 years ago
Yipeng
fcb2e8103e
Ocr end2end dev ( #13889 )
...
* add detect and end2end code
* update the scale for coodinates restore
* fix merge bug with dev.
* fix merge bug with dev.
* test=develop
* fix code style test=develop
* fix code style test=develop
* test=develop
* test=develop
* test=develop
7 years ago
tensor-tang
e5ce965952
refine and add eltadd_relu unit test
7 years ago
sneaxiy
5a38930660
test=develop
7 years ago
Wojciech Uss
c3b70aece9
Add MKL-DNN placement pass ( #13958 )
...
* add MKL-DNN placement pass
This patch also refactors conv+bn (includes changes from PR
https://github.com/PaddlePaddle/Paddle/pull/13926 )
updated to use the mkldnn-placement-pass.
test=develop
* remove redundant pass list
* add comment on the default first pass
* fix test for conv+relu mkldnn fuse
7 years ago
Wojciech Uss
4a368a4901
add ifdef guard for MKL-DNN placement pass
...
test=develop
7 years ago
Xin Pan
909e1341bd
Merge pull request #13966 from panyx0718/fix4
...
set default timeout to avoiding blocking CI
7 years ago
chengduo
9775e50ca2
Fix add doc for bias_attr ( #13937 )
...
* fix conv doc
test=develop
* fix seq_conv doc
test=develop
* fix simple_img_conv_pool
test=develop
* update API.spec
* update parameter doc
test=develop
* follow comment
test=develop
* fix other layer
test=develop
* fix lstm bias_attr doc
test=develop
7 years ago
Tao Luo
7b11162ab5
Merge pull request #13949 from PaddlePaddle/wojtuss/unique-patterns-request-comment
...
Added comment with request for enhancement
7 years ago
tensor-tang
7cb19a5976
fuse elementwise_add and relu
7 years ago
tensor-tang
3c249283af
init seqconv eltadd relu op
7 years ago
tangwei12
48982e9dc7
fix lookuptable in reduce strategy
7 years ago
Xin Pan
9a819265eb
fix
...
test=develop
7 years ago
sneaxiy
ac2eba4457
test=develop
7 years ago
Tao Luo
305034f5b3
Merge pull request #13909 from luotao1/mkldnn_test
...
refine mkldnn test in analyzer_tests
7 years ago
superjomn
b77e4f4978
update
...
test=develop
7 years ago
jerrywgz
553342624e
test=develop
7 years ago
jerrywgz
9a14ca91b8
test=develop
7 years ago
tensor-tang
60ff05e312
Merge branch 'luotao1-fix_rnn2_test' into fix/jit/exp
...
test=develop
7 years ago
Tao Luo
ef09862450
fix analyzer_rnn2_test
...
test=develop
7 years ago
tangwei12
0e722c5ea2
fix lookuptable in reduce strategy
7 years ago
Tao Luo
e5b4643ad8
add profile_mkldnn test
...
test=develop
7 years ago
Tao Luo
7d680be5a3
Merge branch 'develop' into mkldnn_test
7 years ago
buxingyuan
0bb3b099c2
generate_proposal_labels doc
7 years ago
Wojciech Uss
55fd136ab0
Added comment with request for enhancement
...
This adds a `TODO` comment according to https://github.com/PaddlePaddle/Paddle/issues/13550#issuecomment-430133585
test=develop
7 years ago
gongweibao
a831ecc75d
Add grpc error context. ( #13957 )
...
Add grpc error context
7 years ago
tensor-tang
b139b687de
Merge remote-tracking branch 'ups/develop' into fix/jit/exp
...
test=develop
7 years ago
qingqing01
67a2b5215d
Add affine channel op to speed and save memory for faster-rcnn model. ( #13919 )
...
* Add affine channel op.
* Update code and add Python API.
test=develop
* Update API.spec
test=develop
7 years ago
tensor-tang
748435586a
clean code exp avx
7 years ago
tensor-tang
b4751a34a5
fix illegal instruction of rnn2
7 years ago
sneaxiy
21fdf8e87d
add unittest for allocator_facade.cc
7 years ago
Xin Pan
6de08b5eef
set default timeout to avoiding blocking CI
...
test=develop
7 years ago
tensor-tang
30dfbdee7f
Merge pull request #13951 from tensor-tang/fix/warning
...
fix warning and mac compile
7 years ago
Tao Luo
34ed7d1379
Merge pull request #13924 from luotao1/clean_inference_lib
...
Clean inference lib
7 years ago
tensor-tang
36588b3365
fix illegal instruction of rnn1 and text
7 years ago
Tao Luo
6a4e9230ed
Merge branch 'develop' into mkldnn_test
7 years ago
gongweibao
078223b3e3
Add rpc timeline. ( #13900 )
...
Add rpc timeline
7 years ago
dzhwinter
29382db625
Merge pull request #13874 from dzhwinter/fix/momentum
...
add sparse update momentum. test=develop
7 years ago
Xin Pan
6a54c3de1f
Merge pull request #13928 from panyx0718/doc
...
remove unused codes
7 years ago
qingqing01
5dbb2e9986
Small changes for sum_op to avoid zero setting. ( #13923 )
7 years ago
Tao Luo
b819684370
add compare_mkldnn test
...
test=develop
7 years ago
Tao Luo
e47f4186ae
fix some compiler warning
7 years ago
nhzlx
b970c6d5d0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_demo_ci_trt
...
test=develop
7 years ago
nhzlx
32072d31b5
fix demo ci error on manylinux
7 years ago
Jiabin Yang
6553956bd6
Merge pull request #13931 from JiabinYang/fix_dist_on_mac
...
remove proxy before unit test to avoid error under distribution
7 years ago
dzhwinter
00e8791f66
fix compile in cpu error. test=develop
7 years ago
tensor-tang
e69328c3bc
fix warning and mac compile
...
test=develop
7 years ago
Tao Luo
6ea9d1b595
add analysis_predictor in vis_demo
...
test=develop
7 years ago
Tao Luo
f444a7226e
Merge branch 'develop' into clean_inference_lib
7 years ago
Tao Luo
3598500773
Merge pull request #13867 from Superjomn/clean/CreatePaddlePredictor
...
clean CreatePaddlePredictor
7 years ago
Tao Luo
41eeb771e8
Merge branch 'develop' into clean_inference_lib
7 years ago
sneaxiy
3419d04c3f
test=develop
7 years ago
dzhwinter
d239cf2e15
use binary search. test=develop
7 years ago
dzhwinter
a9f5f822e6
use binary search. test=develop
7 years ago
Tao Luo
b854d959a5
update with comments
7 years ago
Qiao Longfei
cc7f551441
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pyreader
...
test=develop
7 years ago
nhzlx
2b5edfbc37
Add ceil model pooling for trt (ocr attention)
...
test=develop
7 years ago
Tao Luo
75bb0babef
Merge branch 'develop' into mkldnn_test
7 years ago
tensor-tang
6447155dac
Merge pull request #13851 from tensor-tang/fea/jitkernel_peephole
...
Fea jitkernel lstm peephole
7 years ago
sneaxiy
4b4af84e67
test=develop
7 years ago
jerrywgz
4c9884e713
refine unittest test=develop
7 years ago
JiabinYang
02f863400e
test=develop
7 years ago
Qiao Longfei
0225957515
change elementwise_add to elementwise_add_to test=develop
7 years ago
Qiao Longfei
bd2b6d7f8f
sum_op support inplace
7 years ago
Xin Pan
7fb5b66ac2
Merge pull request #13916 from panyx0718/fix2
...
Make Var::GetMutable robust
7 years ago
Yan Chunwei
6809238d97
fix analysis predictor profile ( #13896 )
7 years ago
Xin Pan
abbfb60ca9
remove unused codes
...
test=develop
7 years ago
Yibing Liu
6b795d424c
Merge pull request #13901 from kuke/seq_slice_py
...
Add py api for sequence_slice_op
7 years ago
nhzlx
9d98ca0424
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_googlenet_bug_with_rule
...
test=develop
7 years ago
nhzlx
849a6874ad
fix googlenet bug with relu
7 years ago
dzhwinter
3861269594
merge develop branch
7 years ago
jerrywgz
98c3294b85
Merge branch 'roialign' of https://github.com/jerrywgz/Paddle into roialign
7 years ago
Tao Luo
a35e7f4bae
adjust demo_ci with fluid_inference_install_dir
...
test=develop
7 years ago
tangwei12
fa2ab3346c
fill constant add infervarshape, lookuptable clone lr var ( #13830 )
...
* fill constant add infervarshape, lookuptable clone lr var
* test=develop
* add lookuptable ut, test=develop
* bug fix in transpliler about async with lookup table
* test=develop
7 years ago
jerrywgz
8c79071d6a
roi_align for gpu
7 years ago
Xin Pan
342e436158
Make Var::GetMutable robust
...
test=develop
7 years ago
Yan Chunwei
7a751b83ac
fix isfinite_op sprintf ( #13850 )
...
test=develop
7 years ago
Qiyang Min
e3a64fca44
Merge pull request #13835 from velconia/fix_reshape_op
...
Fix Reshape op when input is the same with output
7 years ago
Yibing Liu
46b0b7903c
Merge pull request #13856 from kuke/seq_unpad_op
...
Add sequence unpad op
7 years ago
tensor-tang
dcfb687584
Merge pull request #13846 from tensor-tang/fix/mkldnn
...
fix default number of threads when inference with or without MKLDNN
7 years ago
Qiao Longfei
b4a32eafdf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-sum-seq-pooling-op
...
test=develop
7 years ago
Tao Luo
bd77460182
refine mkldnn test in analyzer_tests
...
test=develop
7 years ago
jerrywgz
c9d2046f76
roi_align for gpu
7 years ago
jerrywgz
2f5a80174e
add roi_align api
7 years ago
dzhwinter
e41a3fcd68
fix update to develop hang problem.
7 years ago
Zeng Jinle
af91d41ab8
Merge pull request #13852 from sneaxiy/feature/eager_delete_tensor
...
Fix bug of eager deletion to support if_else_op
7 years ago
Zeng Jinle
93606c2c2c
Merge pull request #13689 from sneaxiy/sparse_rmsprop
...
Fix sparse rmsprop
7 years ago
Qiao Longfei
681226e97c
Merge pull request #13864 from jacquesqiao/py-reader-add-test-mode
...
reader block queue add test mode
7 years ago
jerrywgz
90f39b1123
Merge branch 'roialign' of https://github.com/jerrywgz/Paddle into roialign
7 years ago
Tao Luo
f8874b3cb2
Merge pull request #13884 from luotao1/rename_inference_lib_dist
...
rename inference_lib_dist to fluid_lib_dist
7 years ago
Xin Pan
288a112ffd
Revert "Revert "Revert "Make variable::GetMutable robust"""
7 years ago
sneaxiy
5cedfb60c8
test=develop
7 years ago
Yibing Liu
b785798585
Expose layer's name for sequence pad & unpad
...
test=develop
7 years ago
Yibing Liu
18e1c1e07d
Update API spec for seq slice
...
test=develop
7 years ago
jerrywgz
5e52dafda5
add roi align
7 years ago
jerrywgz
c0e34eebec
add roi align
7 years ago
Tao Luo
c26f2b21eb
Merge pull request #13813 from sfraczek/sfraczek/conv-bn-fuse-pass-full-eigen
...
rewrite conv_bn fuse pass fully to eigen
7 years ago
sneaxiy
64d94596ab
fix allocator_facade bug
7 years ago
Yibing Liu
16b2c6dc78
Add py api for sequence_slice_op
...
test=develop
7 years ago
superjomn
1cfd2b51a7
update
...
test=develop
7 years ago
Xin Pan
fededdda20
Merge pull request #13872 from panyx0718/fix2
...
Revert "Revert "Make variable::GetMutable robust""
7 years ago
dzhwinter
804dd7da04
merge conflict. both linux and windows pass.
7 years ago
dzhwinter
962061f0a3
windows fix
7 years ago
wanghaoshuang
b12f7c2399
compile in linux.
7 years ago
Qiao Longfei
b16e9cd105
a small fix for compile WITH_INFERENCE=OFF ( #13869 )
...
test=develop
7 years ago
Qiao Longfei
ec25a09bd5
revert unused change test=develop
7 years ago
Qiao Longfei
60030e8678
change the use of FLAGS_reader_queue_speed_test_mode
...
test=develop
7 years ago
Tao Luo
323d67cfc1
Merge pull request #13879 from panyx0718/doc
...
fix doc
7 years ago
Qiao Longfei
936926aadd
code optimize
...
test=develop
7 years ago
Sylwester Fraczek
50c5e9b0c6
reshape_2d used from ddim.h
...
test=develop
7 years ago
Qiyang Min
cab29828a5
Merge pull request #13829 from velconia/accelerate_sequence_pool_op
...
Accelerate SequencePool Op on SUM mode of CPU
7 years ago
minqiyang
aeec82acd5
Add unittest for reshape op
...
test=develop
7 years ago
Qiao Longfei
9fd78df71c
revert unused change
7 years ago
Xin Pan
ddb76d0d09
Make GetMutable more robust
...
test=develop
7 years ago
Qiao Longfei
c52ccbc109
clean code
7 years ago
Qiao Longfei
6056d04361
optimize blas call
7 years ago
Qiyang Min
c2842377ce
Merge pull request #13837 from velconia/add_pyramid_dnn_support
...
Add margin_rank_loss_op to python
7 years ago
Qiao Longfei
5db7551317
optimize code
7 years ago
minqiyang
24c9fbdba3
Polish code
...
test=develop
7 years ago
chengduo
2c9839c847
add cuda version display ( #13885 )
...
test=develop
7 years ago
sneaxiy
d3ed070e10
test=develop
7 years ago
minqiyang
d9b202e717
Move tensor copy src_ptr and dst_ptr check to TensorCopy function
...
test=develop
7 years ago
sneaxiy
fb6201e93e
test=develop
7 years ago
chengduo
8e2fdc54b1
Add check for opt op ( #13840 )
...
* add check for opt op
* fix opt op
test=develop
* fix test fail
test=develop
* fix optimization doc
test=develop
* test=develop
7 years ago
Qiao Longfei
eb6d9e3bbe
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-sum-seq-pooling-op
7 years ago
Yibing Liu
46e61d81a7
Wrapper py api for sequence_unpad
...
test=develop
7 years ago
Qiao Longfei
0170d36c42
fix a bug
7 years ago
superjomn
28459592cc
update
...
test=develop
7 years ago
Qiyang Min
e37c9e6732
Merge pull request #13828 from velconia/accelerate_selected_rows_functor
...
Accelerate SelectedRows Functors:
7 years ago
Qiao Longfei
86e2e686ee
fix bug
7 years ago
Qiao Longfei
333fd15204
add gpu test for mrege add
7 years ago
Tao Luo
3d976f3f18
rename inference_lib_dist to fluid_lib_dist
...
test=develop
7 years ago
Qiao Longfei
ab3e36da80
update MergeAdd for selected_rows_functor.cu
7 years ago
sneaxiy
84d9300365
test=develop
7 years ago
chengduo
2562eb92b8
Add strategy doc ( #13849 )
...
* add strategy doc
test=develop
* fix doc
test=develop
* add ParallelExecutor arg doc
test=develop
7 years ago
Xin Pan
b11372a0af
fix doc
...
test=develop
7 years ago
Zhaolong Xing
7413fa458f
Merge pull request #13838 from NHZlX/add_trt_pad_op
...
Add trt pad op converter
7 years ago
superjomn
e2bd40ca82
update
...
test=develop
7 years ago
Qiao Longfei
d5c64af24f
change map to unordered_map
7 years ago
Qiao Longfei
005f1923a2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-sum-seq-pooling-op
7 years ago
wanghaoshuang
3ae9645084
compile in linux
7 years ago
dzhwinter
8329a1f139
add sparse update momentum. test=develop
7 years ago
superjomn
049fcbe125
update
...
test=develop
7 years ago
Qiao Longfei
ea97e83e41
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-pyreader
7 years ago
superjomn
f5c0221c17
clean CreatePaddlePredictor
...
test=develop
7 years ago
Xin Pan
d852be7c48
Revert "Make variable::GetMutable robust"
7 years ago
Qiao Longfei
c61e16b181
add reader_queue_speed_test_mode_flag test
7 years ago
Qiao Longfei
8686f7c68e
add reader_queue_speed_test_mode flag for speed test
7 years ago
Xin Pan
ae8b1c32bf
Merge pull request #13821 from panyx0718/fix
...
Make variable::GetMutable robust
7 years ago
Xin Pan
9b3e0df375
Merge pull request #13819 from panyx0718/doc
...
Explain LoD and a few other concepts
7 years ago
Zhaolong Xing
44f37d019a
Merge pull request #13848 from NHZlX/fix_demo_ci_with_trt
...
fix demo_ci bug in vis_demo.cc
7 years ago
Sylwester Fraczek
55d6950a1a
rewrite conv_bn fuse pass to eigen
...
test=develop
7 years ago
chengduo
9c77b65c06
Fix layers.uniform_random ( #13823 )
...
* fix layers.uniform_random
* fix uniform_random
test=develop
* remove var type set
test=develop
* fix similar error
test=develop
7 years ago
nhzlx
320c78e16f
fix commets
...
test=develop
7 years ago
Yibing Liu
cbe4292516
Add sequence unpad op
...
test=develop
7 years ago
tensor-tang
bcb8ea397d
Merge remote-tracking branch 'ups/develop' into fea/jitkernel_peephole
...
test=develop
7 years ago
tensor-tang
8e182170ba
refine and replace lstm peephole kernel
7 years ago
nhzlx
efa5bac7ad
fix demo_ci bug in vis_demo.cc
...
test=develop
7 years ago
tensor-tang
dc5a7b906d
fix default number of threads when inference with or without MKLDNN
...
test=develop
7 years ago
Xin Pan
228506618b
Avoid GetMutable implicitly reset Var Type.
...
This can cause a lot of problem:
1. Wrong operator implementation, Op can get a wrong type without failure.
2. Anytype can be Get without defined in VarType.
Also fix wrong STEP_SCOPE usage.
test=develop
7 years ago
jerrywgz
3c963336e4
fix roi pool register
7 years ago
Dun
5f2e837847
optimize depthwise conv by register memory ( #13778 )
...
* optimize depthwise conv by register memory
* test=develop
7 years ago
sneaxiy
d87569134c
test=develop
7 years ago
minqiyang
3f6ec90060
Polish code
...
test=develop
7 years ago
minqiyang
9878eedbaa
Change API.spec
...
test=develop
7 years ago
Qiao Longfei
5428cb9908
Profiler support merge data of all thread ( #13811 )
...
* profiler infor merge thread statistic information
* update profiler
* fix bug
* add merge thread msg to report
* optimize report
* statistic the time of ops in each thread but not all
* optimize report format
* optimize profile report
* optimize profile report
test=develop
7 years ago
nhzlx
bf7a2789a0
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_trt_pad_op
...
test=develop
7 years ago
nhzlx
0cb88c34be
add op converter
7 years ago