nhzlx
c0c9fcd9c7
add source file
...
test=develop
6 years ago
dongdaxiang
4c0a769d1d
avoid clock time in WIN32 mode
...
test=develop
6 years ago
dongdaxiang
66522046ad
remove clock time in WIN32 mode
...
test=develop
6 years ago
dongdaxiang
f2b92d77b5
remove clock time in WIN32 mode
6 years ago
nhzlx
4e4a777243
add conv+elementwiseadd pass
...
test=develop
6 years ago
gongweibao
0b1c7d838c
Add brpc serialization support. ( #11430 )
6 years ago
Yan Chunwei
a985949be9
Fea/fuse conv elementwise add fuse ( #14669 )
6 years ago
Yancey1989
4a4ccac1d0
update by comment test=develop
6 years ago
Yu Yang
04a570b463
Fix ut
...
test=develop
6 years ago
heqiaozhi
09d669ba40
fix static_cast to const_cast
6 years ago
peizhilin
23dec78772
fix script issue
...
test=develop
6 years ago
heqiaozhi
bd1c1724aa
add ps_instance doc
6 years ago
Yu Yang
62b5772ffb
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
6 years ago
Qiao Longfei
fac8702269
adam support multithread
6 years ago
Wang Guibao
e213050223
Fix multi-threading bug with WItH_MKL=ON ( #14882 )
...
fixes #14884
6 years ago
Yancey1989
4f304eaa6f
fix unittest test=develop
6 years ago
dongdaxiang
d839bd0dd4
simple commit
6 years ago
minqiyang
3bb9b0cac2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into accelerate_ddpg
...
test=develop
6 years ago
minqiyang
a81495d6f4
Fix code
6 years ago
minqiyang
ad6ae0b071
1. Add SpinLock
...
2. Seperate the lock of kids and vars in Scope
test=develop
6 years ago
Yancey1989
c722b1dcb6
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
...
test=develop
6 years ago
Yu Yang
4ecdb6f486
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
...
test=develop
6 years ago
sneaxiy
f6741df462
merge develop
...
fix bug
test=develop
6 years ago
Zeng Jinle
1b564bc49a
Merge pull request #14670 from sneaxiy/refactor_eager_deletion
...
Rewrite eager deletion
6 years ago
dongdaxiang
c9b799896e
fix tag in async_executor
6 years ago
heqiaozhi
f81957a753
refine cmake for pslib & pre_define
6 years ago
sneaxiy
2328bee1cc
fix Windows compile bug
...
test=develop
6 years ago
Yancey1989
23eb8c4299
fix ci test=develop
6 years ago
heqiaozhi
2912d5311b
fix code style bug & change pslib.cmake & change Cmakelist adapt pslib
6 years ago
dongdaxiang
c59cdf3a24
refine executor_thread_worker.h and executor_thread_worker.cc code style
6 years ago
heqiaozhi
3c01cdeff0
refine executor_thread_worker.cc & executor_thread_worker.h code style
6 years ago
dongdaxiang
c71279bc69
refine code style for async_executor.h and async_executor.cc
6 years ago
Yu Yang
162637b64a
Fix ngraph compile
...
test=develop
6 years ago
dongdaxiang
33ee5cad61
format code style of executor_thread_worker.cc
6 years ago
Yu Yang
194e66f785
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/tensor_type
6 years ago
heqiaozhi
7bd16e3afa
fix some bug & add log
6 years ago
Xin Pan
30aad88449
Merge pull request #14839 from panyx0718/infer3
...
fix clang
6 years ago
sneaxiy
db2daefe50
merge develop
...
test=develop
6 years ago
Yancey1989
106e285236
add unittest for parllelgraph mode test=develop
6 years ago
minqiyang
a61eb543f5
Add RWLock to Scope
6 years ago
sneaxiy
e82772f425
fix cmake conflict
...
test=develop
6 years ago
sneaxiy
0c554a59fa
merge develop
...
test=develop
6 years ago
minqiyang
1b61021cb3
Polish code
6 years ago
minqiyang
b75bd29c3a
Remove debug info
6 years ago
minqiyang
7a43e51703
Add gperf tools
6 years ago
Yu Yang
c00e07cda0
Fix distribute compile
...
test=develop
6 years ago
tensor-tang
d846fb03a8
Merge pull request #14807 from baojun-nervana/ngraph_follow_up
...
Follow ups on nGraph PR comments
6 years ago
sneaxiy
ca84c2ca8f
merge develop
...
test=develop
6 years ago
sneaxiy
e240ba2918
implement backward
...
test=develop
6 years ago
sneaxiy
06f8aa5b97
remove while_op support temporarily
...
test=develop
6 years ago
Yu Yang
81520a24cf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_eigen_tensor
6 years ago
Yu Yang
9bd70a1e04
Change tensor uses proto::VarType::type
...
test=develop
6 years ago
Yu Yang
8175983ef9
Merge pull request #14814 from reyoung/feature/gprof
...
Add gperftools supports for PE
6 years ago
heqiaozhi
10ed9e0a6e
download & run & instance
6 years ago
Yancey1989
5cc83f79bf
update by comment
6 years ago
Yancey1989
82726402be
exception safe
6 years ago
minqiyang
570338699b
Add debug info
6 years ago
Yu Yang
9f8d86858e
Revert data_type
...
test=develop
6 years ago
sneaxiy
7923042365
merge develop
...
test=develop
6 years ago
Tao Luo
3243b45bb2
Merge pull request #14651 from luotao1/has_attr
...
add Opdesc's HasProtoAttr
6 years ago
Xin Pan
1735022a1b
fix clang
...
test=develop
6 years ago
heqiaozhi
016a068775
stop server
6 years ago
Tao Luo
067ed70f2d
add HasProtoAttr function in op_desc.h, clean node.h
...
test=develop
6 years ago
heqiaozhi
8e3fe2d735
add skip op
6 years ago
Yancey1989
79082c9459
fix pyreader failed
6 years ago
minqiyang
202b2f1fa7
Move the beta pow scale calculation into Adam Op
6 years ago
Yu Yang
b22d638d8f
Speed up SizeOfType
...
test=develop
6 years ago
Xin Pan
4a716a70c6
Merge pull request #14811 from panyx0718/revert-14782-revert-14398-imperative
...
Imperative
6 years ago
Brian Liu
9623b45f40
Remove unnecessary MKLDNN reorder ( #14799 )
...
When data flow from a MKLDNN OP kernel to a non-MKLDNN OP kernel,
data layout transform (via MKLDNN reorder) will occur even when
those two OP kernels share same layout. Add code to remove this
unnecessary reorder.
test=develop
6 years ago
Yancey1989
2dda19f756
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into parallel_graph_mode
6 years ago
Tao Luo
e99597d35c
Merge branch 'develop' into luotao1-has_attr
6 years ago
Yu Yang
f0c0bf328d
Add gperftools supports for PE
6 years ago
Xin Pan
748549b2e3
Revert "Merge pull request #14798 from PaddlePaddle/revert-14786-revert-14782-revert-14398-imperative"
...
This reverts commit b1d3a1c8b4
, reversing
changes made to f1fb64b17f
.
6 years ago
bingyanghuang
943ad4781f
One possible solution to add flexibility for mkldnn placement pass ( #14768 )
...
* Choose to turn on use_mkldnn attribute v1
* Fix mkldnn_op empty bug
* format change test=develop
* fix ci test=develop
* fix ci test and add test in dam test=develop
* add example to dam compare test test=develop
* review changes test=develop
6 years ago
baojun-nervana
fddbd87c0a
Rename argument
...
test=develop
6 years ago
baojun-nervana
22ac2133e4
Rename class
...
test=develop
6 years ago
baojun-nervana
bfde5e10ce
Move ngraph compile control to cmake
...
test=develop
6 years ago
sneaxiy
2c6159a151
fix unittest
...
fix cmake
test=develop
6 years ago
Xin Pan
c049fa7cf7
Revert "Revert "Revert "Imperative"""
7 years ago
gongweibao
f1fb64b17f
Add reduce sparse tensor feature. ( #14757 )
7 years ago
sneaxiy
eb8252466b
polish code
...
add unittest model containing while_op
remove unnecessary codes
test=develop
7 years ago
Tao Luo
c83d5b7a16
Merge pull request #14709 from yihuaxu/develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
...
Implement the fusion of convolution 3D and bias for mkldnn
7 years ago
Yancey1989
47740ace28
fix performance
7 years ago
Yancey1989
220db4f334
clean code
7 years ago
Yancey1989
cb8a24be14
clean code
7 years ago
heqiaozhi
575ae7c6c3
refine pslib inferface & fix some bugs
7 years ago
Yihua Xu
3821fc3950
Merge branch 'develop' into develop_4f71a6ee2_conv3d_bias_fusion_mkldnn_impl
...
test=develop
7 years ago
Yihua Xu
240d974ac5
Clean Code
...
test=develop
7 years ago
Xin Pan
2538ef64f1
Revert "Revert "Imperative""
7 years ago
guru4elephant
b82a44ea85
Merge pull request #14778 from wangguibao/async_executor_bugfix
...
Async executor bugfix: Tensor changed to LoDTensor
7 years ago
sneaxiy
8095fb5e68
fix code bug in CPU compilation
...
test=develop
7 years ago
sneaxiy
387bac46b5
refine code
...
test=develop
7 years ago
Tao Luo
cf66133857
Merge pull request #14734 from luotao1/memory_load
...
support loading from memory
7 years ago
Xin Pan
6217f42ab7
Revert "Imperative"
7 years ago
Tao Luo
743cb840f1
update with comments
...
test=develop
7 years ago
Yancey1989
c9de6f1b05
init parallel graph mode
7 years ago
wangguibao
5a2cd4505b
AsyncExecutor bugfix: Tensor to LoDTensor
...
test=develop
7 years ago
wangguibao
5f98d80039
AsyncExecutor bugfix: Tensor change to LoDTensor
7 years ago
Tao Luo
923b18877e
Merge branch 'develop' into memory_load
...
test=develop
7 years ago
Tao Luo
405b2486db
support loading from memory
...
test=develop
7 years ago
Xin Pan
b52f5d2870
Merge pull request #14398 from panyx0718/imperative
...
Imperative
7 years ago
heqiaozhi
d3ca359e44
config init & adapt to interface
7 years ago
Xin Pan
73b4d1aa72
Merge pull request #14742 from panyx0718/infer2
...
support customized kernel selection
7 years ago
Xin Pan
82d68281c0
follow comments
...
test=develop
7 years ago
Xin Pan
41c28d54c6
allow customize kernel selection
...
test=develop
7 years ago
Xin Pan
439af8d50a
Merge pull request #14717 from panyx0718/infer
...
fix a const_cast and avoid using stale program.
7 years ago
sneaxiy
d0c8b9b9b3
remove timeout unittest
...
test=develop
7 years ago
heqiaozhi
419506f510
refine for compile pslib.so
7 years ago
Wu Yi
29d9fb53fc
[Feature] multi process multi gpu dist training, boost v100 performance by 20% ( #14661 )
...
* wip multi process multi gpu dist training
* workable for p2p
* update test=develop
* change back env name test=develop
* fix alloc init
* fix cpu build test=devlop
* fix mac tests test=develop
* refine code
* refine test=develop
7 years ago
sneaxiy
e694d0c2e4
fix while_op eager deletion bug
...
add unittest
test=develop
7 years ago
Xin Pan
dc458b1482
Merge pull request #14713 from panyx0718/api
...
add more files to protected file list
7 years ago
heqiaozhi
2301abc481
cc libaray add pslib
7 years ago
sneaxiy
35a2578426
fix bug
...
test=develop
7 years ago
zhang wenhui
abbe382e1e
Revert "Add EstiminateFlops"
7 years ago
Xin Pan
0591ba96ec
fix hack
...
test=develop
7 years ago
sneaxiy
64ad051b9a
merge develop
...
test=develop
7 years ago
sneaxiy
c47c451a00
fix bug
7 years ago
heqiaozhi
a77fa67bbd
async_thread_trainer & libmct & pslib.cmake
7 years ago
Tao Luo
3437e17713
Merge branch 'has_attr' of https://github.com/luotao1/Paddle into luotao1-has_attr
7 years ago
Xin Pan
bcf36d8401
add more files to protected file list
...
test=develop
7 years ago
Tao Luo
0e3048db43
Merge pull request #14659 from luotao1/update_pass
...
update is_test_pass and mkldnn_placement_pass
7 years ago
Xin Pan
7e0801d4ed
Merge pull request #14441 from baojun-nervana/intel/ngraph_op
...
Implementing ngraph engine
7 years ago
Xin Pan
b80fe8264a
polish
...
test=develop
7 years ago
Yihua Xu
64e261c6cd
Implement the fusion of convolution and bias for mkldnn
...
(test=develop)
7 years ago
Tao Luo
8d6984eb9b
change OpHasAttr to RuntimeHasAttr, add some comments
...
test=develop
7 years ago
Tao Luo
a6ac42669c
Merge branch 'develop' into update_pass
7 years ago
Yu Yang
0f0e197914
Merge pull request #14666 from reyoung/feature/estiminate_flops
...
Add EstiminateFlops
7 years ago
Xin Pan
aeb74af54c
allow operator to run imperatively
7 years ago
Xin Pan
b1f6fda5e5
run forward
7 years ago
barrierye
08233beed7
add the comment for CheckFile function. test=develop
7 years ago
barrierye
d62a3dd72d
add the comment for CheckFile function. test=develop
7 years ago
barrierye
d89108766c
update CheckFile function in data_feed for ignore the space at the end of each line of data(for example, it may be added '\t' character to the end of the reduce task output when processes data by hadoop, which does not affect the correctness of the data). test=develop
7 years ago
Tao Luo
61ae88b760
Revert "Fix for accuracy problem for inplace operators when MKL-DNN mode is enabled"
7 years ago
dongdaxiang
52a0be7bb4
add mct into CMakeLists.txt
7 years ago
Tao Luo
99177b424b
Merge pull request #14693 from kbinias/fix-for-accuracy-problem-for-inlplace-operators
...
Fix for accuracy problem for inplace operators when MKL-DNN mode is enabled
7 years ago
heqiaozhi
3c239cd640
pslib
7 years ago
Krzysztof Binias
bc7db6cec9
Fix for accuracy problem for inplace operators when MKL-DNN mode is enabled
...
test=develop
7 years ago
Wang Guibao
41e19eb431
AsyncExecutor ( #14627 )
...
* AsyncExecutor: C++ side
* Google naming conventions
* Rename MultiExecutor to AsyncExecutor
* pybind with async_executor
* Naming convention
* remove some flags and unused code
* add refactored file of async_executor and data_feed
* clear async executor interface and add data feed factory
* split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
* Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
* refine async_executor_refactor.cc
* add some files about datafeed
* Revert "add some files about datafeed"
This reverts commit 8ee8133ab841196925a2812b76f18d2812a6701d.
* Interface rework
* add MultiSlotDataFeed
* Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
* update data_feed for add MultiSlotDataFeed
* update datafeed and async_executor to run bow_net demo
* fix bug that finish_set_filelist failed in multithread
* delete finish_binding_memory_(flag), because it can not be marked under the current interface
* Fix bug
* update async_executor.py for support set_use_slots
* update async_executor.py for support set_use_slots and set set_dense_slots
* fix bug that when the number of files is less than the number of threads, it will fetch nan
* remove redundant code, and make executor exit when set a illegal queue size
* add batch_size check
* add MultiSlotDesc
* Revert "add MultiSlotDesc"
This reverts commit 2e72ebfad364ed6b5dcc75f38ffb2a1fdec83d8e.
* add some checkpoint in DataFeedDesc
* add CheckFile function in MultiSlotDataFeed
* update something error info
* fix deaded lock bug
* Fix fetch variable
* Merge error
* fix code style in async_executor
* using one lock blocking queue replace two lock blocking queue because of some bugs
* update code style
* add utest for data_feed
* Fix fetch var
* update utest for data_feed for multithread
* update SetFileList info
* fix bug in utest of data_feed
* Add comments for python
* Add comments for python code
* Fix pybind.cc with new pybind11 version
* add note for DataFeedDesc's set_use_slots function
* Add save_model
* update data_feed_test for multi-type
* add comment for executor_thread_worker
* Remove unused code
* update data_feed_test for generate test data file
* removed unnecessary interfaces and add comments
* c++ style check
* update data_feed.cc
* AsyncExecutor: C++ side
Google naming conventions
Rename MultiExecutor to AsyncExecutor
pybind with async_executor
Naming convention
remove some flags and unused code
add refactored file of async_executor and data_feed
clear async executor interface and add data feed factory
split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
refine async_executor_refactor.cc
add some files about datafeed
Revert "add some files about datafeed"
This reverts commit 8ee8133ab841196925a2812b76f18d2812a6701d.
add MultiSlotDataFeed
Interface rework
Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
update datafeed and async_executor to run bow_net demo
update async_executor.py for support set_use_slots
Fix bug
update async_executor.py for support set_use_slots and set set_dense_slots
fix bug that when the number of files is less than the number of threads, it will fetch nan
remove redundant code, and make executor exit when set a illegal queue size
add MultiSlotDesc
Revert "add MultiSlotDesc"
This reverts commit 2e72ebfad364ed6b5dcc75f38ffb2a1fdec83d8e.
add some checkpoint in DataFeedDesc
Fix fetch variable
fix code style in async_executor
Fix fetch var
add utest for data_feed
Add comments for python
update utest for data_feed for multithread
fix bug in utest of data_feed
Add comments for python code
Fix pybind.cc with new pybind11 version
add note for DataFeedDesc's set_use_slots function
update data_feed_test for multi-type
Add save_model
update data_feed_test for generate test data file
removed unnecessary interfaces and add comments
add comment for executor_thread_worker
Remove unused code
update data_feed.cc
c++ style check
* commit for code style
* commit for code style
* commit for code style
* commit for code style
* Comment away __init__ in async_executor.py
* clang-format fix test=develop
* use PADDLE_THROW instead of exit(-1); use unique_ptr to manage scope var in data_feed_test.cc
* commit for update code style
* commit for update code style
* Add async_executor demo; Remove some methods
test=develop
* commit for update code style
* commit for update code style
* commit for update code style
* update API.spec
* AsyncExecutor
test=develop
* AsyncExecutor
test=develop
* AsyncExecutor
test=develop
* AsyncExecutor
test=develop
* Fix API.spec
test=develop
* Fix API.spec
test=develop
* Fix windows build error
test=develop
* FIx windows build error
test=develop
* FIx windows build error
test=develop
* FIx windows build error
test=develop
* Fix Windows Build
test=develop
* Fix Windows Build
test=develop
* Fix Windows Build
test=develop
* Fix code style
test=develop
* Fix code style
test=develop
* update datafeed
* Fix code style
test=develop
* update data_feed_test for test Tensor test=develop
* Fix code style
test=develop
* Fix windows build failure
test=develop
* Fix code style and windows build failure
test=develop
* Fix PYTHON3.5 build failure
test=develop
* AsyncExecutor API
test=develop
7 years ago
baojun-nervana
fc61bf1b16
Renamed methods
...
test=develope
7 years ago
sneaxiy
096673f675
refactor eager deletion
...
test=develop
7 years ago
Tao Luo
c856ac8721
add OpHasAttr in node.h, update is_test_pass and mkldnn_placement_pass
...
test=develop
7 years ago
Qiao Longfei
44debca844
Merge pull request #14589 from jacquesqiao/refactor-prefetch
...
Refactor prefetch
7 years ago
luotao1
5db273d874
enhance HasAttr to fix ci
...
test=develop
7 years ago
Yu Yang
589b863b98
Add EstiminateFlops
...
test=develop
7 years ago
baojun-nervana
d5ee05e6c3
Replaced VarIsTensor
...
test=develop
7 years ago
baojun-nervana
e6bd53be60
Named to RuntimeInferShape
...
test=develop
7 years ago
baojun-nervana
a29696146c
Added annotation
...
test=develop
7 years ago
baojun-nervana
caf4b937b3
Added RunInferShape
...
test=develop
7 years ago
baojun-nervana
1d19eb2bd4
Implemented ngraph engine
...
test=develop
7 years ago
luotao1
fe915901cd
update Opdesc's HasAttr
...
test=develop
7 years ago
chengduo
6776e92846
refine tensor_array_write_read ( #14643 )
...
test=develop
7 years ago
Qiao Longfei
8ebde595c9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
...
test=develop
7 years ago
Qiao Longfei
145c535750
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
...
test=develop
7 years ago
minqiyang
a02ce58f2c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
...
test=develop
7 years ago
JiabinYang
7594787deb
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enhance_hierachical_sigmod_op
7 years ago
Xin Pan
3c77ce3751
Merge pull request #14593 from panyx0718/fix5
...
Protect important header files.
7 years ago
Tao Luo
e8ef14d2a7
Merge pull request #14610 from Superjomn/revert/cache_fix
...
Revert "fix transfer cache thread_local bug (#14581 )"
7 years ago
JiabinYang
87648f8edf
merge develop, test=develop
7 years ago
wopeizl
db9284ecde
Merge pull request #14617 from wopeizl/windows/online
...
Windows/online
7 years ago
JiabinYang
c3c3c0b33c
polish code, test=develop
7 years ago
gongweibao
867c312bc4
Fix allreduce dependency order. ( #14586 )
7 years ago
Qiao Longfei
9589babe12
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
...
test=develop
7 years ago
peizhilin
38715e6fd0
minor fix
7 years ago
JiabinYang
7e4bd695e6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enhance_hierachical_sigmod_op
7 years ago
JiabinYang
b10df8bcfa
refine code and add none bias ut, test=develop
7 years ago
superjomn
dc249d3b69
Revert "fix transfer cache thread_local bug ( #14581 )"
...
This reverts commit 5c073a4db2
.
7 years ago
minqiyang
be04d99fe4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
...
test=develop
7 years ago
Qiao Longfei
af2f5fc824
fix some bugs
7 years ago
Xin Pan
3e665862b8
Protect important header files.
...
test=develop
7 years ago
minqiyang
53433d7f2e
Revert the changes of VLOG
...
test=develop
7 years ago
peizhilin
30849d1f20
Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Yan Chunwei
5c073a4db2
fix transfer cache thread_local bug ( #14581 )
7 years ago
Xin Pan
87332bb18d
Merge pull request #14579 from Superjomn/fix/transfer-cache-compile-error
...
fix compile
7 years ago
peizhilin
36cd18b549
Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
peizhilin
1afa9492af
Recover the profiler
7 years ago
Yiqun Liu
bf222f197d
Use sub scope in tensor_array_to_tensor op. ( #14524 )
...
test=develop
7 years ago
superjomn
4b40c0013b
fix compile
...
test=develop
7 years ago
Yan Chunwei
a7188d5bc7
fix executor transfer cache bug ( #14518 )
7 years ago
JiabinYang
42470f14b7
test=develop
7 years ago
qingqing01
36f08eef3b
CUDA kernel for density_prior_box_op. ( #14513 )
...
* CUDA kernel for density_prior_box_op.
* Support flatten to 2D.
7 years ago
peizhilin
7c8c9dc9bf
fix unit test cases
7 years ago
JiabinYang
3c6102a367
test=develop
7 years ago
sneaxiy
488610a65a
merge develop
...
test=develop
7 years ago
wopeizl
d9a1f3e58e
Windows/online ( #14474 )
...
* add recordio support
* disable the openblas multi-thread on windows since no support
adjust the python script
* code style
* code style
test=develop
* add create_recordio_file_reader back
* fix code style
test=develop
* fix the gtest.cmake on windows
* fix cc_test on windows
* fix the win build
test=develop
* remove fused compile support on windows
test=develop
* add the jit support
test=develop
* add the jit support, test=develop
* add the jit support, test=develop
* add the jit back
fix compile error on windows
* rollback test=develop
* test case fix
* disable DSO by default on windows
* exclude warpctc_op on windows
* exclude the dynload_warpctc out on windows
test=develop
* fix the scripts error
test=develop
* disable avx on windows by default
test=develop
* re-organize the cmake file
* disable mkl on windows by default
* add warp_ctc back
* fix the dependency
* fix the dependency
* fix the build issue on windows
* remove unsupported flag on windows
* code style
* code style
test=develop
* fix issue
* add profiler, parallel_executor back
* clean up the pre-definitions on windows
* fix build issue
* test=develop
7 years ago
sneaxiy
3912545ffe
add dlpack support
...
test=develop
7 years ago
Jiabin Yang
f7b55de9e5
Merge branch 'develop' into enhance_hierachical_sigmod_op
7 years ago
JiabinYang
af9a3301da
test=develop
7 years ago
JiabinYang
014e50c284
test=develop
7 years ago
peizhilin
6e66fadb95
clean up the pre-definitions on windows
7 years ago
peizhilin
67562a6fcd
Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
peizhilin
703b26e697
add profiler, parallel_executor back
7 years ago
tensor-tang
10fb4ceefc
Merge pull request #14351 from tpatejko/tpatejko/mkldnn-elementwise_mul
...
[MKLDNN][JIT][AVX512] Elementwise Mul
7 years ago
peizhilin
25adf970b2
Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
chengduo
a94a7355f0
Refine the GraphNum check ( #14144 )
...
* refine GraphCheck
test=develop
* fix ci fail
test=develop
7 years ago
peizhilin
3a72a634cf
Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Tao Luo
cf685f361b
Merge pull request #14458 from tpatejko/tpatejko/mkldnn-skip-connections
...
[WIP] Correcting and extending MKLDNN residual connection fuse pass
7 years ago
Yu Yang
98bbfc17be
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
...
test=develop
7 years ago
Yu Yang
d424115f9e
Clean code
...
test=develop
7 years ago
peizhilin
1cc23ef67d
merge from paddle:develop
7 years ago
Michal Gallus
4e54ab76ec
Add HasAttr method to Operator
7 years ago
Wu Yi
a2d9b34417
Refine operator cmake ( #14413 )
...
* wip simplify operator framework
* wip
* wip
* done test=develop
* clean test=develop
* fix test=develop
* fix deps test=develop
* fix cpu build test=develop
* fix tensorrt build test=develop
* fix tests test=develop
* fix test=develop
* fix cpu build test=develop
7 years ago
Tomasz Patejko
53da846d1e
MKLDNN residual connections fuse pass: initial implementation of fusion for projection pass
...
test=develop
7 years ago
Tomasz Patejko
dbc4fcd722
MKLDNN residual connections fuse pass: unit tests enabled and added
7 years ago
Tomasz Patejko
4224089354
MKLDNN residual connections fuse pass: Maybe removed and boost::optional used where it makes sense
7 years ago
Tomasz Patejko
86fd3b32be
MKLDNN residual connections fuse pass: counting statistics added to the pass
7 years ago
Tomasz Patejko
ee6f778beb
MKLDNN residual connections fuse pass: further refactoring
7 years ago
Tomasz Patejko
7423748e37
MKLDNN residual connections fuse pass:
...
* implements reachability check between identity node and non-identity argument to elementwise_add
* implements handling identity node as x and as y argument to elementwise_add
7 years ago
peizhilin
6d0d5a76eb
Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Yan Chunwei
7796f65f89
fix inference on gpu out of mem ( #14414 )
...
* fix inference on gpu out of mem
the transfer logic in operator.cc will keep creating new scopes.
7 years ago
Yu Yang
c8f6e70ab4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
...
test=develop
7 years ago
Sylwester Fraczek
8a1eeec579
add mkldnn prop_kind phase for inference-only case to pooling and activations ( #14278 )
...
* add is_test to pooling and activations
add prop_kind support for layers activation. conv and pooling
add a pass that sets is_test to true
add transpiler version of is_test pass
test=develop
* patch test and pass
test=develop
* add pass to analyzer.h
test=develop
* add is_test attr description & pass only on mkldnn
in:
activation_op.cc
batch_norm_op.cc
conv_op.cc
dropout_op.cc
lrn_op.cc
pool_op.cc
sequence_pool_op.cc
softmax_op.cc
* fix is_test handling for activation pool and conv
* change description of is_test for all layers again
* remove GetAttr(use_mkldnn) from pass
* rename correct_mkldnn_test_phase to is_test
and remove dependency on MKLDNN
test=develop
* review fix magic number
* two if(..)s into one
* Check is_test once and pass mkldnn forward prop kind
* dereference shared_ptr with * (without get())
test=develop
* add is_test_pass back
test=develop
7 years ago
peizhilin
d1429ac4a5
add recordio support
7 years ago
Yu Yang
0d6718fcbd
Pass compile
7 years ago
Tao Luo
5ef123c778
Merge branch 'develop' into dam_fc
7 years ago
dzhwinter
d3aed98d86
Merge pull request #14320 from wopeizl/windows/online
...
Windows/online
7 years ago
peizhilin
be332a13bc
Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Xin Pan
8cfda7ee0c
Merge pull request #14382 from panyx0718/fix4
...
Refine the pass builder and buildstrategy
7 years ago
Tao Luo
980a6753a8
fix typo to pass the ci
...
test=develop
7 years ago
Tao Luo
c7b3bfcdf1
Merge pull request #14376 from baojun-nervana/intel/ngraph_fusedop
...
Adding fused operator for ngraph
7 years ago
peizhilin
1a9008c420
code style fix
...
test=develop
7 years ago
Tao Luo
8ea13e336a
add in_num_col_dims for fc
7 years ago
peizhilin
08d1dc84a9
fix
7 years ago
peizhilin
30ddc07a7e
Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Qiao Longfei
e65cbd3b06
Merge pull request #14387 from jacquesqiao/lookup_sparse_table_add_test_mode
...
Lookup sparse table add test mode
7 years ago
Zeng Jinle
7066b3850a
Merge pull request #14395 from sneaxiy/fix_num_threads_in_fast_pe
...
Fix num_threads settings in fast_pe
7 years ago
Yan Chunwei
9f252e0032
Combine Inference Analysis with IR ( #13914 )
7 years ago
baojun-nervana
51a538e055
Fix style and use enum
...
test=develop
7 years ago
Zeng Jinle
38d32c98b8
merge develop
...
test=develop
7 years ago
sneaxiy
eb18d532a5
fix num_threads in fast_pe
...
test=develop
7 years ago
Qiao Longfei
51f3838f96
add log for not exist code
...
test=develop
7 years ago
Qiao Longfei
7aa8b2ccf2
optimize code
7 years ago
Qiao Longfei
8d205c853c
add is_test for lookup_sparse_table
7 years ago
baojun-nervana
ea3538d8dd
Added fused operator
...
test=develop
7 years ago
Xin Pan
759ffca423
some improvements
...
test=develop
7 years ago
Xin Pan
99dffb91d6
allow to repeatedly share and update BuildStrategy
...
test=develop
7 years ago
Tao Luo
6c32945556
Merge pull request #14372 from luotao1/speedup_analysis
...
speedup DetectPatterns
7 years ago
Tao Luo
668ae523d2
speedup DetectPatterns
...
test=develop
7 years ago
Yan Chunwei
9a6e239281
fix mac graph detector sort ( #14356 )
7 years ago
peizhilin
1b75fd2236
revert
7 years ago
peizhilin
7840d181c9
fix style issue
7 years ago
peizhilin
dc339b78d7
fix code style
7 years ago
sneaxiy
d231e55065
merge develop
...
test=develop
7 years ago
JiabinYang
c8801e100f
grad diff problem to be fixed and need api spec change to be done
7 years ago
peizhilin
e23061e0dc
Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
chengduo
6c6e638550
Add InferVarType for some op ( #14201 )
...
* add_infer_var_type
test=develop
* InferVarTypeHelper-> VarTypeInferenceHelper
test=develop
* PassInputTypeAndDTypeOnOutput
test=develop
* follow comment
test=develop
7 years ago
peizhilin
ca60e1d34d
Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
peizhilin
d08334011a
fix merge issue
7 years ago
peizhilin
52f7644f53
Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
qingqing01
abe209234f
Exhaustive search for cuDNN conv. ( #14286 )
...
* exhaustive search for cuDNN conv.
* Refine code and add unit testing.
* Fix model load in fluid/inference and unit testing in conv2d
* Follow comments.
* Fix compiling test=develop
7 years ago
Yu Yang
fdc689142c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
...
test=develop
7 years ago
minqiyang
87450b9ad4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
...
test=develop
7 years ago
peizhilin
dcfab11193
merge from develop
7 years ago
peizhilin
4ffa92d4f0
Merge branch 'develop' into windows/build
7 years ago
chengduo
c5b6573a5a
Fix input<tensor> ( #14208 )
...
* fix input<tensor>
test=develop
* fix split_ids
test=develop
* ElementwiseMul should not support SelectedRows
* fix scale op
test=develop
* change GetTensorFromVar() method to GetTensorOrSelectedRowsFromVar()
* fix operator
* refine MultiOutput
* fix MultiOutput
test=develop
* disable test_dist_save_load
test=develop
* fix elementwise_op
test=develop
* add get_sparse_as_op
test=develop
* add info for check
test=develop
* rename get_sparse_as_op with extract_rows_as_op.
test=develop
* elementwise doesn't support selected_rows
* fix regularizer
* remove extract_rows_as
test=develop
* fix ci
test=develop
* add test for sum_op
* fix regularizer
test=develop
* test=develop
* fix pserver weight decay multi inputs test=develop
7 years ago
Tao Luo
813e54efbd
Merge pull request #14328 from PaddlePaddle/revert-14046-windows/debug
...
Revert "cherry picked windows patches."
7 years ago
minqiyang
3db9fad764
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
...
test=develop
7 years ago
Xin Pan
ff6c809bfc
Merge pull request #14251 from panyx0718/fix
...
Make OpHandle/VarHandle and ir::Node works cleaner
7 years ago
Zhaolong Xing
ba8b5619a3
Revert "cherry picked windows patches."
7 years ago
minqiyang
49710960ef
Revert tensor_util.cu
...
test=develop
7 years ago
minqiyang
fcc0452c8b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_vlog
...
test=develop
7 years ago
minqiyang
0c3227a523
Change the origin VLOG level to 10 times
...
Fix code to support cpplint syntax check
test=develop
7 years ago
Xin Pan
59c66532e7
add more logs and comments
...
test=develop
7 years ago
peizhilin
869487a2b7
Merge remote-tracking branch 'origin/develop' into windows/build
7 years ago
dzhwinter
234a1d9248
Merge remote-tracking branch 'origin/develop' into windows/debug
...
test=develop
7 years ago
Yu Yang
057a682ee9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
Yu Yang
c28beb8a3c
test(Pe): add dry run tests for pe ( #14254 )
...
Dry run tests will skip `Op.Run` and just perform job scheduling. It helps to analysis dead lock in PE.
test=develop
7 years ago
Qiao Longfei
6449faec37
Merge pull request #14259 from jacquesqiao/optimize-thread-pool
...
Optimize thread pool
7 years ago
Qiao Longfei
4062f00f2a
optimize thread pool code
...
test=develop
7 years ago
dzhwinter
2835e04409
merge develop branch. test=develop
7 years ago
qingqing01
db8c52da5e
Revert " Exhaustive search for cuDNN conv. ( #14043 )"
...
This reverts commit ce7d9b0799
.
7 years ago
qingqing01
ce7d9b0799
Exhaustive search for cuDNN conv. ( #14043 )
...
* exhaustive search for cuDNN conv.
* Refine code and add unit testing.
* Clean code
* Fix model load in fluid/inference and unit testing in conv2d
* Follow comments.
7 years ago
Xin Pan
cb2d33a851
resolve conflict
...
test=develop
7 years ago
peizhilin
a37918c31f
fix python package issue
7 years ago
Xin Pan
25123a3b7e
add tests
...
test=develop
7 years ago
Xin Pan
8c11d3fed6
clean up
7 years ago
Xin Pan
0a89650507
fix more tests
...
test=develop
7 years ago
Xin Pan
a3b27e3237
fix
...
test=develop
7 years ago
Xin Pan
f25eb9a71d
fix some tests.
...
test=develop
7 years ago
Xin Pan
adf5615e54
clean kGraphOp
...
test=develop
7 years ago
Xin Pan
fb576cb5cb
allow to compare type
...
test=develop
7 years ago
Xin Pan
ead94bfc6c
fix destructor
...
test=develop
7 years ago
Xin Pan
2e14999942
clean1
...
test=develop
7 years ago
Xin Pan
34b401fc6c
clean up a global graph attr.
7 years ago
Zeng Jinle
8ac2242b6e
Merge pull request #14075 from sneaxiy/remove_some_locks_in_pe
...
Remove some locks in ParallelExecutor
7 years ago
sneaxiy
faac8a76ce
remove unnecessary codes
...
test=develop
7 years ago
Qiao Longfei
fe4cd50286
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-thread-pool
...
test=develop
7 years ago
Qiao Longfei
ac415c0094
change lock_guard to unique_lock
7 years ago
Qiao Longfei
f4a76078d0
optimize thread pool
7 years ago
Wu Yi
306236c2c0
feature/DC asgd ( #12722 )
...
* wip
* add ref_by_trainer_id op
* ready to test
* fix ref inputs
* refine rpc_op_handle
* fix merge bug
7 years ago
peizhilin
71d7980f69
fix build issue 1
7 years ago
peizhilin
9d67c1fb69
cpu build support
7 years ago
Xin Pan
c2d70fca30
fix to only check block 0
...
test=develop
7 years ago
sneaxiy
7ff320f8cc
merge develop
7 years ago
Xin Pan
d0459ac8d0
Merge pull request #14223 from panyx0718/fix5
...
add more debug info.
7 years ago
Xin Pan
aaeedd0ff3
make it warn
...
test=develop
7 years ago
Xin Pan
ddd2225b56
add more debug info.
...
test=develop
7 years ago
sneaxiy
bbc818a5a1
test=develop
7 years ago
sneaxiy
203027ca86
test=develop
7 years ago
Tao Luo
d2a56f7909
Merge pull request #14159 from sfraczek/sfraczek/depthwise-conv-mkldnn-pass
...
add depthwise conv mkldnn pass
7 years ago
dzhwinter
eb2f7ed21b
refine tests. test=develop
7 years ago
Xin Pan
08d22cf7e1
Merge pull request #14091 from panyx0718/fix2
...
add program check
7 years ago
Wu Yi
91b2851cdc
enable pyreader use pin memory ( #14066 )
...
* enable pyreader use pin memory
* add py reader pin memory test test=develop
7 years ago
sneaxiy
cf1944af2a
test=develop
7 years ago
Yan Chunwei
f76fee644c
fix graph pattern detector ( #14186 )
7 years ago
chengduo
e1742050ea
fix merge lod_tensor bug ( #14199 )
...
test=develop
7 years ago
dzhwinter
1ace55c8ee
merge develop branch
7 years ago
dzhwinter
9da7b33515
details
7 years ago
Xin Pan
913b569903
Merge pull request #14151 from panyx0718/fix
...
add a small test to verify tensor type
7 years ago
chengduo
2ccf77d1c1
Refine GetTensorFromVar ( #14160 )
...
* fix GetTensorFromVar
test=release/1.1
* refine GetTensorFromVar
test=develop
7 years ago
Yan Xu
d10b8efcc0
Merge pull request #14152 from Yancey1989/add_fused_broadcast_unittest
...
add fused broadcast op unit test
7 years ago
Yu Yang
c21597cf07
fix(PE): use shared_ptr<BlockingQueue> for cross thread communication ( #14136 )
...
It seems that the blocking queue might be destroyed early than Run
method complete. It might because the Run method throw some unhandled
exception. However, it should be shared_ptr when multthread access an
resource. So change BlockingQueue as a shared_ptr.
test=develop
7 years ago
Yan Chunwei
62a0fe0860
fix tensor array bug ( #14166 )
...
remove the optimized but buggy implementation
7 years ago
chengduo
ed087f8232
refine op_handle ( #14178 )
...
test=develop
7 years ago
Yu Yang
90d9e5aee8
feat(platform): lazy initialization of devicecontext in pool ( #14067 )
...
* feat(platform): lazy initialization of devicecontext in pool
Use std::async(deferer, []{...}) to lazy initialize DeviceContext in Pool
test=develop
* Add future includes
test=develop
7 years ago
Sylwester Fraczek
4e2aaf01bc
add depthwise conv mkldnn pass
...
added depthwise conv mkldnn pass which for MKLDNN changes depthwise_conv operator to conv operator because for mkldnn this is the same api
test=develop
7 years ago
Yancey1989
6bfa6a0a33
add fused broadcast op unit test, test=develop
7 years ago
Xin Pan
e2db0b9bf3
add a small test to verify tensor type
...
test=develop
7 years ago
dzhwinter
bf2e4cb188
cleard. staged
7 years ago
Xin Pan
a943134a97
fix a few more tests
...
test=develop
7 years ago
chengduo
2f639113ee
Fix sum_op's GetExpectedKernelType ( #14112 )
...
* fix sum_op's GetExpectedKernelType
test=develop
* fix ci fail
test=develop
7 years ago
Xin Pan
5839e3236b
add program check
...
test=develop
7 years ago
sneaxiy
2494ca83ab
test=develop
7 years ago
dzhwinter
ebfe5a02b3
merge develop branch
7 years ago
sneaxiy
f2eed667c0
test=develop
7 years ago
Xin Pan
16dfedb8b8
Merge pull request #14103 from jacquesqiao/cpu-for-1.1-merge-with-shape
...
[1.1] Cpu for 1.1 merge with shape
7 years ago
sneaxiy
cef8cc81db
merge develop
7 years ago
Xin Pan
177720a737
Merge pull request #14116 from chengduoZH/release/1.1.0
...
[1.1]Fix op_role value
7 years ago
chengduozh
acec4cb8ca
[1.1]fix op_role value
...
test=release/1.1
7 years ago
Qiao Longfei
3d4e050802
fix compile, optimize code test=develop
7 years ago
Yu Yang
c01696f8c2
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
...
test=develop
7 years ago
Qiao Longfei
d26ff8cb2d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into cpu-for-1.1-merge-with-shape
7 years ago
Wu Yi
26200f2e42
[1.1] [project] train imagenet using large batch size ( #13766 )
...
* fix nccl2 lars dist support
* put lars in momentum op
* add tests lars
* fix ci
* fix cpu kernel
* soft warning
* remove lars in test_recognize_digits.py
* move to another op
* add file
* update api.spec test=develop
* update test=develop
* fix api.spec test=develop
* wip
* wip, finish grad merge ops
* wip, finish graph build
* wip test running
* work on 1 gpu
* workable version
* update
* fix tests
* fuse broadcast op
* fix compile failed
* refine
* add batch merge test mnist
* fix CI test=develop
* fix build
* use independent bn params for batch merge test=develop
* update api.spec
* follow comments and for test
* wip
* refine tests test=develop
* follow comments test=develop
* remove startup bn modify test=develop
* follow comments test=develop
* fix merge test=develop
7 years ago
sneaxiy
2414f92f54
test=develop
7 years ago
sneaxiy
45559d042c
move to pass
...
test=develop
7 years ago
Qiao Longfei
7cd2417fe2
Merge branch 'develop' into cpu-for-1.1-merge-with-shape
...
test=develop
7 years ago
Xin Pan
0a80f06ec4
Merge pull request #14086 from panyx0718/fix6
...
delete unused codes.
7 years ago
sneaxiy
a314a80cdb
merge develop
7 years ago
Yan Chunwei
ee74be3a49
[1.1] Bugfix/tensorarray ( #14044 )
7 years ago
seiriosPlus
c34610f86d
Fix lookup table at CPU Reduce strategy, test=develop
7 years ago
Qiao Longfei
641369f92b
Merge branch 'dist-table-do-not-init-on-trainer' of ssh://github.com/jacquesqiao/Paddle into cpu-for-1.1-merge
7 years ago
Qiao Longfei
1ed9ef6d70
Merge branch 'shape_int_to_int64' of https://github.com/seiriosPlus/Paddle into cpu-for-1.1-merge
7 years ago
Qiao Longfei
f1a3fb041b
Merge branch 'fix_lookuptable_in_reduce' of https://github.com/seiriosPlus/Paddle into cpu-for-1.1-merge
7 years ago
Wu Yi
9da9b1926b
[1.1] fix graph num hang ( #14072 )
...
* fix graph num hang test=develop
* re-enable tests test=develop
* re-enable graph num check test=develop
* fix multi device pass role check test=develop
7 years ago
Qiao Longfei
fad42fe7cc
broadcast handle not inited parameter
7 years ago
dzhwinter
7141debe38
add cudnn back. staged.
7 years ago
Zeng Jinle
60058180cb
Merge pull request #13945 from sneaxiy/unify_mixed_vector_api
...
Unify API of mixed_vector in GPU and CPU
7 years ago
Xin Pan
bcc9126e7b
Merge pull request #14056 from panyx0718/fix
...
Fix threadpool
7 years ago
Tao Luo
961baea16c
Merge pull request #14063 from wojtuss/wojtuss/remove-unused-EnableMKLDNN
...
remove unused method from naive executor
7 years ago
Xin Pan
bba0c4a9f2
delete unused codes.
...
test=develop
7 years ago
tangwei12
318ba99124
revert changes in protobuf.cc and type_defs
7 years ago