Xin Pan
8138391631
add OpBase and unify with VarBase
...
test=develop
6 years ago
Xin Pan
f6f0692451
clean up
...
test=develop
6 years ago
Xin Pan
0318c95149
rebase develop
6 years ago
Xin Pan
aeb74af54c
allow operator to run imperatively
6 years ago
Xin Pan
b1f6fda5e5
run forward
6 years ago
Xin Pan
a6d23083f0
some tracing
...
test=develop
6 years ago
Xin Pan
dac92e560c
initial commit
6 years ago
barrierye
08233beed7
add the comment for CheckFile function. test=develop
6 years ago
barrierye
d62a3dd72d
add the comment for CheckFile function. test=develop
6 years ago
barrierye
d89108766c
update CheckFile function in data_feed for ignore the space at the end of each line of data(for example, it may be added '\t' character to the end of the reduce task output when processes data by hadoop, which does not affect the correctness of the data). test=develop
6 years ago
phlrain
9f7eae861d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
6 years ago
Tao Luo
61ae88b760
Revert "Fix for accuracy problem for inplace operators when MKL-DNN mode is enabled"
6 years ago
phlrain
25df78eaf3
fix api spec; test=develop
6 years ago
phlrain
4c256ca6be
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
6 years ago
phlrain
b65722d3cf
fix uni test; test=develop
6 years ago
Tao Luo
99177b424b
Merge pull request #14693 from kbinias/fix-for-accuracy-problem-for-inlplace-operators
...
Fix for accuracy problem for inplace operators when MKL-DNN mode is enabled
6 years ago
tangwei12
618f7620e2
add enforce for auc ( #14687 )
...
* add enforce for AUC, test=develop
6 years ago
Krzysztof Binias
bc7db6cec9
Fix for accuracy problem for inplace operators when MKL-DNN mode is enabled
...
test=develop
6 years ago
phlrain
2770ea1a73
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
6 years ago
chengduozh
3f4aca618f
code refine
...
test=develop
6 years ago
chengduozh
af8c2cec13
fix operator.cmake
...
test=develop
6 years ago
chengduozh
679d8fc6fe
rename op name
...
test=develop
6 years ago
chengduozh
1013d6d05d
Merge branch 'add_cudnn_lstm' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
6 years ago
jerrywgz
3df0538940
replace -100 to kIgnoreIndex
6 years ago
Wang Guibao
41e19eb431
AsyncExecutor ( #14627 )
...
* AsyncExecutor: C++ side
* Google naming conventions
* Rename MultiExecutor to AsyncExecutor
* pybind with async_executor
* Naming convention
* remove some flags and unused code
* add refactored file of async_executor and data_feed
* clear async executor interface and add data feed factory
* split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
* Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
* refine async_executor_refactor.cc
* add some files about datafeed
* Revert "add some files about datafeed"
This reverts commit 8ee8133ab841196925a2812b76f18d2812a6701d.
* Interface rework
* add MultiSlotDataFeed
* Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
* update data_feed for add MultiSlotDataFeed
* update datafeed and async_executor to run bow_net demo
* fix bug that finish_set_filelist failed in multithread
* delete finish_binding_memory_(flag), because it can not be marked under the current interface
* Fix bug
* update async_executor.py for support set_use_slots
* update async_executor.py for support set_use_slots and set set_dense_slots
* fix bug that when the number of files is less than the number of threads, it will fetch nan
* remove redundant code, and make executor exit when set a illegal queue size
* add batch_size check
* add MultiSlotDesc
* Revert "add MultiSlotDesc"
This reverts commit 2e72ebfad364ed6b5dcc75f38ffb2a1fdec83d8e.
* add some checkpoint in DataFeedDesc
* add CheckFile function in MultiSlotDataFeed
* update something error info
* fix deaded lock bug
* Fix fetch variable
* Merge error
* fix code style in async_executor
* using one lock blocking queue replace two lock blocking queue because of some bugs
* update code style
* add utest for data_feed
* Fix fetch var
* update utest for data_feed for multithread
* update SetFileList info
* fix bug in utest of data_feed
* Add comments for python
* Add comments for python code
* Fix pybind.cc with new pybind11 version
* add note for DataFeedDesc's set_use_slots function
* Add save_model
* update data_feed_test for multi-type
* add comment for executor_thread_worker
* Remove unused code
* update data_feed_test for generate test data file
* removed unnecessary interfaces and add comments
* c++ style check
* update data_feed.cc
* AsyncExecutor: C++ side
Google naming conventions
Rename MultiExecutor to AsyncExecutor
pybind with async_executor
Naming convention
remove some flags and unused code
add refactored file of async_executor and data_feed
clear async executor interface and add data feed factory
split async executor into executor_thread_worker and async_executor, refactor pybind, add datafeed and corresponding proto
Fix async_executor interfaces: 1) Remove all protobufs; 2) Stop after each epoch
refine async_executor_refactor.cc
add some files about datafeed
Revert "add some files about datafeed"
This reverts commit 8ee8133ab841196925a2812b76f18d2812a6701d.
add MultiSlotDataFeed
Interface rework
Creating DataFeedDesc from .proto file, then manipulate it (add/del fields etc) from python side
update datafeed and async_executor to run bow_net demo
update async_executor.py for support set_use_slots
Fix bug
update async_executor.py for support set_use_slots and set set_dense_slots
fix bug that when the number of files is less than the number of threads, it will fetch nan
remove redundant code, and make executor exit when set a illegal queue size
add MultiSlotDesc
Revert "add MultiSlotDesc"
This reverts commit 2e72ebfad364ed6b5dcc75f38ffb2a1fdec83d8e.
add some checkpoint in DataFeedDesc
Fix fetch variable
fix code style in async_executor
Fix fetch var
add utest for data_feed
Add comments for python
update utest for data_feed for multithread
fix bug in utest of data_feed
Add comments for python code
Fix pybind.cc with new pybind11 version
add note for DataFeedDesc's set_use_slots function
update data_feed_test for multi-type
Add save_model
update data_feed_test for generate test data file
removed unnecessary interfaces and add comments
add comment for executor_thread_worker
Remove unused code
update data_feed.cc
c++ style check
* commit for code style
* commit for code style
* commit for code style
* commit for code style
* Comment away __init__ in async_executor.py
* clang-format fix test=develop
* use PADDLE_THROW instead of exit(-1); use unique_ptr to manage scope var in data_feed_test.cc
* commit for update code style
* commit for update code style
* Add async_executor demo; Remove some methods
test=develop
* commit for update code style
* commit for update code style
* commit for update code style
* update API.spec
* AsyncExecutor
test=develop
* AsyncExecutor
test=develop
* AsyncExecutor
test=develop
* AsyncExecutor
test=develop
* Fix API.spec
test=develop
* Fix API.spec
test=develop
* Fix windows build error
test=develop
* FIx windows build error
test=develop
* FIx windows build error
test=develop
* FIx windows build error
test=develop
* Fix Windows Build
test=develop
* Fix Windows Build
test=develop
* Fix Windows Build
test=develop
* Fix code style
test=develop
* Fix code style
test=develop
* update datafeed
* Fix code style
test=develop
* update data_feed_test for test Tensor test=develop
* Fix code style
test=develop
* Fix windows build failure
test=develop
* Fix code style and windows build failure
test=develop
* Fix PYTHON3.5 build failure
test=develop
* AsyncExecutor API
test=develop
6 years ago
JiabinYang
a770d5c9db
fix error don't interupt shell
...
, test=develop
6 years ago
whs
1b9753d109
Make pad2d support for variable paddings. ( #14667 )
...
* Make pad2d support for variable paddings.
test=develop
* Rename get_paddings and add inline modifier.
test=develop
* Fix comments.
6 years ago
Tao Luo
2af5762cf8
Merge pull request #14668 from wzzju/use_small_dam
...
support the small dam model. test=develop
6 years ago
Tao Luo
ff16c47898
Merge pull request #14671 from luotao1/box_coder
...
speedup box_coder_op for multi-threads
6 years ago
baojun-nervana
fc61bf1b16
Renamed methods
...
test=develope
6 years ago
sneaxiy
096673f675
refactor eager deletion
...
test=develop
6 years ago
ZhenWang
6e48e47406
test=develop
6 years ago
ZhenWang
e1da6cd754
add the normal dam and the small dam
6 years ago
luotao1
bcc90123f0
speedup box_coder_op for multi-threads
...
test=develop
6 years ago
ZhenWang
d5947b0ed7
test=develop
6 years ago
ZhenWang
33b4963505
unify the normal and small dam model.
6 years ago
Yan Chunwei
4b7617740e
fix container not cleared ( #14231 )
6 years ago
Tao Luo
c856ac8721
add OpHasAttr in node.h, update is_test_pass and mkldnn_placement_pass
...
test=develop
6 years ago
ZhenWang
8f2e556e65
support the small dam model. test=develop
6 years ago
phlrain
6ce4250172
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
6 years ago
Qiao Longfei
44debca844
Merge pull request #14589 from jacquesqiao/refactor-prefetch
...
Refactor prefetch
6 years ago
phlrain
bd94ab0ef3
rename op; test=develop
6 years ago
phlrain
92f5be1d82
remove inputvarname in operator; test=develop
6 years ago
Xin Pan
40f1c4a6f0
fix
...
test=develop
6 years ago
phlrain
cf1fe61004
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
6 years ago
luotao1
5db273d874
enhance HasAttr to fix ci
...
test=develop
6 years ago
Yu Yang
589b863b98
Add EstiminateFlops
...
test=develop
6 years ago
phlrain
4b9689379f
fix cudnn lstm; test=develop
6 years ago
phlrain
d1a17cadd4
fix cudnn rnn; test=develop
6 years ago
JiabinYang
4124253796
add mac ci check on import, test=develop
6 years ago
Qiao Longfei
9450048acb
add PADDLE_ENABLE_REMOTE_PREFETCH to enable remote prefetch
...
test=develop
6 years ago
Xin Pan
75939c2059
fix
...
test=develop
6 years ago
Tao Luo
20120d9c97
Merge pull request #14608 from jczaja/prv-conv2d-transpose-mkldnn
...
[MKL-DNN]conv2d transpose
6 years ago
Qiao Longfei
3e45a5a5ec
lookup_table gpu kernel support prefetch
...
test=develop
6 years ago
Zhaolong Xing
d215293c92
Merge pull request #14649 from NHZlX/add_params_sync_pass
...
Add params sync pass
6 years ago
Qiyang Min
055da6e00d
Merge pull request #14656 from velconia/disable_dist_transpiler_ut_in_mac
...
Change pip to correct version when install wheel package
6 years ago
qingqing01
731d45a39a
Enable BatchNorm to use global mean and variane during training ( #14630 )
...
* Enable BatchNorm to use global mean and variane during training
* Update doc and follow comments.
6 years ago
nhzlx
49c28b8c52
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_params_sync_pass
...
test=develop
6 years ago
nhzlx
3c83a2f720
fix comments
6 years ago
Xin Pan
ad6ed5b745
fix py3
...
test=develop
6 years ago
Xin Pan
0cc9ab3dc2
enable API check for readers
...
test=develop
6 years ago
luotao1
4a4daa8ab4
Merge branch 'develop' into has_attr
6 years ago
Qiao Longfei
75eba6108d
Add scope doc ( #14582 )
...
* add doc for scope
* update doc for force_init_on_cpu
test=develop
* follow comment test=develop
* update format test=develop
6 years ago
Tao Luo
ea47685f91
Merge pull request #14646 from jczaja/prv-softmax-mkl-sasum
...
Softmax for inference MKL further changes
6 years ago
Qiao Longfei
3a3cfc2d8d
prefetch support gpu
...
test=develop
6 years ago
minqiyang
fe0dee88d8
Change pip version to correct version when install wheel package
...
test=develop
6 years ago
baojun-nervana
d5ee05e6c3
Replaced VarIsTensor
...
test=develop
6 years ago
baojun-nervana
e6bd53be60
Named to RuntimeInferShape
...
test=develop
6 years ago
Sang Ik Lee
24e70920db
Refactor some build settings.
...
test=develop
6 years ago
baojun-nervana
a29696146c
Added annotation
...
test=develop
6 years ago
Sang Ik Lee
d6125a5eec
Include ngraph in inference demo build.
...
test=develop
6 years ago
baojun-nervana
caf4b937b3
Added RunInferShape
...
test=develop
6 years ago
baojun-nervana
1d19eb2bd4
Implemented ngraph engine
...
test=develop
6 years ago
Qiao Longfei
4b9082a4cd
follow comment
6 years ago
Tao Luo
b4de023ee1
Merge pull request #14636 from Superjomn/fix/word2vec
...
fix word2vec bug
6 years ago
luotao1
fe915901cd
update Opdesc's HasAttr
...
test=develop
6 years ago
chengduo
6776e92846
refine tensor_array_write_read ( #14643 )
...
test=develop
6 years ago
nhzlx
d3e140a572
Merge branch 'develop' of https://github.com/paddlepaddle/paddle into add_params_sync_pass
...
test=develop
6 years ago
nhzlx
d666c8eb1d
fix benchmark
6 years ago
nhzlx
900fbb83f9
add params sync pass
6 years ago
superjomn
9c665c81ae
update
...
test=develop
6 years ago
Jacek Czaja
48e1b97e8e
- Coding style fixes
...
test=develop
6 years ago
Qiao Longfei
d32de7e6e1
fix code format test=develop
6 years ago
Qiao Longfei
5a660aee7d
update log level in parameter prefetch test=develop
6 years ago
Qiao Longfei
8ebde595c9
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
...
test=develop
6 years ago
Qiao Longfei
b9d3d75fc4
fix prefetch dependency test=develop
6 years ago
Qiao Longfei
145c535750
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
...
test=develop
6 years ago
minqiyang
9d7c3b18c0
Polish code
...
test=develop
6 years ago
minqiyang
2b430adaee
Polish code
...
test=develop
6 years ago
minqiyang
a02ce58f2c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
...
test=develop
6 years ago
Jiabin Yang
12e1719f96
Merge pull request #14352 from JiabinYang/enhance_hierachical_sigmod_op
...
Enhance hierarchical sigmoid op
6 years ago
Qiao Longfei
40f68b1349
unit test ready
6 years ago
Qiao Longfei
36e26a53b0
Optimize bilinear tensor product op ( #14485 )
...
* optimize bilinear_tensor_product
* add set zero to set grad to 0.
6 years ago
Tao Luo
4ec9de0122
Merge pull request #14628 from Sand3r-/mgallus/mkldnn-elementwise_mul
...
EltwiseMul: Changes from previous PR
6 years ago
Qiao Longfei
35b79ab865
Merge pull request #13983 from jacquesqiao/add-ctr-reader
...
Add ctr reader
6 years ago
wopeizl
b1dbbb7f88
Merge pull request #14629 from wopeizl/windows/port
...
fix the build issue on manylinux1
6 years ago
Qiao Longfei
da387720d7
fix infer compile test=develop
6 years ago
Jacek Czaja
cf40daee58
- Building fix to softmax for inference
6 years ago
Clementine
6c71c1f8f9
Add activation gelu ( #14569 )
6 years ago
Michal Gallus
9455be0ba5
EltwiseMul: Extract StringToFormat to MKLDNN helper
...
test=develop
6 years ago
peizhilin
351dc78e1c
code style fix
...
test=develop
6 years ago
Jacek Czaja
1540df51cf
- Fix to test_conv2d_transpose_mkldnn for GPU
...
test=develop
6 years ago
JiabinYang
eda069068d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enhance_hierachical_sigmod_op
6 years ago
JiabinYang
a08dc83eb0
remove arg 'non_leaf_num', test=develop
6 years ago
chengduo
6648f5ed6f
add ShareLoD for dropout_grad ( #14616 )
...
test=develop
6 years ago
peizhilin
b6b8626e9c
fix the build issue on manylinux1
6 years ago
Qiao Longfei
18fd2d01b7
update embedding api
6 years ago
JiabinYang
7594787deb
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enhance_hierachical_sigmod_op
6 years ago
JiabinYang
c469334cfb
polish python code and comment, test=develop
6 years ago
Xin Pan
3c77ce3751
Merge pull request #14593 from panyx0718/fix5
...
Protect important header files.
6 years ago
Qiao Longfei
92afbb923c
fix compile problem test=develop
6 years ago
Tao Luo
e8ef14d2a7
Merge pull request #14610 from Superjomn/revert/cache_fix
...
Revert "fix transfer cache thread_local bug (#14581 )"
6 years ago
Qiao Longfei
97cbec9b74
clean code
6 years ago
Qiao Longfei
1edd435da6
fix ci problem test=develop
6 years ago
JiabinYang
87648f8edf
merge develop, test=develop
6 years ago
Yiqun Liu
726f2cefe3
Fix bug of referencing a temporary variable. ( #14614 )
...
test=develop
6 years ago
wopeizl
db9284ecde
Merge pull request #14617 from wopeizl/windows/online
...
Windows/online
6 years ago
JiabinYang
c3c3c0b33c
polish code, test=develop
6 years ago
gongweibao
867c312bc4
Fix allreduce dependency order. ( #14586 )
6 years ago
Jacek Czaja
8bfa1fa9bb
- ASUM MKL integration
6 years ago
phlrain
487ee36aec
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
6 years ago
tangwei12
56a4912b76
Make NCE_OP more efficient and support SelectedRows ( #14469 )
...
* Fix truncated normal.
* Fix.
* Make nce support more distribution.
* Fix API.spec.
* Fix python API.
* Fix.
test=develop
* Fix API.spec
test=develop
* Fix sampler.
* Fix order of arguments in python API.
test=develop
* NCE add selectedrows support
* NCE update weighted sampling
* fix bugs in nce_op, and assign_value_op optimized
* fix bugs in nce_op, revert assign_value_op
* nce_op optimize
* nce_op optimize
* nce_op optimize
* add selectedRows test later
test=develop
* add selectedRows supported
* add selectedRows supported
test=develop
* add selectedRows supported
* add nce selectedRows supported, test=develop
* add nce selectedRows supported
* add nce selectedRows supported, test=develop
* fix height in nce, test=develop
* add ut
* add ut, test=develop
* make AutoGrownIndex inline
test=develop
* fix tinny error, test=develop
6 years ago
liuhongyu
1ffe41d722
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_cudnn_lstm
6 years ago
Qiao Longfei
9589babe12
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
...
test=develop
6 years ago
liuhongyu
05917c3c79
add cudnn lstm; test=develop
6 years ago
Zeng Jinle
1c48d61442
Merge pull request #14599 from sneaxiy/fix_mac_unittest_bug
...
Fix Mac unittest bug
6 years ago
Qiao Longfei
f35f3fe77a
ctr reader can not be used in windows
...
test=develop
6 years ago
peizhilin
6a85dd3278
Merge remote-tracking branch 'upstream/develop' into windows/build
...
test=develop
6 years ago
peizhilin
38715e6fd0
minor fix
6 years ago
JiabinYang
7389597ce2
Update API.spec, test=develop
6 years ago
peizhilin
511cc9024a
fix for build issue
6 years ago
Qiao Longfei
6bef565dac
clean code test=develop
6 years ago
Qiao Longfei
e7d1f524f3
change log level
...
test=develop
6 years ago
JiabinYang
7e4bd695e6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enhance_hierachical_sigmod_op
6 years ago
Qiao Longfei
fe54adf70c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-ctr-reader
6 years ago
JiabinYang
b10df8bcfa
refine code and add none bias ut, test=develop
6 years ago
Kaipeng Deng
251a1bb0f4
Merge pull request #14588 from heavengate/revert_interpolate
...
fix interpolate_op incompatible. test=develop
6 years ago
Qiao Longfei
668ae9083e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-ctr-reader
6 years ago
Qiyang Min
30e47bce8b
Merge branch 'develop' into revert_vlog
6 years ago
tensor-tang
3ae6692a0d
Merge pull request #14512 from tensor-tang/fea/jit/rnn
...
Fea/jit/rnn
6 years ago
superjomn
4babc6b06c
update
...
test=develop
6 years ago
sneaxiy
f3522a11d2
fix mac unittest bug
...
test=develop
6 years ago
Qiao Longfei
87e4edd2ea
fix grad_varname in remote prefetch
6 years ago
Qiyang Min
6232d1f1dd
Merge pull request #14578 from velconia/add_production_dockerfile
...
Add python3.6 and python3.7 support to production generated Dockerfile
6 years ago
superjomn
dc249d3b69
Revert "fix transfer cache thread_local bug ( #14581 )"
...
This reverts commit 5c073a4db2.
6 years ago
Qiao Longfei
d98c59fd2c
support none sliced variable
6 years ago
dengkaipeng
bb489d4cc9
add interp_method default bilinear. test=develop
6 years ago
dengkaipeng
78f563917c
revert interpolate_op to bilinear_interp_op & nearest_interp_op. test=develop
6 years ago
Jacek Czaja
fb24690a58
- conv2d transpose MKL-DNN
...
test=develop
- Added new header for MKLDNN reuse functionality
- Extended conv2d_transpose GetExpectedKernelType for MKL-DNN supporrt
- Buildable conv transpose mkldnn and conv mkldnn using conv template
- Conv2d transpose roughlt implemented and buildable
- Added modifications conv2d transpose MKLDNN unit tests
- Fix to UT of conv2d transpose mkldnn op
- Wrong type of MKLDNN primitive was chosen for conv2d transpose
- HAcks for conv2d transpose
- UT enalbed
- Replaced copying loop with memcpy
- Draft of passing lambda into AcquireMemory
- Made reorder (IOHW->OIHW) to be called only once
6 years ago
tensor-tang
7a91271436
Merge branch 'develop' into fea/jit/rnn
6 years ago
minqiyang
be04d99fe4
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into revert_vlog
...
test=develop
6 years ago
wopeizl
05b7ee7eeb
Merge pull request #14545 from wopeizl/windows/online
...
Windows/online
6 years ago
JiabinYang
81e145764d
refine code and comments, test=develop
6 years ago
minqiyang
bcaa8a3b67
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_production_dockerfile
...
test=develop
6 years ago
Qiao Longfei
af2f5fc824
fix some bugs
6 years ago
JiabinYang
2f6b529aff
refine code and comments, test=develop
6 years ago
Xin Pan
e32f4c5423
fix
...
test=develop
6 years ago
Xin Pan
3e665862b8
Protect important header files.
...
test=develop
6 years ago
minqiyang
e43f5bc77c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_dist_resnet_ut_in_py36
...
test=develop
6 years ago
minqiyang
53433d7f2e
Revert the changes of VLOG
...
test=develop
6 years ago
tensor-tang
1f0291a51e
add comments and follow comments
...
test=develop
6 years ago
tensor-tang
557229bd39
Merge remote-tracking branch 'ups/develop' into fea/jit/rnn
6 years ago
Qiao Longfei
ed9fa4b301
can run
6 years ago
peizhilin
30849d1f20
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
qingqing01
6224e61fd9
Transpose-Flatten-Concat fusion operator. ( #14568 )
...
* Transpose-Flatten-Concat fusion operator.
* Add unit testing and fix bug.
6 years ago
Yan Chunwei
5c073a4db2
fix transfer cache thread_local bug ( #14581 )
6 years ago
Xin Pan
87332bb18d
Merge pull request #14579 from Superjomn/fix/transfer-cache-compile-error
...
fix compile
6 years ago
minqiyang
8b154c172f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_dist_resnet_ut_in_py36
...
test=develop
6 years ago
Qiao Longfei
686d15c8e0
update grpc_variable_response
6 years ago
Jiabin Yang
13bc7619f5
Merge pull request #14552 from JiabinYang/fix_mac/fix_pinned_memory
...
fix Mac unittest error on reading pined memory flag
6 years ago
tangwei12
3639d99f99
Fix save and load lookup table/optimizer vars ( #14301 )
...
* fix mkdir conflict
* fix load/save lookup tables
test=develop
* add lookup_table_utils
* fix load optimize vars on pserver
* delete lookup table utils
* fix save and load lookup tables
* fix load optimizer var
* fix load optimizer var, test=develop
* fix python 3 style, test=develop
* move lookup_table_utils to contrib utils
6 years ago
peizhilin
36cd18b549
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
qingqing01
39ec80def4
Remove the memory copy of feeding data in C++ inference API ( #14577 )
...
* Remove the memory copy for feeding data in C++ inference API
* Fix compling dependence
* Fix compling in ONLY_CPU mode
6 years ago
peizhilin
b2f8d4183d
Given the different fraction_of_gpu_memory_to_use depends on platform
6 years ago
Qiao Longfei
d827881502
fix pserver and prefetch rpc
6 years ago
peizhilin
1afa9492af
Recover the profiler
6 years ago
Yiqun Liu
bf222f197d
Use sub scope in tensor_array_to_tensor op. ( #14524 )
...
test=develop
6 years ago
superjomn
4b40c0013b
fix compile
...
test=develop
6 years ago
JiabinYang
02d68051db
add sparsed bias grad, test=develop
6 years ago
dzhwinter
840c1b29ad
test=develop ( #14562 )
...
* test=develop
remove code.
* test=develop
6 years ago
Qiao Longfei
5856c2f332
change Var to FindVar
6 years ago
Yu Yang
26af9cf90c
Merge pull request #14565 from chengduoZH/fix_cublas_warp_error
...
Fix cublas warp error
6 years ago
Qiao Longfei
312b7786d9
clean code
6 years ago
Qiao Longfei
2b6c0c09d6
add unit test
6 years ago
Yan Chunwei
923c8e3332
add benchmark for inference ( #14571 )
6 years ago
minqiyang
c92c440fa1
Add python3.6 and python3.7 support to production generated Dockerfile
...
test=develop
6 years ago
Qiao Longfei
47280ef8b4
lookup table op support prefetch
6 years ago
Yan Chunwei
a7188d5bc7
fix executor transfer cache bug ( #14518 )
6 years ago
gongweibao
c1bf9664cd
Add options to disable SO_REUSEPORT of grpc. ( #14269 )
6 years ago
minqiyang
ee73810fd5
Fix API.spec
...
test=develop
6 years ago
Qiao Longfei
4ad5fd8f54
add parameter prefetch
6 years ago
Qiao Longfei
9d276fe8a8
add parameter prefetch
6 years ago
minqiyang
d2045260a5
Change visibilities of variant_visitor of pybind11
...
test=develop
6 years ago
minqiyang
b67229187e
Change to PYBIND11_MODULE because the deprecation of PYBIND11_PLUGIN
...
test=develop
6 years ago
minqiyang
81994e84e0
Change the include files because the version changes of pybind11
...
test=develop
6 years ago
Tao Luo
e90afec47b
Merge pull request #14543 from luotao1/threads
...
add thread related inference api
6 years ago
qingqing01
64ca3d176c
Add bias_attr in sequence_conv_pool API. ( #14553 )
6 years ago
chengduozh
f7847ca6a3
fix cublas warp error
...
test=develop
6 years ago
Zhaolong Xing
e52d90a35e
Merge pull request #14527 from hjchen2/develop
...
Refine split TensorRT plugin
6 years ago
Qiyang Min
4531281386
Merge pull request #14526 from velconia/add_python36and37_to_paddle_build
...
Add python 3.6 and python 3.7 support to paddle build
6 years ago
JiabinYang
47c4e65d60
test=develop
6 years ago
luotao1
116979a40a
refine api name
...
test=develop
6 years ago
luotao1
e66b4c6bff
adjust tester_helper to make multi-instance multi-thread work
...
test=develop
6 years ago
luotao1
a5c4b463c9
add SetMKLDNNThreadId api
6 years ago
luotao1
e21edb26f6
add Set/GetCPUNumThreads api
6 years ago
Qiao Longfei
9851a53478
add prefetch part in pserver
6 years ago
JiabinYang
5cd2fc9fd0
just for test
6 years ago
JiabinYang
42470f14b7
test=develop
6 years ago
peizhilin
445fff24dc
add the bigobj option to NVCC compile
...
fix code style
6 years ago
sabreshao
61c5f13fcf
Fix cmake for AMDGPU platform ( #13801 )
...
* HIP cmake.
Enable whole archieve build for pybind library.
Disable two warning.
Rollback to C++11.
Link RCCL to WA gpu kernel loading issue.
Update eigen to fix build failure.
Add more include directories.
Fix O3 build failure.
Update eigen.
fix tensor_util_test segment fault issue
add more macro check in hip.cmake.
we may consider refine hip.cmake to inherit all add_definitions() in parrent scope, in the future.
Fix rocRAND load.
Update eigen to fix gru_unit_op and reduce_op.
Add HIP support to testing.
Update eigen to support int16 and int8 in arg min and arg max.
* add rocprim as cub library used by nv implementation
* Reduce build time in rocprim.
* Add rocprim introduction, remove useless cmake code.
* Remove useless flags and format cmake file.
6 years ago
qingqing01
36f08eef3b
CUDA kernel for density_prior_box_op. ( #14513 )
...
* CUDA kernel for density_prior_box_op.
* Support flatten to 2D.
6 years ago
tensor-tang
6a7f83d45d
enable gru jitcode and refine act and lstm jitcode
...
test=develop
6 years ago
tensor-tang
686eaf20ba
Merge remote-tracking branch 'ups/develop' into fea/jit/rnn
6 years ago
peizhilin
81bd7eeff4
rollback the format
6 years ago
Qiao Longfei
1f87f263a2
clean code
6 years ago
Qiao Longfei
361cb0e078
lookup remote table can compile
6 years ago
JiabinYang
0fca16847c
temp
6 years ago
JiabinYang
e9be3366a9
test=develop
6 years ago
Zeng Jinle
bfc34ac19f
Merge pull request #14536 from sneaxiy/dlpack_integration
...
Add dlpack support
6 years ago
chengduo
00b9e9a135
Refine cublas to support CUBLAS_TENSOR_OP_MATH ( #13929 )
...
* refine cublase
test=develop
* code refine
* refine cublas
* add GEMME_EX
* add enable_cublas_tensor_op_math doc and add cublasCall
test=develop
* fix CublasCall for cuda version
test=develop
* fix error
test=develop
* fix GEMM_EX to be compatible with gcc 4.8
test=develop
* add GEMM_EX
test=develop
* to compatiable with gcc4.8
test=develop
6 years ago
peizhilin
dfbac60398
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
peizhilin
7c8c9dc9bf
fix unit test cases
6 years ago
tensor-tang
0c5ed5f6fc
enable peephole jitcode
...
test=develop
6 years ago
JiabinYang
3c6102a367
test=develop
6 years ago
Qiao Longfei
7c3ce2952d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refactor-prefetch
6 years ago
Qiao Longfei
60a4f69b3c
add lookup remote table op
6 years ago
Qiao Longfei
e0b48f7e29
init lookup remote table
6 years ago
tensor-tang
e3b61cf52b
init gru jitcode and fix lstm jitcode
...
test=develop
6 years ago
tensor-tang
0f25446574
Merge remote-tracking branch 'ups/develop' into fea/jit/rnn
6 years ago
minqiyang
d68b9ede44
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_python36and37_to_paddle_build
...
test=develop
6 years ago
Dun
ae7d22862b
Group Norm ( #13843 )
...
Add group normalization operator.
6 years ago
hjchen2
1adda8e06c
Add more unit tests for split plugin
...
test=develop
6 years ago
sneaxiy
488610a65a
merge develop
...
test=develop
6 years ago
Jiabin Yang
de2db11735
Merge pull request #14537 from reyoung/feature/fix_macos_ut
...
fix(Cpu): fix cpu compile and unittest
6 years ago
wopeizl
d9a1f3e58e
Windows/online ( #14474 )
...
* add recordio support
* disable the openblas multi-thread on windows since no support
adjust the python script
* code style
* code style
test=develop
* add create_recordio_file_reader back
* fix code style
test=develop
* fix the gtest.cmake on windows
* fix cc_test on windows
* fix the win build
test=develop
* remove fused compile support on windows
test=develop
* add the jit support
test=develop
* add the jit support, test=develop
* add the jit support, test=develop
* add the jit back
fix compile error on windows
* rollback test=develop
* test case fix
* disable DSO by default on windows
* exclude warpctc_op on windows
* exclude the dynload_warpctc out on windows
test=develop
* fix the scripts error
test=develop
* disable avx on windows by default
test=develop
* re-organize the cmake file
* disable mkl on windows by default
* add warp_ctc back
* fix the dependency
* fix the dependency
* fix the build issue on windows
* remove unsupported flag on windows
* code style
* code style
test=develop
* fix issue
* add profiler, parallel_executor back
* clean up the pre-definitions on windows
* fix build issue
* test=develop
6 years ago
Yu Yang
533c5d5803
fix(Cpu): fix cpu compile and unittest
...
test=develop
6 years ago
sneaxiy
3912545ffe
add dlpack support
...
test=develop
6 years ago
JiabinYang
57a18e32a1
test=develop
6 years ago
peizhilin
bef475c92b
Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Tao Luo
5d4d117edc
Merge pull request #14502 from qingqing01/cudnn5_fix
...
Fix compling with cuDNN v5
6 years ago
Jiabin Yang
f7b55de9e5
Merge branch 'develop' into enhance_hierachical_sigmod_op
6 years ago
Yu Yang
e68c1fcd5a
Merge pull request #14522 from reyoung/feature/fix_op_header_deps
...
fix(Compile): fix depends error when compile op using cub
6 years ago
hjchen2
6eba5bd276
Fix direct copy and refine split ut
...
test=develop
6 years ago
Qiao Longfei
fd290c2580
fix mac compile of analysis
...
test=develop
6 years ago
hjchen2
5857fb3014
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into develop
...
test=develop
6 years ago
tensor-tang
3562051302
add gru refer code and remove redundant avx code
...
test=develop
6 years ago
JiabinYang
af9a3301da
test=develop
6 years ago
hjchen2
3e3599f3d9
Refine split tensorrt plugin
6 years ago
peizhilin
f10e196fc8
fix build issue
6 years ago
Yu Yang
6a128dea32
Merge pull request #14515 from reyoung/feature/fix_macos_build
...
fix(Macos): fix compile on macos
6 years ago