dzhwinter
57be5c6c74
"fix double type error" ( #10322 )
...
* "fix double type error"
* "fix ci"
7 years ago
Qiao Longfei
faebadd938
Merge pull request #10228 from jacquesqiao/use-multi-thread-todo-update
...
Use multi thread to do update
7 years ago
Yancey
ff99d94197
Merge pull request #10164 from Yancey1989/lookup_sparse_table_op
...
add lookup_sparse_table_op
7 years ago
typhoonzero
b3cf429e02
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_grpc_server_ready_condition
7 years ago
typhoonzero
e7ac709b4b
done
7 years ago
Abhinav Arora
1945b729b6
Fix CPPLint issues with math/sequence_padding ( #10317 )
...
* Fix cpplint issues in sequence_padding
* Fix typo in cu file
* Fix dependencies of sequence_padding
* Add include
7 years ago
chengduo
9bcd9f661b
fix cpplint error ( #10329 )
7 years ago
Abhinav Arora
55f0d84029
Fix Cpplint Issues in fluid/inference/tensorrt/ ( #10318 )
...
* Fix CPPLint issues in fluid/inference/tensorrt/
* Fix compile errors
7 years ago
Yu Yang
4db43c6c9f
Naive implement cblas
7 years ago
fengjiayi
a1a401eb26
fix
7 years ago
fengjiayi
d11b8e56e5
fix
7 years ago
Luo Tao
9945265f09
Merge branch 'develop' into tr_convert_init
7 years ago
typhoonzero
a131c73fcf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_grpc_server_ready_condition
7 years ago
chengduo
3222cf16f7
Merge pull request #10325 from chengduoZH/fix_shfl_sync
...
Fix shfl_sync for CUDA8.0
7 years ago
Kexin Zhao
4613aeba0e
Merge pull request #10272 from kexinzhao/save_fp16
...
Add float16 support to save op
7 years ago
Yu Yang
60d6348e69
Revert develop
7 years ago
Yu Yang
86af6bdc81
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/clean_blas
7 years ago
Yang yaming
9a8be9daca
Merge pull request #10223 from pkuyym/fix-10219
...
Change `customize_loss_grad` to `use_default_grad_scale`.
7 years ago
Yu Yang
49dedfad17
Polish code and tests
7 years ago
chengduoZH
90d73c79c3
fix shfl_sync for CUDA8.0
7 years ago
qiaolongfei
d86626df84
optimize log
7 years ago
qiaolongfei
ebf0027391
use IOThreadPool to dispatch async update task
7 years ago
qiaolongfei
ea372b3452
add more log
7 years ago
Abhinav Arora
738585476d
Fix more CPPLint issues in fluid/operators/math ( #10276 )
...
* Fix CPPLint issues in lstm_cpu_kernel.h
* Fix CPPLint issues in math/math_function_test
* Fix CPPLint issues in math/math_function_test
* Fix CPPLint issues in math/concat.cc
* Fix CPPLint issues in math/concat.cc
* Fix CPPLint issues in math/concat.cc
* Fix CPPLint issues in math/gru_cpu_kernel
* Fix CPPLint issues in math/selected_rows_functor_test.cu
* Fix compile error
* Fix compile error
7 years ago
Helin Wang
d25fdb0a47
fix build: cuda_helper.h not found
7 years ago
dzhwinter
eb6f9dd5de
Feature/cuda9 cudnn7 ( #10140 )
...
* "re-commit "
* "picked up"
* "fix ci"
* "fix pdb hang up issue in cuda 9"
7 years ago
typhoonzero
008f6df9b2
update
7 years ago
chengduo
f61dfeedcc
Merge pull request #10263 from chengduoZH/add_FLAGS_use_deterministic_algo
...
Add FLAGS_cudnn_algo_use_autotune
7 years ago
typhoonzero
ef48f3c766
wip
7 years ago
Yu Yang
c888e01660
Refactor GEMM in blas
7 years ago
yangyaming
13fac4232a
Fix to pass CI.
7 years ago
Yu Yang
c0ac0cd6b3
Complete rename
7 years ago
Qingsheng Li
79be1bb3df
Merge branch 'develop' into fix-10026
7 years ago
ktlichkid
48466b4424
auto => auto*
7 years ago
Yu Yang
6c18410487
Revert code to develop
7 years ago
chengduoZH
9fda5c92cd
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_FLAGS_use_deterministic_algo
7 years ago
yangyaming
f456cd8079
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-10219
7 years ago
Yu Yang
76174ec0e9
Clean cross entropy and add sync in executor
7 years ago
Yu Yang
25779c982d
Merge pull request #10265 from reyoung/feature/polish_code
...
Add comments and polish code
7 years ago
chengduoZH
c5774e3282
add FLAGS_use_deterministic_algo
7 years ago
Abhinav Arora
e735359631
Fix more CPPlint issues in fluid/operators/math ( #10249 )
...
* Fix CPPLint errors
* Fix CPPLint errors in sequence2batch
* Fix compilation
* Fix LSTM op and GRU op
* Fix LSTMP op
* Fix more cpplint errors in operators/math
* Address Code review feedback
7 years ago
Kexin Zhao
efba1c7dcb
address comments
7 years ago
Kexin Zhao
6c88f1ae6e
add save op float16 support
7 years ago
qiaolongfei
f82cb635cf
optimize code, add more log
7 years ago
Yu Yang
9612c7e599
Add comments and polish code
7 years ago
Yu Yang
deabc8ca0b
Merge branch 'develop' into feature/clean_memcpy_async
7 years ago
fengjiayi
71fa3ca9c4
Merge pull request #10232 from JiayiFeng/fix_unittests
...
Fix unittests
7 years ago
qingqing01
76c4ae856f
Fix reshape op. ( #10253 )
7 years ago
Yancey1989
1a93253f16
fix unittest
7 years ago
fengjiayi
30f9dc92e5
fix errors
7 years ago
whs
2f9fa9b721
Merge pull request #10167 from wanghaoshuang/fluid_init
...
Add init interface for customize devices.
7 years ago
Luo Tao
6f6f330423
update the register method
7 years ago
ktlichkid
9997c916fc
Pull origin
...
Merge branch 'fix-10026' of github.com:ktlichkid/Paddle into fix-10026
7 years ago
ktlichkid
709a9edd46
Code clean up
7 years ago
fengjiayi
330fa95cbd
Follow comments
7 years ago
dyning
4a5bfa89c3
Modify RoI pooling op to use LoDTensor and expose it into Python API ( #10208 )
...
* modify roi pool with lod and expose ROI Pooling into Python API
* make lod code brief
* make doc more clearly
* make doc more clearly
7 years ago
Tomasz Patejko
e498e1fc56
Adam operator optimized with Eigen ( #10229 )
...
* Some changes for Adam profiling
* Adam optimization: initial Eigen optimization
* Eigen Adam: flavour of adam can be chosen
* Eigen Adam used for CPU by default. Plain Adam used for GPU
* Eigen Adam: missing call to the Eigen functor added
* Eigen Adam: revert changes in benchmarks
* Eigen Adam: typo corrected
7 years ago
Kexin Zhao
0ecc6fa8f3
Add float16 transpiler and image classification example ( #10109 )
...
* add float16 transpiler
* fix feed fetch target names mismatch
* fix cast op input change issue
* fix program desc flush error
* fix inconsistent var names in block desc bug
* code clean up
* add float16 infernce C++ example and fix prune bug
7 years ago
Abhinav Arora
83b1a8f6bf
Pending more CPPLint errors in fluid/operators/math ( #10243 )
...
* Fix CPPLint issue in test_engine
* Fix CPPLint errors in operators/math
* Fix compilation
7 years ago
baiyf
c816121d11
optimized iou_similarity_op ( #10231 )
7 years ago
fengjiayi
b88721213f
fix broadcast_op_test and reduce_op_test
7 years ago
fengjiayi
bcf260e1e8
fix several unit tests
7 years ago
qiaolongfei
b058189941
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into use-multi-thread-todo-update
7 years ago
Qiao Longfei
6d934560c7
Merge pull request #10042 from jacquesqiao/add-async-listen-and-serv-op
...
listen_and_serv_op support async update
7 years ago
Abhinav Arora
f457d5da06
Fix more CPPLint errors ( #10218 )
...
* Fix more CPPLint issues
* Fix more CPPLint issues
* Fix more CPPLint issues
* Fix CPPLint issues in operators/math and operators/reader
7 years ago
qiaolongfei
0d491b670a
use-multi-thread-todo-update
7 years ago
qiaolongfei
3295f31076
optimize naming
7 years ago
yangyaming
18d6254d44
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-10219
7 years ago
qiaolongfei
46342a2306
delete useless code
7 years ago
wanghaoshuang
848fb00215
Fix comments.
7 years ago
fengjiayi
9c7fa6ff69
Merge pull request #10206 from JiayiFeng/blocking_queue_for_reader
...
Blocking queue for reader
7 years ago
qiaolongfei
0264ec3957
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-listen-and-serv-op
7 years ago
qiaolongfei
63bd38bd74
code optimize
7 years ago
Yu Yang
c02ba51de0
Merge pull request #10191 from reyoung/feature/strict_dynload
...
Make dyload strictly use the same ABI in header
7 years ago
Yancey1989
dccd013bd3
refine distribute transpiler
7 years ago
fengjiayi
8bd34664f1
fix unit test error
7 years ago
fengjiayi
17c51d69d1
fix unit test error
7 years ago
fengjiayi
304b6b7138
Follow comments
7 years ago
yangyaming
82571deb89
Change `customize_loss_grad` to `use_default_grad_scale`.
7 years ago
Luo Tao
326221acec
Merge branch 'develop' into tr_convert_init
7 years ago
Abhinav Arora
4c8ff72615
Fix CPPLint errors with rxecutor ( #10212 )
7 years ago
Siddharth Goyal
5fe1fe3a27
Fix signed/unsigned comparison warning ( #10211 )
7 years ago
fengjiayi
4cb63d8451
Remove unnecessary header files
7 years ago
Yancey1989
e393c86c4a
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into lookup_sparse_table_op
7 years ago
Luo Tao
c4e3010b14
use template to do registry
7 years ago
Yan Chunwei
2d57158e2b
fea/init tensorrt engine ( #10003 )
7 years ago
Luo Tao
d599de5c41
auto registray op converters
7 years ago
fengjiayi
e057ba6877
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into blocking_queue_for_reader
7 years ago
fengjiayi
e2ca42408b
Replace Channel in DoubleBufferReader with BlockingQueue
7 years ago
fengjiayi
a786611314
Replace Channel in MultiFileReader with BlockingQueue
7 years ago
fengjiayi
1a25f3cd07
Add reader blocking queue
7 years ago
Yu Yang
0c24b3f937
Clean memcpy async
7 years ago
Xin Pan
64babc9aeb
Merge pull request #10189 from reyoung/feature/fix_matmul_bug
...
Fix batch_gemm bugs
7 years ago
Yu Yang
580dad0c2c
Fix compile when there is no mkl
7 years ago
Yu Yang
3d53631bad
Make dyload strictly use the same ABI in header
7 years ago
Yu Yang
2a06e307d0
Fix batch_gemm bugs
...
stride should be int64_t, not int
7 years ago
Yancey1989
8aea5cac0a
add attr auto_grown_table
7 years ago
gongweibao
2f53cd0a76
Fix beam_search memory leak. ( #10185 )
7 years ago
ktlichkid
5afc2a9972
Keep up with upstream
...
Merge remote-tracking branch 'upstream/develop' into fix-10026
7 years ago
Qiao Longfei
63bf82ddea
Merge branch 'develop' into add-async-listen-and-serv-op
7 years ago
Tao Luo
8b2391858f
Merge pull request #10181 from abhinavarora/cpplint_advanced
...
Fix CPPLint errors with framework/op_desc
7 years ago
Wu Yi
3fdfa940be
Merge pull request #10135 from typhoonzero/unify_blocking_queue
...
Unify fluid blocking queue
7 years ago
wanghaoshuang
ad3f6f4ad5
Fix devices 'not undefined' error.
7 years ago
Abhinav Arora
edd3587e50
Fix CPPLint errors with op_desc
7 years ago
Yang Yang(Tony)
81dfc0cf0e
Clean up unused code in operator class ( #10035 )
...
* delete unused IsNetOp() and Rename()
* rm OperatorBase::Rename implementation
* delete Operator::InputVars()
* remove unused OperatorBase::ShareLoD; ShareLoD has been implemented in infershape
* organize operatorbase; remove unused set_type
* add comments
* fix comment
7 years ago
Abhinav Arora
f09aed0475
Fix CPPLint issues in framework/data_transform framework/prune.cc ( #10178 )
...
* Fic CPPLint issues with data_transform
* Fic CPPLint issues with prune.cc
7 years ago
wangyang59
72ee737f3f
Merge pull request #9308 from wangyang59/bilinear
...
Bilinear interp op
7 years ago
Yang Yang(Tony)
2182ecfbbd
remove duplicated ShareLoD in gru_op and sequence_conv_op ( #10149 )
...
* remove share lod; it has already been performed in infershape
* slightly release test bound of test_image_classification
7 years ago
gongweibao
6171705a2c
Potential bug in paddle/fluid/platform/CMakeLists.txt ( #9723 )
...
* fix
* nv_library
* add with_gpu
* revert
7 years ago
gongweibao
fc025f5265
Fix memory leak of pserver ( #10173 )
7 years ago
wanghaoshuang
3d96b3811a
Fix InitGflags.
7 years ago
Luo Tao
48473dddf4
Merge branch 'develop' into tr_convert_init
7 years ago
wanghaoshuang
a4b452a2d6
Remove initP2P(bool) and init function in framework.
7 years ago
Yu Yang
4ecc9b7bae
Merge pull request #10166 from reyoung/feature/train_and_test_recordio
...
Add demo for recordio train/test and parallel executor
7 years ago
wanghaoshuang
e4708565f4
Fix cpplint format.
7 years ago
wanghaoshuang
a0b258278e
Reuse 'initP2P(bool, std::vector)' in 'initP2P(bool)'
7 years ago
wanghaoshuang
f31bb1476c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fluid_init
7 years ago
Tao Luo
5a433ecb60
Merge pull request #10134 from luotao1/tensorrt_include
...
refine tensorrt cmake and dockerfile
7 years ago
chengduo
bfbbe19fbb
Merge pull request #10150 from chengduoZH/fix_elementwise_gradient
...
Fix elementwise_gradient bug
7 years ago
wanghaoshuang
48b7b54321
Refine code.
7 years ago
Abhinav Arora
5ce57555ee
Fix CPPLint issues in init.cc, init.h and library_type.h ( #10148 )
...
* Fix CPPLint issues in init
* Fix compilation
* Fix typo in init.cc
* Fix CPPLint issues in library_type.h
* Fix compilation in init.h
7 years ago
chengduoZH
0f5d5b1ffc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_elementwise_gradient
7 years ago
wanghaoshuang
1bdea0a8d2
Add init interface for customize devices.
7 years ago
qiaolongfei
8081e15774
fix send_recv_op_test
7 years ago
Yu Yang
2c8fe4e544
Merge pull request #10143 from typhoonzero/fix_multiGPU_dist_train
...
Fix multi gpu dist train
7 years ago
Yu Yang
e5f2cb8021
Merge pull request #10160 from JiayiFeng/develop
...
fix fetch_op_handle
7 years ago
Yu Yang
54ada9449e
Add demo for recordio train/test and parallel executor
7 years ago
Tao Luo
44fa823841
Merge pull request #9949 from mozga-intel/mozga-intel/Mul_mkldnn
...
Initial implementation of multiplication operator for MKLDNN
7 years ago
Yancey1989
e8d802159e
add lookup_sparse_table_op
7 years ago
chengduoZH
d06c79c7a7
fix elementwise_grad op kernel and add unit test
7 years ago
fengjiayi
00b54d5803
fix fetch_op_handle
7 years ago
fengjiayi
dd5946528a
Merge pull request #10142 from JiayiFeng/Add_TensorCopySync
...
Add synchronous TensorCopy
7 years ago
Yi Wang
2486d563ba
Create README.md of fluid/recordio ( #10145 )
...
* Create README.md
* Update README.md
7 years ago
wangyang59
469a349ae3
polishing after qingqing's comments
7 years ago
wangyang59
7436b36875
make bilinear_op registration up-to-date
7 years ago
wangyang59
4a3c99f334
after rebase
7 years ago
wangyang59
d61738311a
remove dropout and nccl test due to frequent CI failures
7 years ago
wangyang59
3e6718e2de
simplified include structure
7 years ago
wangyang59
d87ac4de34
GPU of bilinear_interp_op done
7 years ago
wangyang59
ad3b3d9dc1
ported old paddle gpu bilinear_interp
7 years ago
wangyang59
67ce586453
gpu implementation of bilinear interp
7 years ago
wangyang59
f67f0cae50
finished testing cpu bilinear_interp_op
7 years ago
wangyang59
c7cd6d130b
cpu implement of bilinear interp
7 years ago
Abhinav Arora
504e60a881
Fix Cpplint issues in framework/data_type.h and framework/feed_fetch_type.h ( #10146 )
...
* Fix CPPLint issues with data_type.h
* Fix CPPLint issues with feed_fetch_type.h
7 years ago
qiaolongfei
3503c47f9a
listen and serv default sync mode
7 years ago
qiaolongfei
0881d809cd
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-listen-and-serv-op
7 years ago
Luo Tao
42febfa928
tensorrt convert init
7 years ago
typhoonzero
f03415207a
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_multiGPU_dist_train
7 years ago
typhoonzero
1fba0c578a
fix multi gpu dist train
7 years ago
fengjiayi
c5e178f4fd
fix errors
7 years ago
fengjiayi
9f11da5931
Add synchronous TensorCopy and use it in double buffer
7 years ago
ktlichkid
64509fd93b
Style fix
7 years ago
ktlichkid
294b58a9ba
Changed registered type
7 years ago
ktlichkid
df80b6ea8c
Added InferVarType
7 years ago
ktlichkid
f57efeb6d1
Added GetExpectedKernelType and Debug message
7 years ago
ktlichkid
6f06b32258
Added GetExpectedKernelType and Debug message
7 years ago
Yang yaming
f3e4e42def
Merge pull request #10130 from reyoung/feature/skip_loss
...
Add customize_loss_grad option to PE
7 years ago
qiaolongfei
a29e352b80
optimize code
7 years ago
qiaolongfei
a0ced3df82
async update can run
7 years ago
Yu Yang
55feba9b5a
Fix CPU compile
7 years ago
Wu Yi
3863c6a9a8
Merge pull request #10126 from typhoonzero/fix_para_exe_dist_train
...
Fix para exe dist train
7 years ago
typhoonzero
251e4a8ee5
unify fluid blocking queue
7 years ago
Yu Yang
7a395881d4
Add customize_loss_grad option to PE
7 years ago
Luo Tao
71f51ff64a
refine tensorrt cmake and dockerfile
7 years ago
qiaolongfei
42a15a43b7
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-listen-and-serv-op
7 years ago
typhoonzero
2b06b4b4e9
updates follow up para exe
7 years ago
typhoonzero
8ee837255e
fix send op handle local scope
7 years ago
qiaolongfei
63055a3e08
complete grad_to_id
7 years ago
chengduo
6c0356e41d
Merge pull request #10119 from chengduoZH/rollback_delay_op
...
Enable delay op feature
7 years ago
chengduoZH
f5b4ac6e50
enable delay op feature
7 years ago
Yancey1989
8023c6d749
Create sub socpe when it is necessary
7 years ago
qiaolongfei
4b86b49ecd
Merge branch 'fix-build-activation_op' of ssh://github.com/jacquesqiao/Paddle into add-async-listen-and-serv-op
7 years ago
qiaolongfei
108e71cc94
fix build activation_op.cc on mac
7 years ago
qiaolongfei
c6937abdd1
tmp
7 years ago
Tao Luo
58fd4c0994
Merge pull request #10111 from abhinavarora/cpplint_tensor_util_test
...
Fix CPPLint issues in tensor_util_test
7 years ago
qiaolongfei
1d75674614
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-listen-and-serv-op
7 years ago
Abhinav Arora
1c70600d09
Fix compile error
7 years ago
Abhinav Arora
6402b59a7c
Fix CPPLint issues in some tests in fluid/framework ( #10068 )
...
* Fix CPPLint in data_device_transform_test
* Fix compilation error
* Fix compilation error
* Fix CPPLint errors in data_layout_transform_test
* Fix CPPLint errors in data_type_transform_test
* Fix CPPLint errors in data_type_transform_test.cu
* Fix compilation error
* Fix CPPLint issues in threadpool_test
* Fix CPPLInt issues in op_registry_test
* Fix CPPLint issues in operator_test
* Fix compilation error
* test
7 years ago
Abhinav Arora
c08752ca43
Fix compile error
7 years ago
Abhinav Arora
feaf168d95
Fiux compile error
7 years ago
Abhinav Arora
b6ec45d8da
Fix CPPLint warnings in tensor_util_test
7 years ago
Siddharth Goyal
cb7f096da1
Fix cpplint error in mkldnn_activation ( #10105 )
7 years ago
Abhinav Arora
e66f0c73af
Fix CPPLint errors in framework/details ( #10104 )
7 years ago
Abhinav Arora
744ebcfa18
Fix CPPlint issues in fluid/inference ( #10075 )
7 years ago
Qiao Longfei
7a993ee4f7
Merge pull request #10080 from jacquesqiao/refine-listen-and-serve-op
...
Refine listen and serve op
7 years ago
Yu Yang
f2e400d65b
Revert "accelerate dropout ( #9902 )" ( #10082 )
...
* Revert "accelerate dropout (#9902 )"
This reverts commit 2e331c6593
.
* Correct discard
7 years ago
qiaolongfei
0763ae9a1a
remove unused file
7 years ago
qiaolongfei
dc3d2dc8ff
rename grad_map to grad_to_id
7 years ago
qiaolongfei
260bf5aceb
add sync_mode
7 years ago
chengduo
eb8e14c9fd
Merge pull request #10081 from chengduoZH/refine/gather_broadcast
...
Fix scope of gather and broadcast, and code clean
7 years ago
qiaolongfei
63fbdcf979
update send_recv_op_test
7 years ago
qiaolongfei
e2ace032ae
rename RunAsyncUpdate to RunAsyncLoop
7 years ago
qiaolongfei
f997c9b702
Merge branch 'refine-listen-and-serve-op' of ssh://github.com/jacquesqiao/Paddle into add-async-listen-and-serv-op
7 years ago
qiaolongfei
0f5a9cc9fc
change RunSyncUpdate to RunSyncLoop
7 years ago
ktlichkid
df70d5f1ce
Fixed some bugs
7 years ago
qiaolongfei
0a881a1ecf
init RunAsyncUpdate
7 years ago
qiaolongfei
36083018c1
Merge branch 'refine-listen-and-serve-op' of ssh://github.com/jacquesqiao/Paddle into add-async-listen-and-serv-op
7 years ago
Qiao Longfei
acd7309ea1
Merge pull request #10079 from jacquesqiao/fix-reduce_op_handle_test
...
fix reduce_op_handle_test
7 years ago
Yu Yang
f738691777
Merge pull request #9740 from dzhwinter/memory/activation
...
"polish activation"
7 years ago
chengduoZH
9a4ae4df79
fix scope of gather broadcast
7 years ago
qiaolongfei
d144dba4a1
simplify code
7 years ago
qiaolongfei
9c2d7df8ad
optimize code
7 years ago
Tao Luo
8e00540778
Merge pull request #10058 from Xreki/core_fix_flush
...
Add flush of program desc to update the proto information.
7 years ago
qiaolongfei
570be39195
fix build
7 years ago
qiaolongfei
9a0ad10ff1
fix reduce_op_handle_test
7 years ago
qiaolongfei
8f7c77309d
refine listen_and_serv_op
7 years ago
chengduo
cec4e6ed0d
Merge pull request #9946 from chengduoZH/feature/add_reduce_op_handle
...
Feature/add reduce op handle
7 years ago
chengduo
23a21c86b1
Merge pull request #9922 from chengduoZH/feature/refine_gather_reduce
...
Refine gather and broadcast
7 years ago
qiaolongfei
1e30c41e7b
add split string
7 years ago
qiaolongfei
d002aa7abf
update
7 years ago
Yu Yang
92b5f49ee4
Eager destruct local scope
7 years ago
qiaolongfei
a39e607798
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-async-listen-and-serv-op
7 years ago
Qiao Longfei
ba927b8811
Merge pull request #10060 from jacquesqiao/update-variable-response
...
VariableResponse support deserialize var into local scope
7 years ago
Abhinav Arora
324ab7a39a
Fix CPPLint issues with select_op ( #10072 )
7 years ago
Siddharth Goyal
122141249d
Fix cpplint for print_op ( #10070 )
...
* Fix print op cpplint errors
* Remove commented code
7 years ago
Abhinav Arora
8113de9425
Fix more CPPLint errors ( #10069 )
7 years ago
qiaolongfei
65b3138e98
add check
7 years ago
Qiao Longfei
bb4b9af7d4
Merge pull request #10056 from typhoonzero/fix_splitbyref_macbuild
...
fix split byref op mac build
7 years ago
ktlichkid
d060b5dfac
Registered beam search op
7 years ago
ktlichkid
b94c518884
Implemented BeamSearchKernel
7 years ago
qiaolongfei
0ffd33d30e
VariableResponse support deserialize var into local scope
7 years ago
Liu Yiqun
7ffbcbcaf0
Add flush of program desc to update the proto information.
7 years ago
typhoonzero
ba8b0a5dc2
fix mac build
7 years ago
whs
aa4d51b095
Merge pull request #10029 from wanghaoshuang/fix_gru_unit
...
Remove intermediate output's gradient from inputs of grad_op.
7 years ago
dzhwinter
2e331c6593
accelerate dropout ( #9902 )
...
* accelerate dropout
* accelerate dropout
* "fix the dropout test"
* "rerun ci"
* "fix ci"
* "rerun ci"
* "fix ci"
* "fix"
* "stage"
* disable
7 years ago
ktlichkid
881ea62bbf
Added BeamSearchOpMaker class
7 years ago
qiaolongfei
e84f353e1a
optimize
7 years ago
Yancey
0b8630b9b4
Merge pull request #9897 from Yancey1989/auto_grwon_sparse_table
...
Auto-grown sparse table
7 years ago
Tao Luo
82b192a3fd
Merge pull request #10041 from abhinavarora/cpplint_framework
...
Fix CPPLint errors in some framework files
7 years ago
chengduo
88f8183576
Merge branch 'develop' into feature/refine_gather_reduce
7 years ago
dzhwinter
1fc9fe773d
merge develop
7 years ago
chengduoZH
035712822c
fix VisitVariable
7 years ago
wanghaoshuang
216443de15
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_gru_unit
7 years ago
Abhinav Arora
cbbf08aee9
Fix CPPLint errors in some framework files
7 years ago
Yang Yang(Tony)
e04c43d543
add semicolon to op registry ( #10034 )
...
* script to add semicolon
* fix typo
7 years ago
Abhinav Arora
c5c7dc2e82
Fix CPPLint errors in multiclass_nms, nccl, nce, reduce and save_load_combine ( #10032 )
...
* Fix CPPLint errors in multiclass_nms, nccl, nce, reduce and save_load_combine
* Fix
7 years ago
Yiqun Liu
598035f985
Fix a bug in save_inference_model and prune when the program is initailized by load_inference_model ( #10011 )
...
* Fix bug in save_inference_model and prune when the program is initialized by load_inference_program.
* Save the transpiled program instead.
7 years ago
Abhinav Arora
9ca578d49e
Fix CPPLint issues in expand_op, gather_op and get_places_op ( #10000 )
7 years ago
Xin Pan
98c12b1a08
Clean up C++ codes. ( #10022 )
...
* Privatize OpHandleBase
* Clean up a few private members
7 years ago
qiaolongfei
1a43828780
implement main logic
7 years ago
dzhwinter
9b8ca0cfa8
"fix after merge"
7 years ago
dzhwinter
7a517dc93e
merge develop
7 years ago
wanghaoshuang
00548a1601
Remove intermediate output's gradient from inputs of grad_op.
7 years ago
Tao Luo
d84cdb7b59
Merge pull request #9911 from tonyyang-svail/unify_op_registry
...
Unify REGISTER_OP and REGISTER_OPERATOR
7 years ago
Wu Yi
d655417f90
Merge pull request #9956 from typhoonzero/split_byref_op
...
Split byref op
7 years ago
ktlichkid
17212696cd
Added BeamSearchOpMaker class
7 years ago
Yancey1989
f12b3f3617
use memcpy
7 years ago
dzhwinter
61cb4f2fdb
"fix ci"
7 years ago
Yu Yang
60e0d1aabd
Merge pull request #10023 from reyoung/feature/DtorOfPE
...
Correctly implement destructor of ParallelExecutor
7 years ago
Yu Yang
28013ef9e2
Merge pull request #10019 from reyoung/feature/feed_fetch_tensor_on_cpu
...
Fix FetchTensor on CPU
7 years ago
Yancey1989
13e7194ebd
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into auto_grwon_sparse_table
7 years ago
Yancey1989
70bf732f82
refine get interface
7 years ago
typhoonzero
ff0d9341ea
remove not used code
7 years ago
qiaolongfei
79a1a7cda0
init async gprc server
7 years ago
chengduoZH
fbb75c6bb6
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_gather_reduce
7 years ago
Yu Yang
5305c5f845
Correctly implement destructor of ParallelExecutor
7 years ago
Yu Yang
38d75c912c
Follow comments
7 years ago
chengduoZH
4760ac44f1
check the generate_op is null or not and add DEPS of broadcast_op_handle and gather_op_handle
7 years ago
mozga-intel
171471eada
Merge branch 'develop' into mozga-intel/Mul_mkldnn
7 years ago
Yu Yang
6de5fd9e8a
Fix FetchTensor on CPU
7 years ago
Yu Yang
35ebe2eca7
Clean MultiDevicesGraphBuilder
7 years ago
typhoonzero
184835856c
fix copy size
7 years ago
Yu Yang
d24ef931b5
Clean Code
7 years ago
Yancey1989
b920b51686
rename parameter
7 years ago
Yu Yang
c3c7b7bd1b
Merge pull request #9928 from reyoung/feature/stablize_code
...
Use mutex to stablize ncclCtxMap
7 years ago
typhoonzero
788636f078
update by comments
7 years ago
Yancey1989
2917a75b7a
add some comments for sparse table
7 years ago
fengjiayi
fbe562478d
Merge pull request #9994 from reyoung/feature/debug
...
Fix bugs in local_scopes
7 years ago
qingqing01
ee9832a7a9
Add Top-k Python API. ( #9973 )
...
* Add topk Python API.
* Add unit test.
* Remove the repeated API.
7 years ago
chengduo
e5b3eb9899
Merge pull request #10001 from abhinavarora/disable_test_channel
...
Disabling Channel Test to fix CI
7 years ago
chengduoZH
e63013a86f
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_reduce_op_handle
7 years ago
dzhwinter
425a1e76a2
Merge remote-tracking branch 'origin/develop' into memory/activation_memory
7 years ago
Tao Luo
2617d8bcda
Merge pull request #9993 from jczaja/prv-softmax-mkldnn-fix
...
- Added Epsilon (preventing softmax output from being too small) for softmax MKLDNN op
7 years ago
typhoonzero
e2d56832e1
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into split_byref_op
7 years ago
Abhinav Arora
74ff0be234
Merge remote-tracking branch 'origin/develop' into disable_test_channel
7 years ago
chengduoZH
1eeb2e0042
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_reduce_op_handle
7 years ago
Abhinav Arora
ed681d5235
Fix conv_mkldnn_op.cc which is causing CI failure
7 years ago
Abhinav Arora
1b5c1bcb57
Disabling Channel Test to fix CI
7 years ago
Yang Yang
68d96385e4
remove REGISTER_OP and REGISTER_OP_EX
7 years ago
Abhinav Arora
6f83142353
Fix cpplint issues with beam_search_op and beam_search_decode_op ( #9962 )
...
* Fix cpplint warnings in beam_search_decode_op
* Fix cpplint warnings in beam_search_op
* Fix test
* fix
7 years ago
Yang Yang
4b1a32db34
fix pybind.h generator
7 years ago
Abhinav Arora
2d1a6f8d4a
Fix cpplint issues in Detection_map_op ( #9969 )
...
* Fix conv_op.h
* Fix conv_mkldnn_op
* Fix cpplint issues in detection_map_op
7 years ago
Abhinav Arora
d08791d11a
Fix CPPLint issues with Chunk_eval_op ( #9964 )
7 years ago
Wu Yi
8352f93809
Merge pull request #9933 from typhoonzero/add_dist_unit_test
...
Add dist unitest data compare, ensure that dist train have same behavior as local training
7 years ago
Jacek Czaja
acdf7cbd19
- Added EPS for softmax MKLDNN op
...
- EPS added to softmax mkldnn primitive outcome is limited to training
phase
Fixes after review
clang format fixes
clang format fixes
7 years ago
dzhwinter
626227eb72
"fix ci"
7 years ago
Yu Yang
9b60d0decb
Merge pull request #9975 from reyoung/feature/VarHandleCtor
...
Using constructor for VarHandle
7 years ago
Yu Yang
06fb055a2f
New group
7 years ago
typhoonzero
c6d7c2bdff
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into split_byref_op
7 years ago
mozga-intel
6e7b883bdd
Initial implementation of multiplication operator for MKLDNN
7 years ago
Yancey1989
ca327508cc
update
7 years ago
chengduoZH
e4de957f19
code refine
7 years ago
Tao Luo
038dbb386e
Merge pull request #9958 from luotao1/find_tensorrt
...
auto find tensorrt library and install in user root
7 years ago
Yu Yang
71a2e6b73c
Reverse create var
7 years ago
Yu Yang
2e8459be71
DebugCode
7 years ago
Yu Yang
fc6f0be276
Merge pull request #9942 from reyoung/feature/tuning_pe_trans
...
Feature/tuning pe trans
7 years ago
Yu Yang
d7527681c6
Fix scale grad
7 years ago
Yu Yang
89728f8e66
update
7 years ago
Yu Yang
eb2e4eeade
Debug
7 years ago
typhoonzero
948628563f
update
7 years ago
Yu Yang
64c139e854
Using constructor for VarHandle
7 years ago
chengduoZH
4abef50123
code refine
7 years ago
Yu Yang
b4aaa00a8a
Polish logic of ParallelExecutor
7 years ago
Yu Yang
2ab12ca248
Add comments and clean code
7 years ago
Yu Yang
a822f8ddbb
Add wait
7 years ago
Yu Yang
e9e27e0f32
Revert
7 years ago
Yu Yang
7289e75836
Udpate
7 years ago
Yu Yang
e6bc7bb09a
Tuning code
7 years ago
Yu Yang
0ca28b8507
Sync Copy
7 years ago
Yancey1989
ed6241cd44
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into auto_grwon_sparse_table
7 years ago
Yu Yang
4999f85f6a
Clean RunDelayedOp method
7 years ago
Yu Yang
c0f4e0332c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/simplify_delay_logic
7 years ago
Kexin Zhao
64bf3df0f9
add print support to float16 ( #9960 )
7 years ago
Yang Yang
411e888c56
fix duplication
7 years ago
Yang Yang
ce7c2e86e5
script to fix all
7 years ago
Yang Yang
dafe06af46
first commit
7 years ago
chengduoZH
3301d4402e
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_reduce_op_handle
7 years ago
Luo Tao
d4682247e1
auto find tensorrt library
7 years ago
chengduoZH
3c5bbf42c4
make unit test to work
7 years ago
Yan Chunwei
186659798f
add tensorrt build support( #9891 )
7 years ago
typhoonzero
04c559e3aa
wip split byref op
7 years ago
chengduoZH
2aaa75ecf8
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_gather_reduce
7 years ago
typhoonzero
f86d35a269
add sharable tensor
7 years ago
chengduoZH
e39adc8600
add reduce op handle
7 years ago
guru4elephant
a097d082c7
Merge pull request #9938 from guru4elephant/fix_testcase
...
Fix testcase
7 years ago
dzhwinter
b92b408e50
"fix ci"
7 years ago
typhoonzero
745aacfc38
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_dist_unit_test
7 years ago
dzhwinter
877e2aed03
"fix ci"
7 years ago
Yu Yang
5b84c9b59c
CreateOpHandleIOs
7 years ago
typhoonzero
8c1b257596
add dist unitest data compare
7 years ago
dongdaxiang
fb873713e7
update program_desc_test testcase
7 years ago
Yu Yang
d0342f12aa
Simplify DelayOps Logic
7 years ago
dzhwinter
e54f203c55
"move to a new PR"
7 years ago
chengduoZH
690cd1f748
refine gather and broadcast
7 years ago
Yu Yang
093d227a77
Use mutex to stablize ncclCtxMap
7 years ago
whs
494c262a26
Fix average_accumulate_op for parallel executor. ( #9852 )
7 years ago
Yi Wang
630943c7a7
Update documentation ( #9918 )
7 years ago
Yi Wang
b48cf1712b
Fix cpplint errors in transform_test.cu ( #9915 )
...
* Fix cpplint errors with transformer_test.cu
* Update
7 years ago
Tao Luo
b66893829b
Merge pull request #9914 from abhinavarora/split_ids_op_warning
...
Fix compiler warnings in split_op and chunks_test
7 years ago
Abhinav Arora
7b86da7195
Fix CPPLint errors in operators ( #9826 )
...
* Fix CPPLint errors in operators
* Fix cast in softmax
* Fix softmax_mkldnn
* Fix send_recv_op_test
* Send_recv
* Fix softmax mkldnn
7 years ago
Kexin Zhao
92913027fc
fix unused var error ( #9908 )
7 years ago
Yi Wang
47609ab2b8
Document transform.h and fix cpplint errors ( #9913 )
7 years ago
chengduo
b43d87c917
Merge pull request #9825 from chengduoZH/feature/add_gather_and_BCast_op_handle
...
feature/Add Broadcast and Gather op handle
7 years ago
Yu Yang
e4cfe477db
Merge pull request #9898 from reyoung/feature/mix_cpu_gpu_op
...
Feature/mix cpu gpu op
7 years ago
fengjiayi
c1bf06f9d9
Merge pull request #9289 from dzhwinter/speed/sequence_expand
...
Speed/sequence expand
7 years ago
chengduoZH
384d6ee8ac
follow comments
7 years ago
Yu Yang
ed2d7d7d26
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/mix_cpu_gpu_op
7 years ago
Yu Yang
925c17aba7
Merge pull request #9895 from reyoung/feature/fix_transformer_hang
...
Fix Transformer Hang Problem
7 years ago
Yu Yang
4452ff76b7
Fix CPU compile
7 years ago
fengjiayi
51c219c9cd
Merge pull request #9892 from JiayiFeng/refine_reshape_op
...
Add Wait() for reshape_op
7 years ago
chengduoZH
02842cfc25
enhance broadcast_op_handle and gather_op_handle
7 years ago
Yancey1989
19152541b2
fix ci
7 years ago
Yu Yang
79be06045c
Support CPU/GPU mixture for ParallelExecutor
7 years ago
Yancey1989
ac7cb949d0
auto-grown sparse table
7 years ago
Yu Yang
7083c2a6b3
Merge branch 'feature/fix_transformer_hang' into feature/mix_cpu_gpu_op
7 years ago
Yu Yang
cedade9494
Stash
7 years ago
Yancey
41a9146de3
Merge pull request #9787 from Yancey1989/random_selected_rows_value
...
Initialize large table value randomly
7 years ago
fengjiayi
121166750e
Merge pull request #9894 from JiayiFeng/refine_double_buffer_code
...
Refine double_buffer code
7 years ago
Wu Yi
1bdc7261a5
Merge pull request #9578 from typhoonzero/threadpool_for_io
...
Multi stream thread pool
7 years ago
Tao Luo
2c552d4e61
Merge pull request #9630 from Xreki/core_inference_prepare
...
Split Executor.Run to Executor.Prepare and Executor.RunPreparedContext for inference
7 years ago
Yu Yang
6b20b35589
Fix Transformer Hang Problem
7 years ago
fengjiayi
3fa0ef3d71
Refine double_buffer code
7 years ago
typhoonzero
a08bf76f74
refine name
7 years ago
fengjiayi
c20cc2bd8a
Add Wait() for reshape_op
7 years ago
Tao Luo
5a4d93286d
Merge pull request #9888 from abhinavarora/fix_warnings_
...
Fix warnings in sgd_op.h
7 years ago
Yu Yang
06ddaa73f2
Merge pull request #9840 from reyoung/feature/polish_reshape_op
...
Polish reshape op
7 years ago
chengduoZH
b0267ac93a
refine broadcast op
7 years ago
chengduoZH
8b597d9ddc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_gather_and_BCast_op_handle
7 years ago
Tao Luo
f7a54b514d
Merge pull request #9881 from mozga-intel/mozga-intel/fc_without_mkldnn
...
The fully connected: the operator is removed when the MKLDNN flag is OFF
7 years ago
Abhinav Arora
855992dab0
Fix warnings in chunk_test
7 years ago
Abhinav Arora
c241959e48
Fix CPPLint errors in operators ( #9828 )
...
* Fix CPPLint errors in operators
* Fix prior box op
* Fix Prior Box op
* Fix top_k_op.cu
* Fix pool mkmldnn
* Fix pool mkmldnn
7 years ago
Abhinav Arora
9b63b7dde0
Fix warnings in split_ids_op
7 years ago
Abhinav Arora
3794027d7f
Fix warnings in sgd_op.h
7 years ago
Kexin Zhao
617e790a59
fix cuda 7.5 compile error ( #9885 )
7 years ago
Tao Luo
859fedf3e7
Merge pull request #9871 from qingqing01/fix_bn
...
Refine batch_norm_op.
7 years ago
mozga-intel
d24b5e060f
The fully connected: the operator is removed when the MKLDNN flag is OFF
7 years ago
Yancey1989
8eac2a46f7
update by comment
7 years ago
Liu Yiqun
449bdde58a
Correct some typos.
7 years ago
chengduoZH
e26c6d78ad
code refine
7 years ago
Qiao Longfei
4c55a6022a
Dist transpiler support prefetch ( #9714 )
...
* init
* add some check
* add dist transpile logic
* add insert op for block
* init change get_pserver_program
* optimize code
* fix a bug
* can run now
* start to do table split
* start to process table gradient
* complete pserver part
* can send_vars now
* revert cpplint
* fix a bug
* optimize code
* move dist test to models
* revert the interface of distribute_transpiler.transpile
* fix prefetch_block
* optimize trainspiler code
* add comment to sum_op
* add warning log
* fix comment
* fix test_send_recv
* fix test_send_recv
* fix train with no distributed table
* optimize GetDims
7 years ago
typhoonzero
26cfc634b9
multi stream thread pool
7 years ago
Liu Yiqun
2762959f79
Merge branch 'develop' into core_inference_prepare
7 years ago
Liu Yiqun
339be6254e
Refine the order of arguments.
7 years ago
Yu Yang
ad73b331c7
Eagerly drop local scope in iteration ( #9838 )
...
* Eagerly drop local scope in iteration
* Correct create var
* Fix typo
* Debug
7 years ago
Dang Qingqing
1204d9f3d1
Refine batch_norm_op.
7 years ago
Yancey1989
9e9f5d8080
fix ci
7 years ago
Yancey1989
7132bbe6b7
update by comment
7 years ago
Yiqun Liu
e90e7ab237
Remove the use of ARCHIVE_START/END ( #9844 )
...
* Add USE_OP of all operators and kernels and remove ARCHIVE_START/END in CMakeLists.txt of inference unittests.
* Remove ARCHIVE_START/END when linking inference shared library.
* Disable some fluid related cmake operations for cross-compiling.
7 years ago
typhoonzero
70500398b6
wip
7 years ago
Yang Yang(Tony)
a64edbf14e
delete backward.cc related code on the python side ( #9854 )
7 years ago
Yang Yang(Tony)
b26f505002
remove net op and cond_op ( #9663 )
...
* remove net op and cond_op
* fix cpplint
* fix dependency
* delete backward_test; fix compile
* disable batch_norm backward
* rm test_net.py
* make batchnorm test independent of backward.cc
* make test_layer_norm_op independent of backward.cc
* make test_layer_norm_op independent of backward.cc
* delete unused code
* clean up
7 years ago
Siddharth Goyal
8d3ce01f36
Fix cpplint errors for a set of operators ( #9837 )
...
* Fix cpplint errors, round2
* Fix pointer issue
7 years ago
chengduoZH
ae5923e205
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_gather_and_BCast_op_handle
7 years ago
chengduoZH
e7684911fd
add gather op handle
7 years ago
Liu Yiqun
bf485999f4
Merge branch 'develop' into core_inference_prepare
7 years ago
Yu Yang
daa5011722
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/polish_reshape_op
7 years ago
Yu Yang
f605f647eb
Merge pull request #9847 from reyoung/feature/disable_test_infer_fit_a_line
...
Disable unstable unittest
7 years ago
Yu Yang
5ceea265bb
Disable unstable unittest
7 years ago
Yancey1989
1aada35206
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into random_selected_rows_value
7 years ago
Yu Yang
52987902c9
Polish reshape op
7 years ago