Yancey
41a9146de3
Merge pull request #9787 from Yancey1989/random_selected_rows_value
...
Initialize large table value randomly
7 years ago
fengjiayi
121166750e
Merge pull request #9894 from JiayiFeng/refine_double_buffer_code
...
Refine double_buffer code
7 years ago
Wu Yi
1bdc7261a5
Merge pull request #9578 from typhoonzero/threadpool_for_io
...
Multi stream thread pool
7 years ago
Tao Luo
2c552d4e61
Merge pull request #9630 from Xreki/core_inference_prepare
...
Split Executor.Run to Executor.Prepare and Executor.RunPreparedContext for inference
7 years ago
Yu Yang
6b20b35589
Fix Transformer Hang Problem
7 years ago
fengjiayi
3fa0ef3d71
Refine double_buffer code
7 years ago
typhoonzero
a08bf76f74
refine name
7 years ago
fengjiayi
c20cc2bd8a
Add Wait() for reshape_op
7 years ago
Tao Luo
5a4d93286d
Merge pull request #9888 from abhinavarora/fix_warnings_
...
Fix warnings in sgd_op.h
7 years ago
Yu Yang
06ddaa73f2
Merge pull request #9840 from reyoung/feature/polish_reshape_op
...
Polish reshape op
7 years ago
chengduoZH
b0267ac93a
refine broadcast op
7 years ago
chengduoZH
8b597d9ddc
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_gather_and_BCast_op_handle
7 years ago
Tao Luo
f7a54b514d
Merge pull request #9881 from mozga-intel/mozga-intel/fc_without_mkldnn
...
The fully connected: the operator is removed when the MKLDNN flag is OFF
7 years ago
Abhinav Arora
855992dab0
Fix warnings in chunk_test
7 years ago
Abhinav Arora
c241959e48
Fix CPPLint errors in operators ( #9828 )
...
* Fix CPPLint errors in operators
* Fix prior box op
* Fix Prior Box op
* Fix top_k_op.cu
* Fix pool mkmldnn
* Fix pool mkmldnn
7 years ago
Abhinav Arora
9b63b7dde0
Fix warnings in split_ids_op
7 years ago
Abhinav Arora
3794027d7f
Fix warnings in sgd_op.h
7 years ago
Kexin Zhao
617e790a59
fix cuda 7.5 compile error ( #9885 )
7 years ago
Tao Luo
859fedf3e7
Merge pull request #9871 from qingqing01/fix_bn
...
Refine batch_norm_op.
7 years ago
mozga-intel
d24b5e060f
The fully connected: the operator is removed when the MKLDNN flag is OFF
7 years ago
Yancey1989
8eac2a46f7
update by comment
7 years ago
Liu Yiqun
449bdde58a
Correct some typos.
7 years ago
chengduoZH
e26c6d78ad
code refine
7 years ago
Qiao Longfei
4c55a6022a
Dist transpiler support prefetch ( #9714 )
...
* init
* add some check
* add dist transpile logic
* add insert op for block
* init change get_pserver_program
* optimize code
* fix a bug
* can run now
* start to do table split
* start to process table gradient
* complete pserver part
* can send_vars now
* revert cpplint
* fix a bug
* optimize code
* move dist test to models
* revert the interface of distribute_transpiler.transpile
* fix prefetch_block
* optimize trainspiler code
* add comment to sum_op
* add warning log
* fix comment
* fix test_send_recv
* fix test_send_recv
* fix train with no distributed table
* optimize GetDims
7 years ago
typhoonzero
26cfc634b9
multi stream thread pool
7 years ago
Liu Yiqun
2762959f79
Merge branch 'develop' into core_inference_prepare
7 years ago
Liu Yiqun
339be6254e
Refine the order of arguments.
7 years ago
Yu Yang
ad73b331c7
Eagerly drop local scope in iteration ( #9838 )
...
* Eagerly drop local scope in iteration
* Correct create var
* Fix typo
* Debug
7 years ago
Dang Qingqing
1204d9f3d1
Refine batch_norm_op.
7 years ago
Yancey1989
9e9f5d8080
fix ci
7 years ago
Yancey1989
7132bbe6b7
update by comment
7 years ago
Yiqun Liu
e90e7ab237
Remove the use of ARCHIVE_START/END ( #9844 )
...
* Add USE_OP of all operators and kernels and remove ARCHIVE_START/END in CMakeLists.txt of inference unittests.
* Remove ARCHIVE_START/END when linking inference shared library.
* Disable some fluid related cmake operations for cross-compiling.
7 years ago
typhoonzero
70500398b6
wip
7 years ago
Tao Luo
80d7560a5f
Merge pull request #9855 from helinwang/make
...
make -j nproc when making inference_lib_dist
7 years ago
Helin Wang
35b0ed369c
make -j nproc when making inference_lib_dist
7 years ago
Yang Yang(Tony)
a64edbf14e
delete backward.cc related code on the python side ( #9854 )
7 years ago
Yang Yang(Tony)
b26f505002
remove net op and cond_op ( #9663 )
...
* remove net op and cond_op
* fix cpplint
* fix dependency
* delete backward_test; fix compile
* disable batch_norm backward
* rm test_net.py
* make batchnorm test independent of backward.cc
* make test_layer_norm_op independent of backward.cc
* make test_layer_norm_op independent of backward.cc
* delete unused code
* clean up
7 years ago
Siddharth Goyal
8d3ce01f36
Fix cpplint errors for a set of operators ( #9837 )
...
* Fix cpplint errors, round2
* Fix pointer issue
7 years ago
chengduoZH
ae5923e205
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/add_gather_and_BCast_op_handle
7 years ago
chengduoZH
e7684911fd
add gather op handle
7 years ago
Liu Yiqun
bf485999f4
Merge branch 'develop' into core_inference_prepare
7 years ago
Yu Yang
daa5011722
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/polish_reshape_op
7 years ago
Yu Yang
f605f647eb
Merge pull request #9847 from reyoung/feature/disable_test_infer_fit_a_line
...
Disable unstable unittest
7 years ago
Yu Yang
5ceea265bb
Disable unstable unittest
7 years ago
Yancey1989
1aada35206
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into random_selected_rows_value
7 years ago
Yu Yang
52987902c9
Polish reshape op
7 years ago
dzhwinter
62d1f9a7cb
"done"
7 years ago
typhoonzero
dfc60251bf
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multigpumultinode
7 years ago
dzhwinter
80bd1ca01f
"fix the style"
7 years ago
fengjiayi
90084a25d2
Merge pull request #9743 from JiayiFeng/modify_readers_to_fit_parallel_executor
...
Modify readers to fit the parallel executor
7 years ago
chengduoZH
124c93081d
remove ContextMap
7 years ago
typhoonzero
e2ca8e1097
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into threadpool_for_io
7 years ago
typhoonzero
d1e63a1d92
fix ci
7 years ago
typhoonzero
16a9dfe480
finish
7 years ago
dzhwinter
996b3e15fa
Merge remote-tracking branch 'origin/develop' into speed/sequence_expand
7 years ago
Yu Yang
c64190ecbb
Polish NCCLHelper
7 years ago
Yu Yang
7483555a81
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/change_int64
7 years ago
JiayiFeng
273f4892b2
update recordio unittest
7 years ago
qingqing01
129859e732
Support data type int64 in NCCL. ( #9818 )
7 years ago
chengduoZH
6db96ec23c
follow comments
7 years ago
JiayiFeng
72b5de05fe
update unittest
7 years ago
chengduoZH
8eaec5dd7c
add BCast and Gather
7 years ago
Tao Luo
0ad892a0d4
Merge pull request #9816 from luotao1/remove_op
...
add remove_op, remove_var in Python end
7 years ago
typhoonzero
ec6976810d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multigpumultinode
7 years ago
Kexin Zhao
7ed457e77a
Fix cuda 7.5 error with cublas GEMM ( #9811 )
...
* fix gemm error for cuda 7.5
* fix version number
7 years ago
Xin Pan
20f202ac91
Merge pull request #9824 from wanghaoshuang/fix_parall_exe
...
Fix lost of LoD while splitting tensor in parallel executor.
7 years ago
Siddharth Goyal
cea391217a
Fix cpplint errors ( #9800 )
7 years ago
fengjiayi
7a78294666
Remove Readers' HasNext()
7 years ago
fengjiayi
a84b81502c
Remove Readers' HasNext()
7 years ago
wanghaoshuang
19c1a68ee9
Fix lost of LoD while splitting tensor in parallel executor.
7 years ago
typhoonzero
ce08dc8751
have stream removed error
7 years ago
Yu Yang
40e3fe173c
Make cuda_helper.h Pass cpplint
7 years ago
typhoonzero
0bf799a523
wip testing
7 years ago
chengduo
e0babe7c3e
Merge pull request #9780 from chengduoZH/feature/fix_batch_size_is_littler_than_gpu_count
...
Crash training, if the number of samples is less than the count of devices.
7 years ago
typhoonzero
b9c28df922
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multigpumultinode
7 years ago
Luo Tao
e7467d94a0
add remove_op, remove_var in Python end
7 years ago
Yancey1989
cb7bbf426c
revert uniform_random_op
7 years ago
Yancey1989
291aa23153
Merge branch 'random_selected_rows_value' of github.com:Yancey1989/Paddle into random_selected_rows_value
7 years ago
Yancey1989
1cc09c7bc5
Merge branch 'develop' of github.com:PaddlePaddle/Paddle into random_selected_rows_value
7 years ago
Yancey1989
3f6fc10b9f
new op that init table value randomly
7 years ago
chengduo
b1224da8d9
Move reduceSum to elementwise_op_function.h ( #9773 )
...
* add cuda_device_functions.h
* move reduceSum to elementwise_op_function.h
7 years ago
JiayiFeng
ee178d5aeb
fix bugs
7 years ago
Yu Yang
161344bf47
Merge pull request #9774 from reyoung/feature/simplify_data_structures
...
Simplify DataStructure in SSAGraph
7 years ago
Kexin Zhao
0f38bb4593
add fp16 support to activation op ( #9769 )
7 years ago
Yi Wang
8dbd9c394e
Fix part of the cpplint errors in fluid/platform ( #9802 )
7 years ago
qingqing01
add367c3f4
Code cleanup in the profiler code. ( #9782 )
7 years ago
Xin Pan
326b434ab5
Merge pull request #9699 from panyx0718/test
...
Avoid init_p2p all the times
7 years ago
Tao Luo
ddff83ff14
Merge pull request #9302 from Xreki/core_inference_multi_thread
...
Add multi-thread inference example which shares the inference_program and parameters
7 years ago
typhoonzero
22f03a1b3b
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into multigpumultinode
7 years ago
Yu Yang
17bfe3f6be
Change to avoid GCC4.8 bug
7 years ago
Liu Yiqun
720f6196ea
Change the seed and make it not fixed for multi-threads cases.
7 years ago
Liu Yiqun
e24172eb54
Simplify the inference unittest of fit a line and add some comment.
7 years ago
JiayiFeng
759979998c
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into modify_dev
7 years ago
Tao Luo
44c346be89
Merge pull request #9776 from abhinavarora/cpplint_ops_a
...
Fix CPPlint issues in some operators
7 years ago
Tao Luo
554c917e7b
Merge pull request #9779 from abhinavarora/cpplint_ops_t
...
Fix CPPLint issues in spp_op, sum_op, topk_op, transpose_op, unpool_Op and warpctc_op
7 years ago
Yancey1989
f909ff1a36
update unit test
7 years ago
fengjiayi
5416bac5d8
Make shared decorated readers' creater be only in main_program
7 years ago
Liu Yiqun
bdb21f6bc3
Merge branch 'develop' into core_inference_multi_thread
7 years ago
chengduoZH
7e7611d067
when the number of samples of current batch is less than the count of devices, let it crash.
7 years ago
Abhinav Arora
1ea4490b14
Resolve cuda compilation error
7 years ago