Commit Graph

8942 Commits (1958654d6f15087c28b44759c1a8d004826f00ce)

Author SHA1 Message Date
qiaolongfei 2b9ff39f5f fix the default value prefetch_var_name_to_block_id
7 years ago
qingqing01 19fd071785
Make the normalization operator more general and fix bug in l2_normalize. (#11348)
7 years ago
Lei Wang 24391c76de Build: add make before make install to catch up Makefile change.
7 years ago
whs adc09087c1
Add slice op. (#11052)
7 years ago
qiaolongfei 6dd3f3cf27 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-merge-splited-ids
7 years ago
qiaolongfei 16658f7b59 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine-prefetch
7 years ago
Xin Pan 1d198494d7
Merge pull request #11370 from panyx0718/dist
7 years ago
chengduo 183377f410
Merge pull request #11306 from chengduoZH/enable_cpu_on_pe
7 years ago
qiaolongfei 83a577e8ce fix build problem
7 years ago
qiaolongfei fe65064827 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into refine-prefetch
7 years ago
Luo Tao 7bdb573d79 update with comments
7 years ago
qiaolongfei 506fc8d9e8 optimize code
7 years ago
gongweibao d9de6b8621
Add brpc surpport. (#11263)
7 years ago
Xin Pan 1509ae3a53 Make status update thread-safe
7 years ago
qiaolongfei ea106c91e0 optimize comment and code
7 years ago
Luo Tao 7694199050 refine docs of elementwise_op etc.
7 years ago
qiaolongfei 7f4b9656a4 set status before Finish in prefetch process
7 years ago
dzhwinter bfa3fd6f15
add inplace attribute to op_proto_maker (#10665)
7 years ago
qiaolongfei 5aba10b585 set the thread pool of prefetch to 1 to fix a bug
7 years ago
gongweibao 9087c6687f
polish (#11363)
7 years ago
qiaolongfei 8fb78f6c07 fix grpc_server_test
7 years ago
chengduoZH 173d72b481 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into enable_cpu_on_pe
7 years ago
tensor-tang b3fd9da60e
Merge pull request #11101 from mozga-intel/mozga-intel/Pool_mkldnn_layout
7 years ago
chengduoZH aadaadf735 replace use_event with use_cuda, because use_event means the program running with CUDA, so use_cuda maybe more intuitive.
7 years ago
qiaolongfei 4e36c0ecab update prefetch logic in grpc_server
7 years ago
gongweibao 627d7a64f8
Clean `sendop` `recv` operator. (#11309)
7 years ago
chengduo fa29ef0b0d
Merge pull request #11277 from chengduoZH/check_ssa_graph
7 years ago
chengduoZH 961fbce8e2 follow comments
7 years ago
Yu Yang 3fd3e500cc
Merge pull request #11346 from reyoung/feature/add_lock_to_device_ctx
7 years ago
yuyang18 2955ff5887 Polish documentation
7 years ago
qiaolongfei 0d3d4ae775 refine prefetch logic
7 years ago
chengduoZH 7b723839ef Add cpu test for parallel_executor_crf executor_fetch_feed, and enable these tests
7 years ago
sneaxiy 831909ce69
Merge pull request #11313 from sneaxiy/argmin_argmax
7 years ago
chengduoZH d24e046c1e fix allReduce bug
7 years ago
yuyang18 a1254a86ba Add lock to record_event.
7 years ago
Tao Luo 69b5a62c65
Merge pull request #11319 from luotao1/mkldnn
7 years ago
yuyang18 9b43edeae0 Polish arg_min_max_op
7 years ago
chengduoZH a57e8a4338 add cpu test
7 years ago
Yu Yang 9328c3cf7b
Merge pull request #11308 from reyoung/feature/polish_api_ref
7 years ago
qiaolongfei 0485405b3d add more debug string
7 years ago
Luo Tao 045589fae4 fix compiler error in high-level api
7 years ago
Luo Tao 79d555b9f2 Merge branch 'develop' into mkldnn
7 years ago
gongweibao 062d5a56b4
Add comments to a singleton. (#11333)
7 years ago
mozga-intel 7d5643562f MKLDNN layout: Support for batch norm operator
7 years ago
mozga-intel 9908d3cfbc MKLDNN layout: Support for convolution operator
7 years ago
mozga-intel 36031cb50f MKLDNN layout: Support for pool operator
7 years ago
qiaolongfei 509cb0bc76 add unit test, pass the unit test
7 years ago
qiaolongfei 7cebec4b7e init merge_ids_op
7 years ago
chengduoZH 1e731f5964 small fix
7 years ago
chengduoZH 495368c243 ADD CPU_NUM
7 years ago
chengduoZH 27073c284d nccl_all_reduce_op_handle => all_reduce_op_handle
7 years ago
chengduoZH 2d94697a82 code refine
7 years ago
chengduoZH 5a3c8bf813 fix in c++ side
7 years ago
Wu Yi 7bcc98089a
Merge pull request #11321 from Yancey1989/polish_sparse_update
7 years ago
guochaorong eced973091
Merge pull request #11317 from guochaorong/fix_bad_code
7 years ago
guochaorong 310598f99b
Update device_tracer.cc
7 years ago
fengjiayi fae3d8d2dc
Merge pull request #11311 from JiayiFeng/a_small_fix
7 years ago
sneaxiy 6d32e96096 remove redundant comments
7 years ago
Yancey1989 56964946d4 polish sparse update logic
7 years ago
Luo Tao c6d230e03e add FLAGS_use_mkldnn to global control use_mkldnn
7 years ago
guochaorong 04b8d3d03c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into paddle_fix
7 years ago
guochaorong 0fec9469f9 fix some bugs introduced by unfreed memory
7 years ago
yuyang18 8c9041f486 Refine LinearCRF
7 years ago
sneaxiy 568c4e5ec4 recommit using account sneaxiy
7 years ago
Yan Chunwei 145aaa4b49
loose threshold of TRT for CI in different model (#11305)
7 years ago
fengjiayi d745840a6e fix a small compile error on Mac
7 years ago
yuyang18 0d29e65924 Add resize_bilinear
7 years ago
yuyang18 b000e0de5d Simplize API Reference Documentation
7 years ago
chengduoZH 0c851cab22 add SSA graph checker
7 years ago
fengjiayi b587a7f66e
Merge pull request #11293 from JiayiFeng/update_crop_op
7 years ago
Xin Pan 259e63d4a1
Merge pull request #11248 from panyx0718/dist
7 years ago
Xin Pan 2d7c836d32
Merge pull request #11295 from panyx0718/doc
7 years ago
Yu Yang 8deff48db0
Merge pull request #11081 from reyoung/feature/python_doc
7 years ago
fengjiayi c7bbfb33ad Fix a GPU bug
7 years ago
Yancey1989 1239fce771 polish sparse update code
7 years ago
chengduoZH 1076e85135 refine logic
7 years ago
Yancey 0aa9546eed
fix dist train error (#11281)
7 years ago
Xin Pan e80c6b3c24 Refine API doc string
7 years ago
tensor-tang 80e882a398
Merge pull request #11247 from tensor-tang/infer_api
7 years ago
cuichaowen 9141bee1e7 add Anakin api for paddle (#11228)
7 years ago
fengjiayi 24649a780d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update_crop_op
7 years ago
dzhwinter d48172f22a
split reduce op into multiple libraries, accelerate the compiling (#11029)
7 years ago
fengjiayi 5803115720
Merge pull request #11288 from JiayiFeng/fix_bug_of_ExecutionContext
7 years ago
fengjiayi 9c61409a18 Make crop op supporting taking offsets as one of its inputs
7 years ago
dzhwinter f7c96f079b
Big data op_test benchmark, for checking output consistent in different runs. (#10646)
7 years ago
fengjiayi 9ce0885067 Merge branch 'fix_bug_of_ExecutionContext' into update_crop_op
7 years ago
fengjiayi dc8e0b494d fix bugs in the implementation of 'HasInput' and 'HasOutput'
7 years ago
tensor-tang e030741df9 fix gpu fraction
7 years ago
fengjiayi 4f46a98fa9 stash
7 years ago
tensor-tang 746a62ebe6 add gpu tests
7 years ago
tensor-tang 35e820dc2b Merge remote-tracking branch 'ups/develop' into infer_api
7 years ago
mozga-intel 3ff9ba0e6b Mkldnn layout (#11040)
7 years ago
chengduoZH 8291b916d6 replace graph_builder_factory with ssa_graph_builder_factory
7 years ago
chengduoZH 9ac785be39 check graph's validation
7 years ago
fengjiayi a1e046bfc0
Merge pull request #11270 from JiayiFeng/fix_a_error_on_max
7 years ago
Yu Yang 03073df182
Merge pull request #11237 from chengduoZH/add_fuse_var_op_handle
7 years ago
Tao Luo 6d80dd5a50
Merge pull request #11222 from luotao1/trt
7 years ago
fengjiayi 499dbe0536 fix a multi-thread bug in readers
7 years ago
fengjiayi 7344210070 Merge branch 'fix_a_error_on_max' into fix_reader_bug
7 years ago
fengjiayi 2f5e310167 fix a compile error
7 years ago