Commit Graph

1978 Commits (c34945800e8d68fddd681595dc12ea4b541b1342)

Author SHA1 Message Date
yuyang18 c3632b879b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/faster_recordio
7 years ago
yuyang18 53dab95b75 Static DSO handle
7 years ago
yuyang18 c5115950a8 Use static for dlsym
7 years ago
yuyang18 7cf8b656a2 Remove lock in device context
7 years ago
yuyang18 86a61c177f Add ScopeBufferedSSAGraphExecutor
7 years ago
tensor-tang 9c687a9789 Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang 7e9f0790e0 fix scope in thread
7 years ago
Yan Chunwei 9503dbb173
fix compile error (#11119)
7 years ago
Xin Pan 7eca286159
Merge pull request #11078 from panyx0718/improve_profiler
7 years ago
tensor-tang 3206bcd929 refine log and add QPS
7 years ago
tensor-tang 06adccf6eb Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang 4a24c238c1 refine code
7 years ago
Yan Chunwei 0c0c5df4cb
feature/add TRT fc converter (#11043)
7 years ago
whs 86d8659c8d
Add python wrapper for gather op. (#11033)
7 years ago
whs 28dc9ba3c1
Add shape op to get the shape of variable. (#11048)
7 years ago
yuyang18 15193c9e93 Faster RecordIO Scanner
7 years ago
tensor-tang a4822ed897 add thread setting
7 years ago
fengjiayi d6997e5bc8
Merge pull request #11083 from JiayiFeng/dev_refine_programdesc_copy
7 years ago
tensor-tang 5387562576 add multi-thread test
7 years ago
whs 85c203b117
Make bilinear_interp_op support attrs from input. (#11041)
7 years ago
chengduo 01bbe532da
Merge pull request #11079 from chengduoZH/balance_parameter_update
7 years ago
fengjiayi 31f0533c5d fix compile errors
7 years ago
fengjiayi 04ccbed5b8 fix a compile error
7 years ago
gongweibao 4fb7cc7f5e
Move sync_mode device ctx from grpc server (#10881)
7 years ago
fengjiayi a3aca2a3cf fix bugs
7 years ago
Tao Luo 47f3cae2af
Merge pull request #11038 from sneaxiy/zjl-dev
7 years ago
tensor-tang 733718c3e7 remove the ugly test
7 years ago
Yan Chunwei 97b7502772
inference API little fix (#11069)
7 years ago
fengjiayi 2a3c58d3fe refine programdesc copy
7 years ago
chengduoZH e330cd032e balance parameter update
7 years ago
yuyang18 f9d93bfde1 Add document to random crop operator
7 years ago
tensor-tang 708bec2e56 add test
7 years ago
tensor-tang d13dd3b6a7 revert profiling
7 years ago
tensor-tang 4d11c8e9c6 retest single thread
7 years ago
Xin Pan 75ea577fd3 allow profiler and timeline to work when dev_ctx is nullptr.
7 years ago
Tao Luo ccf61b3066
Merge pull request #10703 from mozga-intel/mozga-intel/Mul_mkldnn_withdraw
7 years ago
Luo Tao aa4f685b66 fix compiler error when do not have TensorRT library
7 years ago
Yan Chunwei 211e131525
feature/tensorrt engine op (#11001)
7 years ago
Xin Pan 35d5b1b9e2
Merge pull request #11036 from panyx0718/dist_timeline
7 years ago
Xin Pan f14e579cc3 clean up
7 years ago
fengjiayi 32c0e82ca0 fix two bugs
7 years ago
Yu Yang 14905516aa
Merge pull request #10970 from JiayiFeng/dev_add_random_crop_op
7 years ago
Xin Pan cb01c59481 add back reduce_op
7 years ago
yuyang18 d2c1fac15d Merge branch 'dev_add_random_crop_op' of https://github.com/JiayiFeng/Paddle into dev_add_random_crop_op
7 years ago
yuyang18 a6c11a5d95 Fix bug in CUDA
7 years ago
fengjiayi 3165ffa42f Merge branch 'dev_add_random_crop_op' of https://github.com/JiayiFeng/Paddle into dev_add_random_crop_op
7 years ago
fengjiayi 3bce3dbce1 fix a bug
7 years ago
Xin Pan 3cb6395688 better profiler and benchmark
7 years ago
mozga-intel 30d32035b2 Withdraw mkldnn mul
7 years ago
Qiao Longfei 654f5d3c91
Merge pull request #11012 from jacquesqiao/add-auto_grown_mutex
7 years ago
sneaxiy 46f13237dc Fix bugs in framework/tensor_impl.h and polish framework/reader.cc
7 years ago
fengjiayi c95cd4742f
Merge pull request #10975 from JiayiFeng/fix_bug_in_uint8_support
7 years ago
yuyang18 45530c772e Fix GPU compile
7 years ago
fengjiayi 7c42e5de1a Polish RandomCropOp
7 years ago
qingqing01 3a29821bd5
Develop a fake dequantized op for fixed-point quantization training framework. (#10965)
7 years ago
Tao Luo 66ec827a92
Merge pull request #10941 from mozga-intel/mozga-intel/Top-K
7 years ago
tensor-tang 77599415ba enable read dataset
7 years ago
fengjiayi 56419caa1f Add .cu
7 years ago
qiaolongfei fa2079b71a add mutex header
7 years ago
qiaolongfei 167312484b use unique ptr to hold auto_grown_mutex_ for default copy and move constructor
7 years ago
qiaolongfei add9ed3388 change order of member
7 years ago
Qiyang Min 580340eeb2 Shutdown pserver gracefully when SIGINT and SIGTERM was sent (#10984)
7 years ago
mozga-intel 278f55bcef Support for mklml version of code
7 years ago
mozga-intel e7f0696513 Top K with omp
7 years ago
Qiao Longfei ab953baec6
Merge pull request #10973 from jacquesqiao/fix-prefetch
7 years ago
qiaolongfei e3c041d319 add auto_grown_mutex for selected rows
7 years ago
chengduo 8c54f1fb9c
Merge pull request #10906 from chengduoZH/fix_data_trans
7 years ago
chengduo 7d1332f696
Merge pull request #11006 from chengduoZH/fix_add_check_nan_inf_in_operator
7 years ago
tensor-tang c00843f4e8 enable multi-threads
7 years ago
qiaolongfei 74701b2692 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-prefetch
7 years ago
chengduoZH cb1c657ce4 move check_nan_inf to operator
7 years ago
chengduoZH 17a076d8ed replace TensorCopy with TensorCopySync
7 years ago
Yancey d92a75bee4
Merge pull request #10550 from Yancey1989/overlap_send_op
7 years ago
fengjiayi 4785c00e1e Add support in cast op kernel
7 years ago
Yan Chunwei f5fc9c3bc1
feature/mul converter (#10841)
7 years ago
qiaolongfei 85d0301a10 fix typo
7 years ago
Yancey1989 5d7c58e469 fix code style
7 years ago
Yancey1989 8b630ae1b5 fix unit test
7 years ago
Yancey1989 6b91d407de revert variable mutex
7 years ago
Yancey1989 60d827a8b9 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
Yancey1989 20c24c05aa singleton rpc_client
7 years ago
qiaolongfei 5825196db9 fix sgd for SelectedRows bug
7 years ago
qiaolongfei c797adede7 append table grad to grad_to_block_id
7 years ago
fengjiayi 9f81f1ce57 Add uint8 support in pybind.cc
7 years ago
fengjiayi 632108dc2e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_add_random_crop_op
7 years ago
fengjiayi 20c8ff0f5f Add comments
7 years ago
Yancey1989 28596a3386 add gflag ssa_graph_path
7 years ago
qiaolongfei 25f47fc0af fix prefetch bugs, optimize code
7 years ago
fengjiayi d44dbc4a52 fix errors
7 years ago
Yancey1989 ad6c0142c4 clean up codes
7 years ago
fengjiayi 3e7ce5836f stash
7 years ago
qingqing01 3ba75d4a69
Check label range in cross entropy calculation. (#10954)
7 years ago
qingqing01 91bd5835df
Fix fill_constant_batch_size_like_op when input is LoDTensor. (#10943)
7 years ago
Yancey1989 268e9dc1c6 polish code
7 years ago
Xin Pan 2f0df56422 add inference interface impl
7 years ago
Yancey1989 ceefbf3259 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
whs 376c948e88 Polygon box transform op for OCR East detection. (#10802)
7 years ago
chengduoZH fa613206e4 update
7 years ago
tensor-tang 400f5e7c3c add threads test
7 years ago
Yancey1989 b348e159f2 fix rename var
7 years ago
Yancey c770d5c0ba
fix build error with testing and gpu on (#10932)
7 years ago
fengjiayi cd8700f172
Merge pull request #10872 from JiayiFeng/dev_CustomReader
7 years ago
tensor-tang ce20dfa236 enable more choices
7 years ago
tensor-tang 602e28bf1c use the actual data
7 years ago
Yancey1989 0aa6f9e934 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
Yu Yang d736fb8047 Disable unstable test (#10920)
7 years ago
tensor-tang 1b8b253ec1 Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang 98fb8e58fd test infer nlp
7 years ago
fengjiayi 8147063539 follow comments
7 years ago
yuyang18 85a41df32d Init commit
7 years ago
Tomasz Patejko 3e876b3e49 MKL optimized elementwise add: fix style check
7 years ago
Tomasz Patejko 9241011b31 MKL elementwise add backward: backward works for integral types with fall back to default impl
7 years ago
Tomasz Patejko fde47aae61 MKL elementwise add backward: grad inputs copied when they are not null
7 years ago
Tomasz Patejko 996d12f16d MKL optimized elementwise add backward: coding style fixes
7 years ago
Tomasz Patejko 5a622c2906 MKL elementwise add backward: Initial implementation with vector copy
7 years ago
Tomasz Patejko 01fb2be908 MKL elementwise add: default implementation used for integral types, float16 and/or GPU
7 years ago
Tomasz Patejko 6f932482f4 MKL elementwise_add: BLAS version compiles with integral types
7 years ago
Tomasz Patejko e43c8f33cd MKL elementwise add: elementwise_add uses vAdd VML function when MKL is used
7 years ago
fengjiayi 0457f06434 add comments
7 years ago
fengjiayi e4e9d3624f fix a potential bug
7 years ago
fengjiayi 239546a6c1 add unit test and fix a bug
7 years ago
chengduoZH 4bfadcd136 fix data trans
7 years ago
Yancey1989 fc06222ae9 fix async worker
7 years ago
Yan Chunwei b1d446856c
fix inference api (#10867)
7 years ago
Guo Sheng 6d2deedf1d
Merge pull request #10814 from guoshengCS/fix-ElementwiseOpInferVarType
7 years ago
Yu Yang d406c76a30
Merge pull request #10744 from reyoung/feature/refine_parallel_executor
7 years ago
Yu Yang 8653cf3004
Merge pull request #10656 from reyoung/feature/support_op_role
7 years ago
Xin Pan 0d598cf9f6
Merge pull request #10822 from panyx0718/dist_opt
7 years ago
tangwei12 397a69d942
Merge pull request #10532 from seiriosPlus/checkpoint
7 years ago
Yibing Liu 20bdc3e109
Merge pull request #10846 from kuke/deconv_group
7 years ago
fengjiayi 530556dd97
Merge pull request #10864 from JiayiFeng/dev_expose_random_gen
7 years ago
fengjiayi 2e42b31f45 refine code
7 years ago
Xin Pan 2643868c66 follow comments
7 years ago
Yibing Liu 4bafbf411c Enable groups for conv3d transpose op
7 years ago
Xin Pan 08e4970e45 follow comments
7 years ago
fengjiayi 4b395b0809 fix errors
7 years ago
yuyang18 fcbf19bf93 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_parallel_executor
7 years ago
Yancey1989 540b45350d use req_count as atomic type
7 years ago
yuyang18 ad6a8715ff Disable unstable unittests
7 years ago
Wu Yi c8919d8ee2
Merge pull request #10821 from typhoonzero/use_pinned_memory
7 years ago
Yan Chunwei 1153144fbb
Inference analysis/init data flow graph analysis (#10776)
7 years ago
fengjiayi c2436f2ca5 expose random_data_generator
7 years ago
Yu Yang a9f9fbadd9
Merge pull request #10832 from typhoonzero/fix_server_port_file_conflict
7 years ago
yuyang18 9ab12df62d Disable no good tests
7 years ago
Yancey1989 6debbcd9f9 connect fetch barrier and concat op
7 years ago
yuyang18 3b04f0099c Fix bug
7 years ago
whs 8655904be2
Enhance reduce op (#10708)
7 years ago
fengjiayi df8fbf80d9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_CustomReader
7 years ago
fengjiayi b48eba1902 complete python API and unit test
7 years ago
Yibing Liu adbf97b42d Merge branch 'develop' of upstream into deconv_group
7 years ago
yuyang18 39a44e3834 Add Debug log
7 years ago
Yibing Liu 6e13c86dff Enable multiple groups for cudnn conv transpose
7 years ago
yuyang18 fb370f4411 Refine code
7 years ago
yuyang18 53a250e45e Disable hanged tests
7 years ago
Yibing Liu 669c0df69e Add groups for conv transpose
7 years ago
yuyang18 b857ff1b45 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_parallel_executor
7 years ago
yuyang18 03e4da6d04 Fix bug
7 years ago
Xin Pan a848303e10 follow comments
7 years ago
Xin Pan 722c078b15 fix test and clean up
7 years ago
yuyang18 27e4ce7287 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/support_op_role
7 years ago
Yu Yang 62559ace44
Merge pull request #10834 from reyoung/feature/sequnce_run_tests
7 years ago
yuyang18 982ac832a7 Disable GRPC server test
7 years ago
Yancey1989 147d54ba62 update
7 years ago
Tao Luo 565606a42d
Merge pull request #10827 from luotao1/paddle_script
7 years ago
Yancey1989 e9abc66910 fix pe
7 years ago
yuyang18 b920d2c281 enable serial tests
7 years ago
yuyang18 1b69c25c92 Merge branch 'feature/sequnce_run_tests' into feature/refine_parallel_executor
7 years ago
yuyang18 91007fe974 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_parallel_executor
7 years ago
yuyang18 1426d794ff Force some unittests serial
7 years ago
yuyang18 c978259027 Add default value of op_role
7 years ago
Xin Pan 11fe3c796b clean up
7 years ago
yi.wu 5fe8de9d8a fix dist test
7 years ago
Luo Tao 91d8373b18 change CMAKE_INSTALL_PREFIX to FLUID_INSTALL_DIR
7 years ago
fengjiayi 718f63c0d2
Merge pull request #10548 from JiayiFeng/fix_warpctc_op
7 years ago
tensor-tang 406c1dd143
Merge pull request #10701 from tensor-tang/usemkldnn
7 years ago
yuyang18 0e5a278c23 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/support_op_role
7 years ago
guosheng 01fdf17e97 Fix ElementwiseOpInferVarType in elementwise_op to use the default InferVarType to find var recursively
7 years ago
Xin Pan b4dd4c048d multi-thread handlerequest
7 years ago
yi.wu 8a49a88793 send use pinned memory
7 years ago
fengjiayi 0d10514d4b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_warpctc_op
7 years ago
fengjiayi 983c9a2a30 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_CustomReader
7 years ago
tensor-tang 7205d331f9
Merge pull request #10597 from kbinias/mkldnn-activations-improvments
7 years ago
qingqing01 2a77fc507b
Enhance detection_map_op and more check in prior_box API. (#10796)
7 years ago
Qiao Longfei be26b71b9e
Add cpp trainer lib and demo (#10681)
7 years ago
tangwei12 06aa23b060 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into checkpoint
7 years ago
tangwei12 be05056524 delete old checkpoint code
7 years ago
Yancey1989 952fa04009 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
Yancey1989 62af10d440 support multiple devices
7 years ago
Krzysztof Binias 24904b914c Unique key for input data
7 years ago
Krzysztof Binias 32929cdff0 Cache input data
7 years ago
Krzysztof Binias 0aa01929c1 Add backward
7 years ago
Krzysztof Binias 0cc25a400b Realloc for forward
7 years ago
Krzysztof Binias a76d0dd488 MKL-DNN activations improvements
7 years ago
Krzysztof Binias 1c81301e6d Update activations for MKL-DNN
7 years ago
guosheng f176a9cfa1 Remove ElementwiseOpInferVarType in elementwise_op to use the default InferVarType to find var recursively
7 years ago
Qingsheng Li 9ff6715f8a
Enhanced is_empty_op for our seq2seq model (#10704)
7 years ago
dzhwinter 5828101c23
make uint8 support in data_type transform and memory optimize (#10715)
7 years ago
Tao Luo 85b6bb5886
Merge pull request #10747 from jczaja/prv-mkldnn-pooling-reuse
7 years ago
Xin Pan e526dd58ec
Merge pull request #10775 from panyx0718/fix_comile
7 years ago
Yancey 7ae03ec08b
fix lack split_byref op in parallel executor (#10770)
7 years ago