2131 Commits (1296d96e2e1d143bf002732b4bb138d93a1187cd)

Author SHA1 Message Date
sneaxiy 568c4e5ec4 recommit using account sneaxiy
7 years ago
Yan Chunwei 145aaa4b49 loose threshold of TRT for CI in different model (#11305)
7 years ago
fengjiayi d745840a6e fix a small compile error on Mac
7 years ago
yi.wu 5be454bf33 polish docs
7 years ago
yuyang18 0d29e65924 Add resize_bilinear
7 years ago
yuyang18 b000e0de5d Simplize API Reference Documentation
7 years ago
chengduoZH 0c851cab22 add SSA graph checker
7 years ago
fengjiayi b587a7f66e Merge pull request #11293 from JiayiFeng/update_crop_op
7 years ago
Xin Pan 259e63d4a1 Merge pull request #11248 from panyx0718/dist
7 years ago
Xin Pan 2d7c836d32 Merge pull request #11295 from panyx0718/doc
7 years ago
Yu Yang 8deff48db0 Merge pull request #11081 from reyoung/feature/python_doc
7 years ago
fengjiayi c7bbfb33ad Fix a GPU bug
7 years ago
Yancey1989 1239fce771 polish sparse update code
7 years ago
chengduoZH 1076e85135 refine logic
7 years ago
Yancey 0aa9546eed fix dist train error (#11281)
7 years ago
Xin Pan e80c6b3c24 Refine API doc string
7 years ago
tensor-tang b8d315fb69 make scope thread safe
7 years ago
fengjiayi 24649a780d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update_crop_op
7 years ago
dzhwinter d48172f22a split reduce op into multiple libraries, accelerate the compiling (#11029)
7 years ago
fengjiayi 5803115720 Merge pull request #11288 from JiayiFeng/fix_bug_of_ExecutionContext
7 years ago
tensor-tang bfd42683ca Merge remote-tracking branch 'ups/develop' into refine
7 years ago
fengjiayi 9c61409a18 Make crop op supporting taking offsets as one of its inputs
7 years ago
dzhwinter f7c96f079b Big data op_test benchmark, for checking output consistent in different runs. (#10646)
7 years ago
fengjiayi 9ce0885067 Merge branch 'fix_bug_of_ExecutionContext' into update_crop_op
7 years ago
fengjiayi dc8e0b494d fix bugs in the implementation of 'HasInput' and 'HasOutput'
7 years ago
fengjiayi 4f46a98fa9 stash
7 years ago
mozga-intel 3ff9ba0e6b Mkldnn layout (#11040)
7 years ago
chengduoZH 8291b916d6 replace graph_builder_factory with ssa_graph_builder_factory
7 years ago
chengduoZH 9ac785be39 check graph's validation
7 years ago
fengjiayi a1e046bfc0 Merge pull request #11270 from JiayiFeng/fix_a_error_on_max
7 years ago
Yu Yang 03073df182 Merge pull request #11237 from chengduoZH/add_fuse_var_op_handle
7 years ago
Tao Luo 6d80dd5a50 Merge pull request #11222 from luotao1/trt
7 years ago
fengjiayi 499dbe0536 fix a multi-thread bug in readers
7 years ago
fengjiayi 7344210070 Merge branch 'fix_a_error_on_max' into fix_reader_bug
7 years ago
fengjiayi 2f5e310167 fix a compile error
7 years ago
yuyang18 8149b0a9aa Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_fuse_var_op_handle
7 years ago
fengjiayi 5d45793936 hot fix
7 years ago
yuyang18 59d75bda74 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/python_doc
7 years ago
Luo Tao f6fb51a164 add test_mode in trt/activation_op
7 years ago
tensor-tang 0693b4148d Merge remote-tracking branch 'ups/develop' into refine
7 years ago
fengjiayi 65a94be1a1 Merge pull request #11223 from JiayiFeng/dev_reverse_op
7 years ago
Luo Tao c73977af03 Merge branch 'develop' into trt
7 years ago
Yu Yang ff9b1a0f95 Merge pull request #11234 from reyoung/feature/refine_code
7 years ago
Yu Yang 08823146ec Merge pull request #11232 from reyoung/feature/extract_tensor
7 years ago
tensor-tang f326b0117e refine scope lock
7 years ago
tensor-tang 9cf1f351d2 refine nlp test
7 years ago
gongweibao 2028a8ef6d Add rpc_client interface. (#11154)
7 years ago
Xin Pan ca2d6d3c66 Merge pull request #11224 from dzhwinter/fix/cudnn
7 years ago
tensor-tang 3a294042c8 Merge pull request #11233 from tensor-tang/multithreads
7 years ago
Yan Chunwei 4f95bc9463 feature/trt engine op test (#11182)
7 years ago
qingqing01 e0a32074bd Fix PADDLE_ASSERT. (#10981)
7 years ago
Xin Pan e0895e49dc remove some seems unused codes.
7 years ago
tensor-tang 4b7b17a84f fix conflcts
7 years ago
yuyang18 d9af153232 SSA Graph Builder Factory
7 years ago
yuyang18 b6c8701e45 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/extract_tensor
7 years ago
chengduoZH a584bc86dd add fuse var op handle
7 years ago
Xin Pan 106ee9d1cc Merge pull request #11243 from panyx0718/scope
7 years ago
tensor-tang 64323b1caf Merge remote-tracking branch 'ups/develop' into multithreads
7 years ago
Luo Tao e8e8ad0491 Merge branch 'develop' into trt
7 years ago
dzhwinter 44c662b4e1 Merge remote-tracking branch 'origin/develop' into fix/cudnn
7 years ago
fengjiayi ea73fb8416 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_reverse_op
7 years ago
Xin Pan 73aa5d230b small clean up and document pointer ownership.
7 years ago
tensor-tang 4ae935e2cf refine the lock in scope
7 years ago
yuyang18 df681fd4e0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/python_doc
7 years ago
Yu Yang c36dd3b338 Merge pull request #11114 from reyoung/feature/yep
7 years ago
tensor-tang 944bdee738 Merge remote-tracking branch 'ups/develop' into multithreads
7 years ago
tensor-tang 9b34f8dabd fix abort issue in cpu multi-threads
7 years ago
tensor-tang 6840953305 refine nlp multi-threads
7 years ago
yuyang18 fc9f2d282f Extract method from tensor_impl.h to tensor.cc
7 years ago
dzhwinter 2b9ef7e249 "fix"
7 years ago
dzhwinter 75d8e8ca33 "fix compiled in manylinux"
7 years ago
fengjiayi 12d17941a1 fix a bug
7 years ago
fengjiayi 41ced8e2af Refine code
7 years ago
dzhwinter 4777aec9be "done"
7 years ago
fengjiayi aa9383f3b0 Add unit tests
7 years ago
Luo Tao e116129f03 rewrite unittest of trt_activation_op
7 years ago
fengjiayi e2bb4d072d complete C++ part
7 years ago
Yan Chunwei df87e63baa add dfg graphviz pass (#11211)
7 years ago
dzhwinter 7971d4a310 Feature/deterministic (#11205)
7 years ago
Yuan Gao 666c94e3be Add default prior box var for box_coder_op (#11164)
7 years ago
Wu Yi 036a90f125 Refine rpc client wait sync (#11132)
7 years ago
whs d74838bda0 Prune dims supported by reduce op. (#11113)
7 years ago
Qiao Longfei 238124909e fix protobuf memory leak (#11177)
7 years ago
sneaxiy 8164a752ba Merge pull request #11176 from sneaxiy/zjl-dev
7 years ago
Yu Yang 78afcbffc3 Merge pull request #11116 from reyoung/feature/faster_recordio
7 years ago
sneaxiy 02cc80b30d Fix dangling pointer bug
7 years ago
Siddharth Goyal 71b6bdb5d4 Fix signed-unsigned comparison warning (#11167)
7 years ago
Qiao Longfei c7002f5ba2 Merge pull request #11134 from jacquesqiao/fix-mac-build
7 years ago
qiaolongfei 1766406f76 delete unused code
7 years ago
fengjiayi 87a5590bb7 Merge pull request #11151 from JiayiFeng/dev_update_open_files_op
7 years ago
fengjiayi 3526ac1136 refine code
7 years ago
tensor-tang 6ac47a3d90 rename Mkldnn to MKLDNN
7 years ago
fengjiayi 744cc412b7 fix a bug
7 years ago
tensor-tang 07c48dbf7a Merge pull request #11080 from tensor-tang/nlp
7 years ago
tensor-tang 6ae7cbe252 follow comments
7 years ago
fengjiayi ee4e567dea Creating readers before training begining
7 years ago
Yuan Gao d3e99aeec3 add normalize switch to box_coder_op (#11129)
7 years ago
qiaolongfei 906334a6a4 fix build error on mac
7 years ago
tensor-tang 99d00cce93 follow comment: refine where time started
7 years ago
tensor-tang 38f8182df6 work around with dummy test
7 years ago
tensor-tang eaeb76c419 add some comments
7 years ago
yuyang18 c3632b879b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/faster_recordio
7 years ago
yuyang18 53dab95b75 Static DSO handle
7 years ago
yuyang18 c5115950a8 Use static for dlsym
7 years ago
yuyang18 7cf8b656a2 Remove lock in device context
7 years ago
yuyang18 86a61c177f Add ScopeBufferedSSAGraphExecutor
7 years ago
tensor-tang 9c687a9789 Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang 7e9f0790e0 fix scope in thread
7 years ago
Yan Chunwei 9503dbb173 fix compile error (#11119)
7 years ago
Xin Pan 7eca286159 Merge pull request #11078 from panyx0718/improve_profiler
7 years ago
tensor-tang 3206bcd929 refine log and add QPS
7 years ago
tensor-tang 06adccf6eb Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang 4a24c238c1 refine code
7 years ago
Yan Chunwei 0c0c5df4cb feature/add TRT fc converter (#11043)
7 years ago
whs 86d8659c8d Add python wrapper for gather op. (#11033)
7 years ago
whs 28dc9ba3c1 Add shape op to get the shape of variable. (#11048)
7 years ago
yuyang18 15193c9e93 Faster RecordIO Scanner
7 years ago
tensor-tang a4822ed897 add thread setting
7 years ago
fengjiayi d6997e5bc8 Merge pull request #11083 from JiayiFeng/dev_refine_programdesc_copy
7 years ago
tensor-tang 5387562576 add multi-thread test
7 years ago
whs 85c203b117 Make bilinear_interp_op support attrs from input. (#11041)
7 years ago
chengduo 01bbe532da Merge pull request #11079 from chengduoZH/balance_parameter_update
7 years ago
fengjiayi 31f0533c5d fix compile errors
7 years ago
fengjiayi 04ccbed5b8 fix a compile error
7 years ago
gongweibao 4fb7cc7f5e Move sync_mode device ctx from grpc server (#10881)
7 years ago
fengjiayi a3aca2a3cf fix bugs
7 years ago
Tao Luo 47f3cae2af Merge pull request #11038 from sneaxiy/zjl-dev
7 years ago
tensor-tang 733718c3e7 remove the ugly test
7 years ago
Yan Chunwei 97b7502772 inference API little fix (#11069)
7 years ago
fengjiayi 2a3c58d3fe refine programdesc copy
7 years ago
chengduoZH e330cd032e balance parameter update
7 years ago
yuyang18 f9d93bfde1 Add document to random crop operator
7 years ago
tensor-tang 708bec2e56 add test
7 years ago
tensor-tang d13dd3b6a7 revert profiling
7 years ago
tensor-tang 4d11c8e9c6 retest single thread
7 years ago
Xin Pan 75ea577fd3 allow profiler and timeline to work when dev_ctx is nullptr.
7 years ago
Tao Luo ccf61b3066 Merge pull request #10703 from mozga-intel/mozga-intel/Mul_mkldnn_withdraw
7 years ago
Luo Tao aa4f685b66 fix compiler error when do not have TensorRT library
7 years ago
Yan Chunwei 211e131525 feature/tensorrt engine op (#11001)
7 years ago
Xin Pan 35d5b1b9e2 Merge pull request #11036 from panyx0718/dist_timeline
7 years ago
Xin Pan f14e579cc3 clean up
7 years ago
fengjiayi 32c0e82ca0 fix two bugs
7 years ago
Yu Yang 14905516aa Merge pull request #10970 from JiayiFeng/dev_add_random_crop_op
7 years ago
Xin Pan cb01c59481 add back reduce_op
7 years ago
yuyang18 d2c1fac15d Merge branch 'dev_add_random_crop_op' of https://github.com/JiayiFeng/Paddle into dev_add_random_crop_op
7 years ago
yuyang18 a6c11a5d95 Fix bug in CUDA
7 years ago
fengjiayi 3165ffa42f Merge branch 'dev_add_random_crop_op' of https://github.com/JiayiFeng/Paddle into dev_add_random_crop_op
7 years ago
fengjiayi 3bce3dbce1 fix a bug
7 years ago
Xin Pan 3cb6395688 better profiler and benchmark
7 years ago
mozga-intel 30d32035b2 Withdraw mkldnn mul
7 years ago
Qiao Longfei 654f5d3c91 Merge pull request #11012 from jacquesqiao/add-auto_grown_mutex
7 years ago
sneaxiy 46f13237dc Fix bugs in framework/tensor_impl.h and polish framework/reader.cc
7 years ago
fengjiayi c95cd4742f Merge pull request #10975 from JiayiFeng/fix_bug_in_uint8_support
7 years ago
yuyang18 45530c772e Fix GPU compile
7 years ago
fengjiayi 7c42e5de1a Polish RandomCropOp
7 years ago
qingqing01 3a29821bd5 Develop a fake dequantized op for fixed-point quantization training framework. (#10965)
7 years ago
Tao Luo 66ec827a92 Merge pull request #10941 from mozga-intel/mozga-intel/Top-K
7 years ago
tensor-tang 77599415ba enable read dataset
7 years ago
fengjiayi 56419caa1f Add .cu
7 years ago
qiaolongfei fa2079b71a add mutex header
7 years ago
qiaolongfei 167312484b use unique ptr to hold auto_grown_mutex_ for default copy and move constructor
7 years ago
qiaolongfei add9ed3388 change order of member
7 years ago
Qiyang Min 580340eeb2 Shutdown pserver gracefully when SIGINT and SIGTERM was sent (#10984)
7 years ago
mozga-intel 278f55bcef Support for mklml version of code
7 years ago
mozga-intel e7f0696513 Top K with omp
7 years ago
Qiao Longfei ab953baec6 Merge pull request #10973 from jacquesqiao/fix-prefetch
7 years ago
qiaolongfei e3c041d319 add auto_grown_mutex for selected rows
7 years ago
chengduo 8c54f1fb9c Merge pull request #10906 from chengduoZH/fix_data_trans
7 years ago
chengduo 7d1332f696 Merge pull request #11006 from chengduoZH/fix_add_check_nan_inf_in_operator
7 years ago
tensor-tang c00843f4e8 enable multi-threads
7 years ago
qiaolongfei 74701b2692 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-prefetch
7 years ago
chengduoZH cb1c657ce4 move check_nan_inf to operator
7 years ago
chengduoZH 17a076d8ed replace TensorCopy with TensorCopySync
7 years ago
Yancey d92a75bee4 Merge pull request #10550 from Yancey1989/overlap_send_op
7 years ago
fengjiayi 4785c00e1e Add support in cast op kernel
7 years ago
Yan Chunwei f5fc9c3bc1 feature/mul converter (#10841)
7 years ago
qiaolongfei 85d0301a10 fix typo
7 years ago
Yancey1989 5d7c58e469 fix code style
7 years ago
Yancey1989 8b630ae1b5 fix unit test
7 years ago
Yancey1989 6b91d407de revert variable mutex
7 years ago
Yancey1989 60d827a8b9 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
Yancey1989 20c24c05aa singleton rpc_client
7 years ago
qiaolongfei 5825196db9 fix sgd for SelectedRows bug
7 years ago
qiaolongfei c797adede7 append table grad to grad_to_block_id
7 years ago
fengjiayi 9f81f1ce57 Add uint8 support in pybind.cc
7 years ago
fengjiayi 632108dc2e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_add_random_crop_op
7 years ago
fengjiayi 20c8ff0f5f Add comments
7 years ago
Yancey1989 28596a3386 add gflag ssa_graph_path
7 years ago
qiaolongfei 25f47fc0af fix prefetch bugs, optimize code
7 years ago
fengjiayi d44dbc4a52 fix errors
7 years ago
Yancey1989 ad6c0142c4 clean up codes
7 years ago
fengjiayi 3e7ce5836f stash
7 years ago
qingqing01 3ba75d4a69 Check label range in cross entropy calculation. (#10954)
7 years ago
qingqing01 91bd5835df Fix fill_constant_batch_size_like_op when input is LoDTensor. (#10943)
7 years ago
Yancey1989 268e9dc1c6 polish code
7 years ago
Xin Pan 2f0df56422 add inference interface impl
7 years ago
Yancey1989 ceefbf3259 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
whs 376c948e88 Polygon box transform op for OCR East detection. (#10802)
7 years ago
chengduoZH fa613206e4 update
7 years ago
tensor-tang 400f5e7c3c add threads test
7 years ago
Yancey1989 b348e159f2 fix rename var
7 years ago
Yancey c770d5c0ba fix build error with testing and gpu on (#10932)
7 years ago
fengjiayi cd8700f172 Merge pull request #10872 from JiayiFeng/dev_CustomReader
7 years ago
tensor-tang ce20dfa236 enable more choices
7 years ago
tensor-tang 602e28bf1c use the actual data
7 years ago
Yancey1989 0aa6f9e934 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
Yu Yang d736fb8047 Disable unstable test (#10920)
7 years ago
tensor-tang 1b8b253ec1 Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang 98fb8e58fd test infer nlp
7 years ago
fengjiayi 8147063539 follow comments
7 years ago
yuyang18 85a41df32d Init commit
7 years ago
Tomasz Patejko 3e876b3e49 MKL optimized elementwise add: fix style check
7 years ago
Tomasz Patejko 9241011b31 MKL elementwise add backward: backward works for integral types with fall back to default impl
7 years ago
Tomasz Patejko fde47aae61 MKL elementwise add backward: grad inputs copied when they are not null
7 years ago
Tomasz Patejko 996d12f16d MKL optimized elementwise add backward: coding style fixes
7 years ago
Tomasz Patejko 5a622c2906 MKL elementwise add backward: Initial implementation with vector copy
7 years ago
Tomasz Patejko 01fb2be908 MKL elementwise add: default implementation used for integral types, float16 and/or GPU
7 years ago
Tomasz Patejko 6f932482f4 MKL elementwise_add: BLAS version compiles with integral types
7 years ago
Tomasz Patejko e43c8f33cd MKL elementwise add: elementwise_add uses vAdd VML function when MKL is used
7 years ago
fengjiayi 0457f06434 add comments
7 years ago
fengjiayi e4e9d3624f fix a potential bug
7 years ago
fengjiayi 239546a6c1 add unit test and fix a bug
7 years ago
chengduoZH 4bfadcd136 fix data trans
7 years ago
Yancey1989 fc06222ae9 fix async worker
7 years ago
Yan Chunwei b1d446856c fix inference api (#10867)
7 years ago
Guo Sheng 6d2deedf1d Merge pull request #10814 from guoshengCS/fix-ElementwiseOpInferVarType
7 years ago
Yu Yang d406c76a30 Merge pull request #10744 from reyoung/feature/refine_parallel_executor
7 years ago
Yu Yang 8653cf3004 Merge pull request #10656 from reyoung/feature/support_op_role
7 years ago
Xin Pan 0d598cf9f6 Merge pull request #10822 from panyx0718/dist_opt
7 years ago
tangwei12 397a69d942 Merge pull request #10532 from seiriosPlus/checkpoint
7 years ago
Yibing Liu 20bdc3e109 Merge pull request #10846 from kuke/deconv_group
7 years ago
fengjiayi 530556dd97 Merge pull request #10864 from JiayiFeng/dev_expose_random_gen
7 years ago
fengjiayi 2e42b31f45 refine code
7 years ago
Xin Pan 2643868c66 follow comments
7 years ago
Yibing Liu 4bafbf411c Enable groups for conv3d transpose op
7 years ago
Xin Pan 08e4970e45 follow comments
7 years ago
fengjiayi 4b395b0809 fix errors
7 years ago
yuyang18 fcbf19bf93 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_parallel_executor
7 years ago
Yancey1989 540b45350d use req_count as atomic type
7 years ago
yuyang18 ad6a8715ff Disable unstable unittests
7 years ago
Wu Yi c8919d8ee2 Merge pull request #10821 from typhoonzero/use_pinned_memory
7 years ago
Yan Chunwei 1153144fbb Inference analysis/init data flow graph analysis (#10776)
7 years ago
fengjiayi c2436f2ca5 expose random_data_generator
7 years ago
Yu Yang a9f9fbadd9 Merge pull request #10832 from typhoonzero/fix_server_port_file_conflict
7 years ago
yuyang18 9ab12df62d Disable no good tests
7 years ago
Yancey1989 6debbcd9f9 connect fetch barrier and concat op
7 years ago
yuyang18 3b04f0099c Fix bug
7 years ago
whs 8655904be2 Enhance reduce op (#10708)
7 years ago
fengjiayi df8fbf80d9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_CustomReader
7 years ago
fengjiayi b48eba1902 complete python API and unit test
7 years ago