Commit Graph

2022 Commits (f2c9c33f158890e0cfba827db3a328c317962d4c)

Author SHA1 Message Date
yuyang18 0d29e65924 Add resize_bilinear
7 years ago
yuyang18 b000e0de5d Simplize API Reference Documentation
7 years ago
chengduoZH 0c851cab22 add SSA graph checker
7 years ago
fengjiayi b587a7f66e
Merge pull request #11293 from JiayiFeng/update_crop_op
7 years ago
Xin Pan 259e63d4a1
Merge pull request #11248 from panyx0718/dist
7 years ago
Xin Pan 2d7c836d32
Merge pull request #11295 from panyx0718/doc
7 years ago
Yu Yang 8deff48db0
Merge pull request #11081 from reyoung/feature/python_doc
7 years ago
fengjiayi c7bbfb33ad Fix a GPU bug
7 years ago
Yancey1989 1239fce771 polish sparse update code
7 years ago
chengduoZH 1076e85135 refine logic
7 years ago
Yancey 0aa9546eed
fix dist train error (#11281)
7 years ago
Xin Pan e80c6b3c24 Refine API doc string
7 years ago
fengjiayi 24649a780d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update_crop_op
7 years ago
dzhwinter d48172f22a
split reduce op into multiple libraries, accelerate the compiling (#11029)
7 years ago
fengjiayi 5803115720
Merge pull request #11288 from JiayiFeng/fix_bug_of_ExecutionContext
7 years ago
fengjiayi 9c61409a18 Make crop op supporting taking offsets as one of its inputs
7 years ago
dzhwinter f7c96f079b
Big data op_test benchmark, for checking output consistent in different runs. (#10646)
7 years ago
fengjiayi 9ce0885067 Merge branch 'fix_bug_of_ExecutionContext' into update_crop_op
7 years ago
fengjiayi dc8e0b494d fix bugs in the implementation of 'HasInput' and 'HasOutput'
7 years ago
fengjiayi 4f46a98fa9 stash
7 years ago
mozga-intel 3ff9ba0e6b Mkldnn layout (#11040)
7 years ago
chengduoZH 8291b916d6 replace graph_builder_factory with ssa_graph_builder_factory
7 years ago
chengduoZH 9ac785be39 check graph's validation
7 years ago
fengjiayi a1e046bfc0
Merge pull request #11270 from JiayiFeng/fix_a_error_on_max
7 years ago
Yu Yang 03073df182
Merge pull request #11237 from chengduoZH/add_fuse_var_op_handle
7 years ago
Tao Luo 6d80dd5a50
Merge pull request #11222 from luotao1/trt
7 years ago
fengjiayi 499dbe0536 fix a multi-thread bug in readers
7 years ago
fengjiayi 7344210070 Merge branch 'fix_a_error_on_max' into fix_reader_bug
7 years ago
fengjiayi 2f5e310167 fix a compile error
7 years ago
yuyang18 8149b0a9aa Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_fuse_var_op_handle
7 years ago
fengjiayi 5d45793936 hot fix
7 years ago
yuyang18 59d75bda74 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/python_doc
7 years ago
Luo Tao f6fb51a164 add test_mode in trt/activation_op
7 years ago
fengjiayi 65a94be1a1
Merge pull request #11223 from JiayiFeng/dev_reverse_op
7 years ago
Luo Tao c73977af03 Merge branch 'develop' into trt
7 years ago
Yu Yang ff9b1a0f95
Merge pull request #11234 from reyoung/feature/refine_code
7 years ago
Yu Yang 08823146ec
Merge pull request #11232 from reyoung/feature/extract_tensor
7 years ago
gongweibao 2028a8ef6d
Add rpc_client interface. (#11154)
7 years ago
Xin Pan ca2d6d3c66
Merge pull request #11224 from dzhwinter/fix/cudnn
7 years ago
tensor-tang 3a294042c8
Merge pull request #11233 from tensor-tang/multithreads
7 years ago
Yan Chunwei 4f95bc9463
feature/trt engine op test (#11182)
7 years ago
qingqing01 e0a32074bd
Fix PADDLE_ASSERT. (#10981)
7 years ago
Xin Pan e0895e49dc remove some seems unused codes.
7 years ago
tensor-tang 4b7b17a84f fix conflcts
7 years ago
yuyang18 d9af153232 SSA Graph Builder Factory
7 years ago
yuyang18 b6c8701e45 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/extract_tensor
7 years ago
chengduoZH a584bc86dd add fuse var op handle
7 years ago
Xin Pan 106ee9d1cc
Merge pull request #11243 from panyx0718/scope
7 years ago
tensor-tang 64323b1caf Merge remote-tracking branch 'ups/develop' into multithreads
7 years ago
Luo Tao e8e8ad0491 Merge branch 'develop' into trt
7 years ago
dzhwinter 44c662b4e1 Merge remote-tracking branch 'origin/develop' into fix/cudnn
7 years ago
fengjiayi ea73fb8416 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_reverse_op
7 years ago
Xin Pan 73aa5d230b small clean up and document pointer ownership.
7 years ago
tensor-tang 4ae935e2cf refine the lock in scope
7 years ago
yuyang18 df681fd4e0 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/python_doc
7 years ago
Yu Yang c36dd3b338
Merge pull request #11114 from reyoung/feature/yep
7 years ago
tensor-tang 944bdee738 Merge remote-tracking branch 'ups/develop' into multithreads
7 years ago
tensor-tang 9b34f8dabd fix abort issue in cpu multi-threads
7 years ago
tensor-tang 6840953305 refine nlp multi-threads
7 years ago
yuyang18 fc9f2d282f Extract method from tensor_impl.h to tensor.cc
7 years ago
dzhwinter 2b9ef7e249 "fix"
7 years ago
dzhwinter 75d8e8ca33 "fix compiled in manylinux"
7 years ago
fengjiayi 12d17941a1 fix a bug
7 years ago
fengjiayi 41ced8e2af Refine code
7 years ago
dzhwinter 4777aec9be "done"
7 years ago
fengjiayi aa9383f3b0 Add unit tests
7 years ago
Luo Tao e116129f03 rewrite unittest of trt_activation_op
7 years ago
fengjiayi e2bb4d072d complete C++ part
7 years ago
Yan Chunwei df87e63baa
add dfg graphviz pass (#11211)
7 years ago
dzhwinter 7971d4a310
Feature/deterministic (#11205)
7 years ago
Yuan Gao 666c94e3be Add default prior box var for box_coder_op (#11164)
7 years ago
Wu Yi 036a90f125 Refine rpc client wait sync (#11132)
7 years ago
whs d74838bda0
Prune dims supported by reduce op. (#11113)
7 years ago
Qiao Longfei 238124909e
fix protobuf memory leak (#11177)
7 years ago
sneaxiy 8164a752ba
Merge pull request #11176 from sneaxiy/zjl-dev
7 years ago
Yu Yang 78afcbffc3
Merge pull request #11116 from reyoung/feature/faster_recordio
7 years ago
sneaxiy 02cc80b30d Fix dangling pointer bug
7 years ago
Siddharth Goyal 71b6bdb5d4
Fix signed-unsigned comparison warning (#11167)
7 years ago
Qiao Longfei c7002f5ba2
Merge pull request #11134 from jacquesqiao/fix-mac-build
7 years ago
qiaolongfei 1766406f76 delete unused code
7 years ago
fengjiayi 87a5590bb7
Merge pull request #11151 from JiayiFeng/dev_update_open_files_op
7 years ago
fengjiayi 3526ac1136 refine code
7 years ago
tensor-tang 6ac47a3d90 rename Mkldnn to MKLDNN
7 years ago
fengjiayi 744cc412b7 fix a bug
7 years ago
tensor-tang 07c48dbf7a
Merge pull request #11080 from tensor-tang/nlp
7 years ago
tensor-tang 6ae7cbe252 follow comments
7 years ago
fengjiayi ee4e567dea Creating readers before training begining
7 years ago
Yuan Gao d3e99aeec3 add normalize switch to box_coder_op (#11129)
7 years ago
qiaolongfei 906334a6a4 fix build error on mac
7 years ago
tensor-tang 99d00cce93 follow comment: refine where time started
7 years ago
tensor-tang 38f8182df6 work around with dummy test
7 years ago
tensor-tang eaeb76c419 add some comments
7 years ago
yuyang18 c3632b879b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/faster_recordio
7 years ago
yuyang18 53dab95b75 Static DSO handle
7 years ago
yuyang18 c5115950a8 Use static for dlsym
7 years ago
yuyang18 7cf8b656a2 Remove lock in device context
7 years ago
yuyang18 86a61c177f Add ScopeBufferedSSAGraphExecutor
7 years ago
tensor-tang 9c687a9789 Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang 7e9f0790e0 fix scope in thread
7 years ago
Yan Chunwei 9503dbb173
fix compile error (#11119)
7 years ago
Xin Pan 7eca286159
Merge pull request #11078 from panyx0718/improve_profiler
7 years ago
tensor-tang 3206bcd929 refine log and add QPS
7 years ago
tensor-tang 06adccf6eb Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang 4a24c238c1 refine code
7 years ago
Yan Chunwei 0c0c5df4cb
feature/add TRT fc converter (#11043)
7 years ago
whs 86d8659c8d
Add python wrapper for gather op. (#11033)
7 years ago
whs 28dc9ba3c1
Add shape op to get the shape of variable. (#11048)
7 years ago
yuyang18 15193c9e93 Faster RecordIO Scanner
7 years ago
tensor-tang a4822ed897 add thread setting
7 years ago
fengjiayi d6997e5bc8
Merge pull request #11083 from JiayiFeng/dev_refine_programdesc_copy
7 years ago
tensor-tang 5387562576 add multi-thread test
7 years ago
whs 85c203b117
Make bilinear_interp_op support attrs from input. (#11041)
7 years ago
chengduo 01bbe532da
Merge pull request #11079 from chengduoZH/balance_parameter_update
7 years ago
fengjiayi 31f0533c5d fix compile errors
7 years ago
fengjiayi 04ccbed5b8 fix a compile error
7 years ago
gongweibao 4fb7cc7f5e
Move sync_mode device ctx from grpc server (#10881)
7 years ago
fengjiayi a3aca2a3cf fix bugs
7 years ago
Tao Luo 47f3cae2af
Merge pull request #11038 from sneaxiy/zjl-dev
7 years ago
tensor-tang 733718c3e7 remove the ugly test
7 years ago
Yan Chunwei 97b7502772
inference API little fix (#11069)
7 years ago
fengjiayi 2a3c58d3fe refine programdesc copy
7 years ago
chengduoZH e330cd032e balance parameter update
7 years ago
yuyang18 f9d93bfde1 Add document to random crop operator
7 years ago
tensor-tang 708bec2e56 add test
7 years ago
tensor-tang d13dd3b6a7 revert profiling
7 years ago
tensor-tang 4d11c8e9c6 retest single thread
7 years ago
Xin Pan 75ea577fd3 allow profiler and timeline to work when dev_ctx is nullptr.
7 years ago
Tao Luo ccf61b3066
Merge pull request #10703 from mozga-intel/mozga-intel/Mul_mkldnn_withdraw
7 years ago
Luo Tao aa4f685b66 fix compiler error when do not have TensorRT library
7 years ago
Yan Chunwei 211e131525
feature/tensorrt engine op (#11001)
7 years ago
Xin Pan 35d5b1b9e2
Merge pull request #11036 from panyx0718/dist_timeline
7 years ago
Xin Pan f14e579cc3 clean up
7 years ago
fengjiayi 32c0e82ca0 fix two bugs
7 years ago
Yu Yang 14905516aa
Merge pull request #10970 from JiayiFeng/dev_add_random_crop_op
7 years ago
Xin Pan cb01c59481 add back reduce_op
7 years ago
yuyang18 d2c1fac15d Merge branch 'dev_add_random_crop_op' of https://github.com/JiayiFeng/Paddle into dev_add_random_crop_op
7 years ago
yuyang18 a6c11a5d95 Fix bug in CUDA
7 years ago
fengjiayi 3165ffa42f Merge branch 'dev_add_random_crop_op' of https://github.com/JiayiFeng/Paddle into dev_add_random_crop_op
7 years ago
fengjiayi 3bce3dbce1 fix a bug
7 years ago
Xin Pan 3cb6395688 better profiler and benchmark
7 years ago
mozga-intel 30d32035b2 Withdraw mkldnn mul
7 years ago
Qiao Longfei 654f5d3c91
Merge pull request #11012 from jacquesqiao/add-auto_grown_mutex
7 years ago
sneaxiy 46f13237dc Fix bugs in framework/tensor_impl.h and polish framework/reader.cc
7 years ago
fengjiayi c95cd4742f
Merge pull request #10975 from JiayiFeng/fix_bug_in_uint8_support
7 years ago
yuyang18 45530c772e Fix GPU compile
7 years ago
fengjiayi 7c42e5de1a Polish RandomCropOp
7 years ago
qingqing01 3a29821bd5
Develop a fake dequantized op for fixed-point quantization training framework. (#10965)
7 years ago
Tao Luo 66ec827a92
Merge pull request #10941 from mozga-intel/mozga-intel/Top-K
7 years ago
tensor-tang 77599415ba enable read dataset
7 years ago
fengjiayi 56419caa1f Add .cu
7 years ago