Commit Graph

8871 Commits (ca4d528b35b105f1a33f968387aa677115286590)

Author SHA1 Message Date
fengjiayi 744cc412b7 fix a bug
7 years ago
tensor-tang 07c48dbf7a Merge pull request #11080 from tensor-tang/nlp
7 years ago
tensor-tang 6ae7cbe252 follow comments
7 years ago
fengjiayi ee4e567dea Creating readers before training begining
7 years ago
Yuan Gao d3e99aeec3 add normalize switch to box_coder_op (#11129)
7 years ago
qiaolongfei 906334a6a4 fix build error on mac
7 years ago
tensor-tang 99d00cce93 follow comment: refine where time started
7 years ago
tensor-tang 38f8182df6 work around with dummy test
7 years ago
Yan Chunwei b5b6661857 feature/simple inference demo (#11105)
7 years ago
tensor-tang eaeb76c419 add some comments
7 years ago
yuyang18 c3632b879b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/faster_recordio
7 years ago
yuyang18 53dab95b75 Static DSO handle
7 years ago
yuyang18 c5115950a8 Use static for dlsym
7 years ago
yuyang18 7cf8b656a2 Remove lock in device context
7 years ago
yuyang18 86a61c177f Add ScopeBufferedSSAGraphExecutor
7 years ago
tensor-tang 9c687a9789 Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang 7e9f0790e0 fix scope in thread
7 years ago
Yan Chunwei 9503dbb173 fix compile error (#11119)
7 years ago
Xin Pan 7eca286159 Merge pull request #11078 from panyx0718/improve_profiler
7 years ago
tensor-tang 3206bcd929 refine log and add QPS
7 years ago
tensor-tang 06adccf6eb Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang 4a24c238c1 refine code
7 years ago
Yan Chunwei 0c0c5df4cb feature/add TRT fc converter (#11043)
7 years ago
Yan Chunwei 18d640255e simplify inference api (#11104)
7 years ago
whs 86d8659c8d Add python wrapper for gather op. (#11033)
7 years ago
whs 28dc9ba3c1 Add shape op to get the shape of variable. (#11048)
7 years ago
yuyang18 15193c9e93 Faster RecordIO Scanner
7 years ago
tensor-tang a4822ed897 add thread setting
7 years ago
fengjiayi d6997e5bc8 Merge pull request #11083 from JiayiFeng/dev_refine_programdesc_copy
7 years ago
tensor-tang 5387562576 add multi-thread test
7 years ago
whs 85c203b117 Make bilinear_interp_op support attrs from input. (#11041)
7 years ago
chengduo 01bbe532da Merge pull request #11079 from chengduoZH/balance_parameter_update
7 years ago
fengjiayi 31f0533c5d fix compile errors
7 years ago
Lei Wang 86efecb93c Build: add dependencies for test_paddle_inference_api_impl. (#11064)
7 years ago
fengjiayi 04ccbed5b8 fix a compile error
7 years ago
gongweibao 4fb7cc7f5e Move sync_mode device ctx from grpc server (#10881)
7 years ago
fengjiayi a3aca2a3cf fix bugs
7 years ago
Tao Luo 47f3cae2af Merge pull request #11038 from sneaxiy/zjl-dev
7 years ago
tensor-tang 733718c3e7 remove the ugly test
7 years ago
Yan Chunwei 97b7502772 inference API little fix (#11069)
7 years ago
fengjiayi 2a3c58d3fe refine programdesc copy
7 years ago
chengduoZH e330cd032e balance parameter update
7 years ago
yuyang18 f9d93bfde1 Add document to random crop operator
7 years ago
tensor-tang 708bec2e56 add test
7 years ago
tensor-tang d13dd3b6a7 revert profiling
7 years ago
tensor-tang 4d11c8e9c6 retest single thread
7 years ago
Xin Pan 75ea577fd3 allow profiler and timeline to work when dev_ctx is nullptr.
7 years ago
Xin Pan 44ea234343 clean
7 years ago
Xin Pan 2719599e62 clean
7 years ago
Xin Pan 759cb23711 follow comments
7 years ago
Xin Pan 1555dc519a make infer test more stable.
7 years ago
dzhwinter 82e5738f53 "fix build script" (#11011)
7 years ago
Tao Luo ccf61b3066 Merge pull request #10703 from mozga-intel/mozga-intel/Mul_mkldnn_withdraw
7 years ago
Qiao Longfei 05f7d300f8 Merge pull request #11034 from jacquesqiao/fix-compile-by-std-move
7 years ago
Luo Tao aa4f685b66 fix compiler error when do not have TensorRT library
7 years ago
Yan Chunwei 211e131525 feature/tensorrt engine op (#11001)
7 years ago
Xin Pan 35d5b1b9e2 Merge pull request #11036 from panyx0718/dist_timeline
7 years ago
fengjiayi 32d50864ac Merge pull request #11042 from JiayiFeng/fix_two_bugs
7 years ago
Xin Pan f14e579cc3 clean up
7 years ago
qiaolongfei 96be582ef3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-compile-by-std-move
7 years ago
qiaolongfei e3c4a58835 add -Wno-error=pessimizing-move only on macos
7 years ago
fengjiayi 32c0e82ca0 fix two bugs
7 years ago
Xin Pan 109ee924eb add tests and polish infer impl (#11009)
7 years ago
Yu Yang 14905516aa Merge pull request #10970 from JiayiFeng/dev_add_random_crop_op
7 years ago
qiaolongfei 8868a54d5f use flags to fix the compile problem
7 years ago
Xin Pan cb01c59481 add back reduce_op
7 years ago
yuyang18 d2c1fac15d Merge branch 'dev_add_random_crop_op' of https://github.com/JiayiFeng/Paddle into dev_add_random_crop_op
7 years ago
yuyang18 a6c11a5d95 Fix bug in CUDA
7 years ago
qiaolongfei 4736281c5c restore
7 years ago
fengjiayi 3165ffa42f Merge branch 'dev_add_random_crop_op' of https://github.com/JiayiFeng/Paddle into dev_add_random_crop_op
7 years ago
fengjiayi 3bce3dbce1 fix a bug
7 years ago
qiaolongfei 47d9f64ff7 PaddlePredictorImpl::Clone return unique_ptr
7 years ago
qiaolongfei ad0bebd515 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-compile-by-std-move
7 years ago
Xin Pan 3cb6395688 better profiler and benchmark
7 years ago
mozga-intel 30d32035b2 Withdraw mkldnn mul
7 years ago
Qiao Longfei 654f5d3c91 Merge pull request #11012 from jacquesqiao/add-auto_grown_mutex
7 years ago
sneaxiy 46f13237dc Fix bugs in framework/tensor_impl.h and polish framework/reader.cc
7 years ago
qiaolongfei b38c78c3b8 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-compile-by-std-move
7 years ago
fengjiayi c95cd4742f Merge pull request #10975 from JiayiFeng/fix_bug_in_uint8_support
7 years ago
yuyang18 45530c772e Fix GPU compile
7 years ago
fengjiayi 7c42e5de1a Polish RandomCropOp
7 years ago
qiaolongfei 0606818291 rm ste::move when return a unique_ptr in inference
7 years ago
qingqing01 3a29821bd5 Develop a fake dequantized op for fixed-point quantization training framework. (#10965)
7 years ago
Tao Luo 66ec827a92 Merge pull request #10941 from mozga-intel/mozga-intel/Top-K
7 years ago
tensor-tang 77599415ba enable read dataset
7 years ago
fengjiayi 56419caa1f Add .cu
7 years ago
xzl 2905dcded6 fix code format
7 years ago
qiaolongfei fa2079b71a add mutex header
7 years ago
zlx 9f3eb91d8f fix neon depthwise conv bug
7 years ago
qiaolongfei 167312484b use unique ptr to hold auto_grown_mutex_ for default copy and move constructor
7 years ago
qiaolongfei add9ed3388 change order of member
7 years ago
Qiyang Min 580340eeb2 Shutdown pserver gracefully when SIGINT and SIGTERM was sent (#10984)
7 years ago
mozga-intel 278f55bcef Support for mklml version of code
7 years ago
mozga-intel e7f0696513 Top K with omp
7 years ago
Qiao Longfei ab953baec6 Merge pull request #10973 from jacquesqiao/fix-prefetch
7 years ago
qiaolongfei e3c041d319 add auto_grown_mutex for selected rows
7 years ago
Yan Chunwei 38af7bcaaf fix manylinux compile error caused by inference lib (#10994)
7 years ago
chengduo 8c54f1fb9c Merge pull request #10906 from chengduoZH/fix_data_trans
7 years ago
chengduo 7d1332f696 Merge pull request #11006 from chengduoZH/fix_add_check_nan_inf_in_operator
7 years ago
tensor-tang c00843f4e8 enable multi-threads
7 years ago
qiaolongfei 0858a50189 Merge branch 'fix-inference-build-on-mac' of ssh://github.com/jacquesqiao/Paddle into fix-prefetch
7 years ago
qiaolongfei d4cff07f3c fix inference build on mac
7 years ago
qiaolongfei 74701b2692 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-prefetch
7 years ago
chengduoZH cb1c657ce4 move check_nan_inf to operator
7 years ago
Wu Yi c6a53261c5 Merge pull request #11000 from typhoonzero/fix_build_scripts_nightly_build
7 years ago
chengduoZH 17a076d8ed replace TensorCopy with TensorCopySync
7 years ago
Yancey d92a75bee4 Merge pull request #10550 from Yancey1989/overlap_send_op
7 years ago
yi.wu 65f656eef2 fix nightly build and build script error
7 years ago
fengjiayi 4785c00e1e Add support in cast op kernel
7 years ago
Yan Chunwei f5fc9c3bc1 feature/mul converter (#10841)
7 years ago
qiaolongfei 85d0301a10 fix typo
7 years ago
Yancey1989 5d7c58e469 fix code style
7 years ago
Yancey1989 8b630ae1b5 fix unit test
7 years ago
Yancey1989 6b91d407de revert variable mutex
7 years ago
Yancey1989 60d827a8b9 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
Yancey1989 20c24c05aa singleton rpc_client
7 years ago
qiaolongfei 5825196db9 fix sgd for SelectedRows bug
7 years ago
Wu Yi 8f7b020ba8 fix develop build issue (#10978)
7 years ago
Darcy 8075a11f0e Merge pull request #10983 from panyx0718/fix_test
7 years ago
Xin Pan c001961034 Fix test to use less gpu memory
7 years ago
qiaolongfei c797adede7 append table grad to grad_to_block_id
7 years ago
fengjiayi 9f81f1ce57 Add uint8 support in pybind.cc
7 years ago
fengjiayi 632108dc2e Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_add_random_crop_op
7 years ago
fengjiayi 20c8ff0f5f Add comments
7 years ago
Yancey1989 28596a3386 add gflag ssa_graph_path
7 years ago
qiaolongfei 25f47fc0af fix prefetch bugs, optimize code
7 years ago
fengjiayi d44dbc4a52 fix errors
7 years ago
Yancey1989 ad6c0142c4 clean up codes
7 years ago
tensor-tang afbc4ce4b6 Merge remote-tracking branch 'ups/develop' into mklml_funcs
7 years ago
fengjiayi 3e7ce5836f stash
7 years ago
tensor-tang 537f57a4e9 fix undefine error on gpu
7 years ago
qingqing01 3ba75d4a69 Check label range in cross entropy calculation. (#10954)
7 years ago
qingqing01 91bd5835df Fix fill_constant_batch_size_like_op when input is LoDTensor. (#10943)
7 years ago
Yancey1989 268e9dc1c6 polish code
7 years ago
Xin Pan 2f0df56422 add inference interface impl
7 years ago
Yancey1989 ceefbf3259 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
whs 376c948e88 Polygon box transform op for OCR East detection. (#10802)
7 years ago
Kexin Zhao a62bbd1ddc fix float16 demo location issue (#10948)
7 years ago
Lei Wang 72149c1675 scripts: clean bash scripts. (#10721)
7 years ago
chengduoZH fa613206e4 update
7 years ago
tensor-tang 400f5e7c3c add threads test
7 years ago
Houjiang Chen 83f4e9e9a6 enable eigen multi-threads on mobile device (#10938)
7 years ago
Wu Yi 0930646b63 Merge pull request #10937 from Yancey1989/fix_renamevar
7 years ago
Yan Chunwei fd45c6d1b9 feature/inference api demo impl (#10825)
7 years ago
Yancey1989 b348e159f2 fix rename var
7 years ago
Yu Yang dd428a0131 Merge pull request #10921 from reyoung/feature/change_optimizer_tests
7 years ago
Yancey c770d5c0ba fix build error with testing and gpu on (#10932)
7 years ago
tensor-tang 315e08ebbe speedup vInvSqrt vLogqp vTanh with mklml
7 years ago
fengjiayi cd8700f172 Merge pull request #10872 from JiayiFeng/dev_CustomReader
7 years ago
tensor-tang ce20dfa236 enable more choices
7 years ago
yuyang18 c980e4cec8 Change optimizer to old paddle style
7 years ago
tensor-tang 602e28bf1c use the actual data
7 years ago
Yancey1989 0aa6f9e934 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago
Yu Yang d736fb8047 Disable unstable test (#10920)
7 years ago
tensor-tang 1b8b253ec1 Merge remote-tracking branch 'ups/develop' into nlp
7 years ago
tensor-tang 98fb8e58fd test infer nlp
7 years ago
fengjiayi 8147063539 follow comments
7 years ago
yuyang18 85a41df32d Init commit
7 years ago
Tao Luo ec7f1e99ab Merge pull request #10907 from reyoung/feature/remove_cpplint_while_compile
7 years ago
Tomasz Patejko 3e876b3e49 MKL optimized elementwise add: fix style check
7 years ago
Tomasz Patejko 9241011b31 MKL elementwise add backward: backward works for integral types with fall back to default impl
7 years ago
Tomasz Patejko fde47aae61 MKL elementwise add backward: grad inputs copied when they are not null
7 years ago
Tomasz Patejko 996d12f16d MKL optimized elementwise add backward: coding style fixes
7 years ago
Tomasz Patejko 5a622c2906 MKL elementwise add backward: Initial implementation with vector copy
7 years ago
Tomasz Patejko 01fb2be908 MKL elementwise add: default implementation used for integral types, float16 and/or GPU
7 years ago
Tomasz Patejko 6f932482f4 MKL elementwise_add: BLAS version compiles with integral types
7 years ago
Tomasz Patejko e43c8f33cd MKL elementwise add: elementwise_add uses vAdd VML function when MKL is used
7 years ago
fengjiayi 0457f06434 add comments
7 years ago
Tao Luo 174d884d2c Merge pull request #10904 from luotao1/ci
7 years ago
fengjiayi e4e9d3624f fix a potential bug
7 years ago
fengjiayi 239546a6c1 add unit test and fix a bug
7 years ago
yuyang18 a229734cbd Remove cpplint in cmake
7 years ago
chengduoZH 4bfadcd136 fix data trans
7 years ago
Yancey1989 fc06222ae9 fix async worker
7 years ago
Yan Chunwei b1d446856c fix inference api (#10867)
7 years ago
Guo Sheng 6d2deedf1d Merge pull request #10814 from guoshengCS/fix-ElementwiseOpInferVarType
7 years ago
Luo Tao 654229b232 refine fluid deploy on teamcity
7 years ago
Yu Yang d406c76a30 Merge pull request #10744 from reyoung/feature/refine_parallel_executor
7 years ago
Yu Yang 8653cf3004 Merge pull request #10656 from reyoung/feature/support_op_role
7 years ago
Xi Chen e68d196097 update cudnn version to cuda version
7 years ago
Xi Chen 1087bb0ce3 fix cudnn9 production dockerfile
7 years ago
Xin Pan 0d598cf9f6 Merge pull request #10822 from panyx0718/dist_opt
7 years ago
tangwei12 397a69d942 Merge pull request #10532 from seiriosPlus/checkpoint
7 years ago
Yibing Liu 20bdc3e109 Merge pull request #10846 from kuke/deconv_group
7 years ago
fengjiayi 530556dd97 Merge pull request #10864 from JiayiFeng/dev_expose_random_gen
7 years ago
fengjiayi 2e42b31f45 refine code
7 years ago
Xin Pan 2643868c66 follow comments
7 years ago
Yibing Liu 4bafbf411c Enable groups for conv3d transpose op
7 years ago
Xin Pan 08e4970e45 follow comments
7 years ago
fengjiayi 4b395b0809 fix errors
7 years ago
yuyang18 fcbf19bf93 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/refine_parallel_executor
7 years ago
Yancey1989 540b45350d use req_count as atomic type
7 years ago
yuyang18 ad6a8715ff Disable unstable unittests
7 years ago
Wu Yi c8919d8ee2 Merge pull request #10821 from typhoonzero/use_pinned_memory
7 years ago
Yan Chunwei 1153144fbb Inference analysis/init data flow graph analysis (#10776)
7 years ago
fengjiayi c2436f2ca5 expose random_data_generator
7 years ago
Yu Yang a9f9fbadd9 Merge pull request #10832 from typhoonzero/fix_server_port_file_conflict
7 years ago
yuyang18 9ab12df62d Disable no good tests
7 years ago
Yancey1989 6debbcd9f9 connect fetch barrier and concat op
7 years ago
yuyang18 3b04f0099c Fix bug
7 years ago