Commit Graph

1776 Commits (557be6fc58a8fad13a830df33ec77560faaa3d7c)

Author SHA1 Message Date
sneaxiy 4b83afff6e
Merge branch 'develop' into refine_elementwise_add
7 years ago
sneaxiy b2d0ee5159 refine elementwise_add op
7 years ago
tangwei12 da2cc99f67 sampling op optimize
7 years ago
fengjiayi 7c55e08c93 stash
7 years ago
tangwei12 4973e07be3 sampling op optimize
7 years ago
tensor-tang 836068569f Merge remote-tracking branch 'ups/develop' into refine/op/gru
7 years ago
tensor-tang 18c322c2a1 seperate cpu and gpu implementations for gru kernel compute
7 years ago
tensor-tang 54c95e49f0 fix blas
7 years ago
fengjiayi b656d97e86
Merge pull request #12485 from JiayiFeng/dev_ops_tensor_support
7 years ago
qiaolongfei 1623f1ba4f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-profiler
7 years ago
tangwei12 3206970b77 sampling op rename
7 years ago
Xin Pan 99a77cfc62
Merge pull request #12468 from panyx0718/improve_profiler2
7 years ago
qiaolongfei a3f9d6a38c optimize profiler
7 years ago
tangwei12 e0ab2f7158 new sampling op
7 years ago
tensor-tang 8c23f7c4f0 fix blas and use packed weight
7 years ago
tensor-tang d9cc6b1866 replace gru compute with details
7 years ago
tensor-tang 43cee33a23 add mkl packed gemm
7 years ago
tangwei12 766ac488ac sum_op selectedRows dim bug fix
7 years ago
dzhwinter 595a2c83ae
explicit gradient of elementwise_add/elementwise_sub (#11970)
7 years ago
fengjiayi e7d8e16a66 update softmax_mkldnn_op
7 years ago
Yu Yang 2567afa35d
Merge pull request #12462 from reyoung/feature/fix_cudnn_deterministic
7 years ago
fengjiayi dc111d3476 update softmax_cudnn_op
7 years ago
fengjiayi f7bd0b227b Add unittests for softmax_op
7 years ago
gongweibao 819ac3df0a
Modify style (#12465)
7 years ago
fengjiayi b314a69523 make softmax supporting tensors
7 years ago
fengjiayi b1af7e5d9b Add unittests for lookup_table_op
7 years ago
tangwei12 c4c8f60bec sum_op selectedRows dim bug fix
7 years ago
Xin Pan 486345551d clean
7 years ago
Xin Pan caf10b474f make profiler use thread_id from g_thread_id
7 years ago
Yu Yang 040fc1c39b Fix bug in cudnn_determistic
7 years ago
fengjiayi 7efdf05ac2 make look_up_op supporting tensor ids
7 years ago
Qiao Longfei 690625fe15
Merge pull request #12456 from jacquesqiao/add-profiler-to-pserver
7 years ago
qiaolongfei 7e46a8d172 fix logical bug, optimize code
7 years ago
qiaolongfei 0b62f61d29 add init flag in __init__.py for listen_and_serv_profile_period
7 years ago
dzhwinter 91fb0156ca
Memory/reshape op (#12414)
7 years ago
qiaolongfei 0b861bbca9 add profiler for listen_and_serv op
7 years ago
tensor-tang 059b27840c
Merge pull request #12408 from tensor-tang/refine/im2col
7 years ago
qiaolongfei 147bf00ffe clear mutable rows for the output of split_ids_op
7 years ago
qiaolongfei 91b114a787 change map to unordered_map
7 years ago
tensor-tang d8d2dbcfac further optimize im2col using variables
7 years ago
qiaolongfei 91f63cd401 fix split_ids_op and add unit test
7 years ago
tensor-tang 5373fe29c2 Merge remote-tracking branch 'ups/develop' into refine/im2col
7 years ago
Qiyang Min 7da453630e
Merge pull request #12403 from velconia/fix_hang_up
7 years ago
Tao Luo 5a634786af
Merge pull request #12312 from luotao1/unify
7 years ago
Bai Yifan e12b1d1792 Add flatten op (#12341)
7 years ago
Luo Tao 062556f938 Merge branch 'develop' into unify
7 years ago
chengduo 2409d0f710
Refine regularization for selected_rows (#12369)
7 years ago
tensor-tang 687a322267 Merge remote-tracking branch 'ups/develop' into refine/im2col
7 years ago
tensor-tang 65d418f060 complete im2col with padding==1 and speedup filter width==1
7 years ago
minqiyang 053540e199 Add volatile to stopped_ member
7 years ago
minqiyang b78ffde6d5 Add stopped sign for grpc client
7 years ago
tensor-tang 52eb86e30f refine im2col benchmark
7 years ago
tensor-tang 3017f46076 add more test cases
7 years ago
tensor-tang 8d6be4fb5f refine im2col test and add benchmark
7 years ago
tensor-tang 507c143047 im2col cfo cpu code clean
7 years ago
tensor-tang 4eeed0b5e4 refine width padding and enable core copy
7 years ago
Wu Yi 73fcfc06ec
refine conv cudnn enforce (#12353)
7 years ago
tensor-tang e3131e2d73 enable width padding
7 years ago
Xin Pan d7e08c53c2
Merge pull request #12169 from panyx0718/ir_graph_sort
7 years ago
tensor-tang 92518c519f reuse sizes saving time
7 years ago
tensor-tang 660df122ce enable padding!=0 and fill height padding with 0
7 years ago
tensor-tang d8e00facf7 reuse im_size
7 years ago
tensor-tang 179dd0cb8a
Merge pull request #12337 from tensor-tang/refine/im2col
7 years ago
Luo Tao 5ba4337698 unify libpaddle_inference_api into libpaddle_fluid
7 years ago
tensor-tang b72befc5cc reuse copy size
7 years ago
Yancey 6133efd9ed
Merge pull request #12218 from Yancey1989/rpc_complete_interface
7 years ago
Zhaolong Xing 6169d724b9
Merge pull request #12324 from NHZlX/enhance_for_tensorrt_infer
7 years ago
nhzlx 4d49e61ab8 fix comments
7 years ago
tensor-tang 6788af4bf1 refine test cases
7 years ago
tensor-tang b163e601b6 add gtest
7 years ago
nhzlx bcd67bdd71 add assert for GetOutput
7 years ago
tensor-tang aae994fd26 refine im2col no padding
7 years ago
Yancey1989 fb06ed7bdc Merge branch 'develop' of github.com:PaddlePaddle/Paddle into rpc_complete_interface
7 years ago
Yu Yang 21387e3c2a Tiny refines for lod_tensor_blocking_queue and reshape_op
7 years ago
nhzlx f42ea48996 deal with conflict
7 years ago
nhzlx 940f5dbcac modify the tensorrt engine op to adapt to chage
7 years ago
Yan Chunwei 02cf54d331
bugfix lod cpu performance (#12297)
7 years ago
Qiao Longfei b41f8b9d42
Merge pull request #12295 from jacquesqiao/speedup-reduce-sum-grad-op
7 years ago
fengjiayi eec412b230
Merge pull request #12273 from JiayiFeng/update_py_reader
7 years ago
Xin Pan 21a45420f0 polish and test
7 years ago
Qiao Longfei 95a2b5f56a
fix mac build of sendrecvop_utils (#12272)
7 years ago
qiaolongfei 273f737517 optimize code
7 years ago
Xin Pan 93355cc0d2 fix control deps
7 years ago
fengjiayi ea8a375fa4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into update_py_reader
7 years ago
qiaolongfei 5d718a5886 optimize reduce_sum_grad op
7 years ago
Yancey1989 d4f51218ef Merge branch 'develop' of github.com:PaddlePaddle/Paddle into rpc_complete_interface
7 years ago
qiaolongfei b643473d31 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-mac-build
7 years ago
fengjiayi 060f421797 Some enhancement on readers
7 years ago
qingqing01 873a50ce35
Fix serious bug in nesterov momentum optimizer. (#12231)
7 years ago
Yan Chunwei b42ced8eda
bugfix/tensorrt analysis fix subgraph trigger (#12266)
7 years ago
qiaolongfei 938390b38d fix mac build of sendrecvop_utils
7 years ago
gongweibao 3a6213f493
Change grpc interface to compatible with brpc. (#12164)
7 years ago
Yu Yang b06309381b
Merge pull request #12149 from reyoung/feature/combine_open_files_and_double_buffer
7 years ago
tensor-tang be04fbff42
Merge pull request #12233 from tensor-tang/refine/mkl/gemm
7 years ago
Qiao Longfei 2b58c62aa0
Update auc op (#12199)
7 years ago
Yancey1989 efd5a84986 update executor interface
7 years ago
tensor-tang fc2b578842 add gemm_warp test
7 years ago
tensor-tang a916c52579 refine gemm
7 years ago
tensor-tang 961e754c9f mkl split gemm for better perf
7 years ago
Yancey1989 ade6675490 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into rpc_complete_interface
7 years ago
yuyang18 e9c8d930a5
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/combine_open_files_and_double_buffer
7 years ago
Yancey1989 d0771cf912 update
7 years ago
Yancey1989 7570d8e77c add rpc complete interface
7 years ago
yuyang18 8c70183ba6
Polish function names
7 years ago
yuyang18 b789a3a484
Change code
7 years ago
whs 8284947b82 Fix infershape of im2sequence. (#12183)
7 years ago
yuyang18 401e92f6e3
Change attr comment
7 years ago
yuyang18 be528f9815
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/combine_open_files_and_double_buffer
7 years ago
Tomasz Patejko b2b8b15bfe MKLDNN sum fix: remove in_place condition in loop creating memory primitives for sum
7 years ago
yuyang18 72b78154b2
Polish reader speed
7 years ago
Wu Yi 866fcb0c15
Merge pull request #12171 from typhoonzero/fix_pserver_with_condition_block
7 years ago
typhoonzero 32d81909dc fix pserver with condition block
7 years ago
tensor-tang d24fd2c6b1
Merge pull request #12099 from jczaja/prv-conv-grad-mkldnn-upstream2
7 years ago
yuyang18 e576345f5b
Try to speed up buffered reader
7 years ago
Wu Yi c5619bbcde
fix auc op (#12087)
7 years ago
Yancey 0042ba93c8
Merge pull request #12127 from Yancey1989/enforce_rpc_timeout
7 years ago
yuyang18 61b3a5977f
Refine Python Reader
7 years ago
yuyang18 b048ddf0bd
Merge error
7 years ago
yuyang18 b8975d6842
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/combine_open_files_and_double_buffer
7 years ago
yuyang18 d36e13efd8
Merge branch 'feature/add_pyreader_demo' into feature/combine_open_files_and_double_buffer
7 years ago
yuyang18 1478a5fc0b
Make open_files use buffer
7 years ago
yuyang18 dc34effd35
Extract buffered reader
7 years ago
yuyang18 392318045f
Merge branch 'feature/dctor_all_readers' into feature/combine_open_files_and_double_buffer
7 years ago
yuyang18 fecbe52200
Rewrite open_files
7 years ago
Yu Yang ba997b8ccd
Merge pull request #12097 from reyoung/feature/hide_api_cont
7 years ago
yuyang18 c680bc1d7f
Rewrite DoubleBuffer
7 years ago
yuyang18 c9cf2bdb9c
Dctor cache
7 years ago
yuyang18 ee7d8b4d66
Refine Shutdown Impl
7 years ago
Jacek Czaja 8e20d36bc8 - comment update
7 years ago
Jacek Czaja c981222b3b - Conv MKLDNN grad op reuse of mkldnn primitives
7 years ago
tensor-tang f0cd493c0d
Merge pull request #11989 from tensor-tang/feature/libxsmm
7 years ago
Sylwester Fraczek 4d55aca40e reserve vector space before loop in top-k
7 years ago
Yu Yang ebe3b5e78a
Merge pull request #11853 from sneaxiy/complete_py_reader_python
7 years ago
Yancey1989 4a91a14549 enforce rpc client timeout
7 years ago
Guo Sheng da3f766821
Merge pull request #12088 from guoshengCS/complete-hsigmoid
7 years ago
sneaxiy 31c7f6b968
Merge branch 'develop' into complete_py_reader_python
7 years ago
fengjiayi 6ff7f2380c
Merge pull request #12063 from reyoung/feature/exception_safe_pe
7 years ago
tensor-tang 2f7b09319a Merge remote-tracking branch 'ups/develop' into feature/libxsmm
7 years ago
guosheng 4ee069fdba Fix the HierarchicalSigmoidGradOpKernel and refine the codes. Now hsigmoid_op is same with V2 implementation and can pass gradient check.
7 years ago
yuyang18 c87e08c28d
Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/exception_safe_pe
7 years ago
chenweihang 938319bbd2
Merge branch 'develop' into unsqueeze_op
7 years ago
Yibing Liu 092d620187
Merge pull request #11812 from chenwhql/squeeze_op
7 years ago
tensor-tang 1c5d6c5692 disable xsmm with float16
7 years ago
tensor-tang c9ba51ead8 Merge remote-tracking branch 'ups/develop' into feature/libxsmm
7 years ago
tensor-tang 64a8e6d20e refine the threshold functions
7 years ago
Tao Luo c620c522d7
Merge pull request #12093 from Noplz/fix_warning
7 years ago
lemon34 29145e1e31 change im2sequence for ctc batch inference (#11696)
7 years ago
Noplz cfa4479b06 fix warning
7 years ago
tensor-tang 32822b2a59 Merge remote-tracking branch 'ups/develop' into feature/libxsmm
7 years ago
chenweihang b8ea7a081a Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into unsqueeze_op
7 years ago
Jacek Czaja fbe25ef510 MKLDNN: Extending Conv MKLDNN op to reuse MKLDNN primitives (#11750)
7 years ago
baiyf be2d9dc2b8 Add prior_box output order control (#12032)
7 years ago
guosheng e7f7ba97fe Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into complete-hsigmoid
7 years ago
guosheng e7a4cfc0ff complete the hsigmoid_op
7 years ago
chenweihang 84a525a38a Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into squeeze_op
7 years ago
sneaxiy f85e16f1de Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into complete_py_reader_python
7 years ago
chenweihang 0ea468225b docs: fix some errors of description
7 years ago
chenweihang fbef49e772 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into unsqueeze_op
7 years ago
chenweihang 3d15968958 docs: fix some errors of description
7 years ago
achao2013 8e4b225fe4 Add fake_quantize_op. (#11359)
7 years ago
Yuan Gao 50aa6ba6f5 add rpn target assign op (#11449)
7 years ago
chenweihang 2bd65dbf71 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into squeeze_op
7 years ago
chenweihang fd01a43a3c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into unsqueeze_op
7 years ago
tensor-tang 7bb67b6788 Merge remote-tracking branch 'ups/develop' into feature/libxsmm
7 years ago
chenweihang cef8dbc1f7 refine some messages and adjust data type
7 years ago
chenweihang 05eafcca73 refine some messages and adjust data type
7 years ago
minqiyang fceaabdd81 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_grpc_destroy_bug
7 years ago
guosheng d695381677 Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into complete-hsigmoid
7 years ago
yuyang18 3aaf798182
Refine size_t and int
7 years ago
fengjiayi 26ae6111d1
Merge pull request #12051 from JiayiFeng/dev_reader_ResetAll
7 years ago
qingqing01 10fbb831ed
Skip BatchNorm when feature only has 1 element. (#11578)
7 years ago
chenweihang 8f2486ca16 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into squeeze_op
7 years ago
tensor-tang 6bc1aaaac7 refine the ColMajor replacement
7 years ago
tensor-tang c3862a7519 Merge remote-tracking branch 'ups/develop' into feature/libxsmm
7 years ago
chenweihang d552b900f0 change the copyright year form 2016 to 2018
7 years ago
qingqing01 ef4895df3b
Make IfElse operator works and fix unit testing. (#11972)
7 years ago
tensor-tang de856da9a6 fix ColMajor and RowMajor replacement
7 years ago
tensor-tang 00ee6c3c17 Merge remote-tracking branch 'ups/develop' into feature/libxsmm
7 years ago
fengjiayi 6d6f49cd56 Merge remote-tracking branch 'yuyang/feature/decorated_reader_chain' into dev_reader_ResetAll
7 years ago
chenweihang 7526eaaf13 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into unsqueeze_op
7 years ago
chenweihang 4453473f71 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into squeeze_op
7 years ago
chenweihang 1721613f1e simplify construct function
7 years ago
fengjiayi 611716e9bc Merge branch 'dev_reader_shutdown_start' of https://github.com/JiayiFeng/Paddle into dev_reader_shutdown_start
7 years ago
fengjiayi 0e9f1e2790 Make ReaderBase thread safe and remove ThreadedReader
7 years ago
yuyang18 e8ee9dc7f8
Several Polish
7 years ago
chenweihang 5f89272c89 change the bit insert to array insert for understandability
7 years ago
fengjiayi b4f0e57956 fix errors
7 years ago
Tao Luo 436bb4500b
Merge pull request #11699 from pzelazko-intel/pzelazko/workaround-for-missing-mklnn-kernels
7 years ago
fengjiayi 6fc6cc2f4c Some updates on readers
7 years ago
fengjiayi 5528f59900 Split ReInit() to Shutdown() and Start()
7 years ago
fengjiayi de9a411f1c adjust readers' inheritance relationships
7 years ago
yuyang18 c48c586aca
Use weak_ptr to implement DecoratedReaderChain
7 years ago
minqiyang 1377b332bc Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_grpc_destroy_bug
7 years ago
chenweihang fccdc1abea Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into unsqueeze_op
7 years ago
chenweihang 62a17f5053 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into unsqueeze_op
7 years ago
chenweihang 80126a7496 small fix based reviewer's advice
7 years ago
yuyang18 8e86721fe7
Fix data balance on single GPU
7 years ago
tensor-tang 21516e5cbe add unit test of smm
7 years ago
tensor-tang c3941745b3 add libxsmm_gemm
7 years ago
minqiyang 2c4fb585db Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix_grpc_destroy_bug
7 years ago