Commit Graph

6361 Commits (608ebece366e24257fea70bff8cab007473f10b4)

Author SHA1 Message Date
Luo Tao 984d09c8de Merge branch 'develop' into fluid_infer
7 years ago
wanghaoshuang 281e93bcbb Remove 'top 1' from CPU and GPU kernel
7 years ago
fengjiayi f59599a3dd refine elementwise_add_op
7 years ago
fengjiayi ead7059bf9 Refine code
7 years ago
sidgoyal78 ba2f6f71ad Modify directory structure to show plots
7 years ago
helinwang 052c05be34
Merge pull request #7537 from Yancey1989/distributed_gpu
7 years ago
fengjiayi 5f4481350e
Merge pull request #7529 from JiayiFeng/remove_functor1
7 years ago
gongweibao 535fefb7e8
Fix grpc bugs (#7435)
7 years ago
whs 448fee3db4
Merge pull request #7414 from wanghaoshuang/warpctc
7 years ago
dzhwinter b9b75377a2
Feature/hooks (#7513)
7 years ago
chengduoZH 251c6032fb set use_cudnn as default
7 years ago
Yancey1989 329f1e0f79 add some comment
7 years ago
Yancey1989 9de1809590 fluid distributed on CUDA place
7 years ago
chengduoZH 79aa51229a fix conv, pool, conv_trans to decide use cudnn or not
7 years ago
fengjiayi f5cd961900 complete elementwise_min_op
7 years ago
fengjiayi acf37ad675 Complete elementwise_max_op
7 years ago
fengjiayi 76a74f1f77 Merge branch 'remove_functor1' into dev_elementwise_max_min
7 years ago
fengjiayi 6ee8a2e1db remove unnecessary functor1
7 years ago
fengjiayi 736842e447 wip
7 years ago
wanghaoshuang 8f37c3c2a7 Fix sequence scale functor cuda kernel
7 years ago
wanghaoshuang 579f684661 Add ctc_greedy_decode_op
7 years ago
Luo Tao 816e556b98 Merge branch 'develop' into fluid_infer
7 years ago
guosheng ef129718ea Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into add-python-glu
7 years ago
guosheng c083a60d7a Add python split and glu
7 years ago
yangyaming a091d1a31c Enhance print_op.
7 years ago
Qiao Longfei 9deb1756a2
fix while_grad_op first step loss lod problem (#7490)
7 years ago
dzhwinter 5ad1aef051
"cudnn operators change to cudnn kernel" (#6660)
7 years ago
wanghaoshuang 45cf234127 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into warpctc
7 years ago
wanghaoshuang 137f0dfc21 1. Fix warpctc grad tensor initial bug.
7 years ago
Cao Ying db65f497ff Update comments for two operators. (#7457)
7 years ago
Luo Tao 3cf23bece3 create paddle_fluid_shared.so library
7 years ago
Tao Luo 04bfed660b
Merge pull request #7413 from tensor-tang/v1mac
7 years ago
Qiao Longfei 23df6c4478
Add get lod for debug (#7375)
7 years ago
Yan Chunwei 3423022e84
feature/add print op (#6799)
7 years ago
Tao Luo 03c858a9be
Merge pull request #6596 from lcy-seso/update_capi_doc
7 years ago
emailweixu d954becb1c
Merge pull request #7371 from emailweixu/assign_value_op
7 years ago
Yang Yang 208f950ccc delete todo
7 years ago
Tao Luo 29b2693afb
Merge pull request #7445 from abhinavarora/api_doc_bug
7 years ago
xuwei06 25ecd2061a Use CopyFromVector for assign_value_op
7 years ago
xuwei06 7306aab61d GetActualKernelType => GetExpectedKernelType
7 years ago
xuwei06 ce233796ea assign_value operator
7 years ago
Yang Yang 8c1025d66f first commit
7 years ago
Yu Yang a9dbdab5e2
Merge pull request #7396 from reyoung/feature/parallel_for_unittest
7 years ago
ranqiu92 95c0c12641
Merge pull request #7384 from dzhwinter/feature/sync_wait
7 years ago
Abhinav Arora f784dae3da Fix the documentation for elementwise op in fluid layers
7 years ago
tensor-tang 74c5e7c52d enbale auto set env of V1 on Mac
7 years ago
Yang Yu dc488c17d1 Merge branch 'develop' of github.com:baidu/Paddle into feature/parallel_for_unittest
7 years ago
whs 59bf85d914
Merge pull request #7325 from kuke/sequence_erase_op
7 years ago
wanghaoshuang 89de5d5e66 Fix cuda kernel of sequence scale functor
7 years ago
gongweibao da3087ada1
Async GRPC sendrecv (#7133)
7 years ago
wanghaoshuang 9eb3fb2946 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into warpctc
7 years ago
wanghaoshuang b1af5e435f 1. Fix warpctc grad op
7 years ago
Tao Luo 020630b7a3
Merge pull request #7395 from tensor-tang/fixds2
7 years ago
Tao Luo af96cd5877
Merge pull request #7397 from luotao1/openblas_env
7 years ago
Yibing Liu 861b84f557
Merge pull request #5300 from kuke/ctc_edit_distance_dev
7 years ago
tensor-tang a7e847b648 fix ds2 issue
7 years ago
Yibing Liu fe0ef91a3f fix ci error in edit_distance_op
7 years ago
Qiao Longfei 377424bf21
reorganize data transform related code (#7391)
7 years ago
Yibing Liu f3dcd00603 Merge branch 'develop' of upstream into ctc_edit_distance_dev
7 years ago
Yu Yang 6cff3c96db
Merge pull request #7381 from reyoung/feature/refine_get_places_op
7 years ago
Yibing Liu f594ca4369 Reuse the usable variable in edit_distance_op
7 years ago
Yang yaming a32027604f
Merge pull request #7176 from pkuyym/fix-7173
7 years ago
Luo Tao 929d22c622 auto set openblas env
7 years ago
Yang Yu 52ed103899 Merge branch 'feature/fix_ini_glog' into feature/parallel_for_unittest
7 years ago
Yang Yu 2f56995f7c Fix InitGLOG
7 years ago
Yibing Liu 10779460c5 Simplify calc in test_sequence_erase_op
7 years ago
Yang Yu e5e206e2b6 Merge branch 'develop' of github.com:baidu/Paddle into feature/refine_get_places_op
7 years ago
Yang Yang(Tony) 4bcc0b64cb
[WIP] feature/parallel_gpu (#7293)
7 years ago
dzhwinter a6edc0389e "fix CI"
7 years ago
dzhwinter f0316bdbbd "add flags"
7 years ago
Yang Yu 4059c9ca7f Polish GetPlacesOp
7 years ago
fengjiayi df92776850
Merge pull request #7269 from emailweixu/calc_gradient
7 years ago
dzhwinter 5f98500009 Make init device on all gpu by default (#7345)
7 years ago
Qiao Longfei efe06caa3d
change data type of beam_search op (#7374)
7 years ago
Qiao Longfei 91f80f792d
Topk share lod (#7373)
7 years ago
fengjiayi cedd9805f5
Merge pull request #7361 from JiayiFeng/refine_and_enhence_WhileGradOp
7 years ago
xuwei06 585dec3dc2 Calculating gradients for partial graph
7 years ago
xuwei06 0ef9dc6122 Fix comment for norm_op
7 years ago
yangyaming 106d063c5f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-7346
7 years ago
yangyaming 427e47459b Add grad_op_maker for sequence_pool.
7 years ago
QI JUN 45e77154cf
add general memory usage interface for both CPU/CUDA (#7352)
7 years ago
fengjiayi fbc30215d4 refine WhileGradOp code
7 years ago
Yancey e249ad1211
Test dist word2vec (#7334)
7 years ago
Yiqun Liu b5fda2723f
Port WarpCTC Operator (#5107)
7 years ago
fengjiayi 8f962f7433 Update
7 years ago
Yu Yang ce6dad3b35 Rename CopyFrom to Copy for tensors (#7292)
7 years ago
Yu Yang 1dad4bb2e9
Remove unused LoDTensor methods (#7247)
7 years ago
Qiao Longfei d762e07ecc
Merge pull request #7294 from jacquesqiao/add-back-priority
7 years ago
Yibing Liu 7b9d5b325c Add document for sequence_erase_op
7 years ago
qiaolongfei 8b1a81a9bf fix GetDims bug
7 years ago
Yibing Liu 37f933b8ad Add gpu kernel for sequence_erase_op
7 years ago
qiaolongfei 5b94948b32 disable UseAll when init
7 years ago
qiaolongfei 0b52cc886f fix priority
7 years ago
Yu Yang 6e3cc0c993
Merge pull request #7240 from reyoung/feature/make_lod_a_share_ptr
7 years ago
hedaoyuan e8a96a8e89
Merge pull request #7159 from hedaoyuan/inference
7 years ago
Yancey aa75f1e2c5
Create tensor in recv op (#7286)
7 years ago
Tao Luo b9ce55faad
Merge pull request #7290 from luotao1/profiler
7 years ago
qiaolongfei ca90356b0e add back priority
7 years ago
yangyaming 12ed53c1f2 Inherit LoD from x to x_grad and enhance the unit test.
7 years ago
Yu Yang 219fbd51bc
Merge pull request #6732 from QiJune/get_places_op
7 years ago
Luo Tao 01ee42b121 fix compile error in profiler.cc
7 years ago
Yibing Liu 2a54ddd267 Merge branch 'develop' of upstream into sequence_erase_op
7 years ago
Guo Sheng 26d451116b
Merge pull request #7251 from guoshengCS/enhance-reorderTensor
7 years ago
Yang Yu 3b0afae3d1 Add more comments
7 years ago
Yang Yu bdc82956d6 Merge branch 'develop' of github.com:baidu/Paddle into feature/make_lod_a_share_ptr
7 years ago
dzhwinter e94db381ba
Feature/add shared layout (#7233)
7 years ago
Yang Yu 63ff0b4ba9 Refine get_places
7 years ago
Yibing Liu 5e90f5e187
Merge pull request #7043 from kuke/profiler_tool
7 years ago
Tao Luo c62383e951
Merge pull request #7255 from tensor-tang/capi
7 years ago
Qiao Longfei 0f353ab46e
cpu gpu transform function (#7191)
7 years ago
Yibing Liu 6416eee780 Merge branch 'develop' of upstream into profiling_tool
7 years ago
Yibing Liu ea0280b4a1 Remove unused included header gflags
7 years ago
Yibing Liu d09503b2be Remove the redundant switch case statement
7 years ago
emailweixu 8814bec0c5 Show argument dimensions with operator::DebugStringEx (#7268)
7 years ago
Yang Yu ed0cf3d6c6 Merge branch 'develop' of github.com:baidu/Paddle into feature/get_places
7 years ago
ying 52c2278692 Merge branch 'develop' into update_capi_doc
7 years ago
Siddharth Goyal 12e3514139 Modify inference.cc to run example without pickletools (#7262)
7 years ago
Yibing Liu e85c513307 Add sequencee erase operator
7 years ago
Yibing Liu 7a4f3be9f3 Fix profiler place bug
7 years ago
tensor-tang 11ed2f2f93 package right mkldnn and mklml libs if enabled in capi
7 years ago
Yibing Liu df3b250c70 Fix bad_alloc bug & refine code in profiler
7 years ago
tensor-tang 5ab271821b fix crash when generating c-api package
7 years ago
guosheng ea6eb96347 Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into enhance-reorderTensor
7 years ago
guosheng e21923543a Enhance reorder_lod_tensor_by_rank_op to support Tensor
7 years ago
Yancey 643ff03fbc
capi package (#7237)
7 years ago
tensor-tang 673dc26cd4
Merge pull request #7164 from tensor-tang/context
7 years ago
Yibing Liu 367a5c9ef2 Merge branch 'develop' of upstream into profiler_tool
7 years ago
Yibing Liu 5a0a461753 Make time calc funcs return ms instead of us
7 years ago
Yu Yang 894236a128
Merge pull request #6730 from tonyyang-svail/parallel_do
7 years ago
Yibing Liu d7e56847c7 fix typos
7 years ago
zhangchao d06bbb12e8
Merge pull request #7203 from peterzhang2029/fix_adagrad
7 years ago
Yibing Liu 0aa03a822e Enable sorting the profiling result by different keys
7 years ago
Yang Yu 0cfb5465cd Add COWPtr and its unittest
7 years ago
tensor-tang 6177cb5162 Merge remote-tracking branch 'upstream/develop' into context
7 years ago
dzhwinter 5593858dd9
Feature/use cudnn (#7141)
7 years ago
peterzhang2029 dd09d8b2bf fix gpu error
7 years ago
tensor-tang 9c7cea81c8 follow comments, use unique_ptr and remove unused file
7 years ago
Yang Yu 8496b2e41b Refine parallel_do
7 years ago
Yang Yu 60e27d110b Merge branch 'develop' of github.com:baidu/Paddle into parallel_do
7 years ago
Yibing Liu 2d94eca8a1 Format profiling report
7 years ago
Yancey e5fe8935fb
send_recv variables (#7161)
7 years ago
Yibing Liu 0f4410755f Confirm the contents in profiling report
7 years ago
Tao Luo 809122cb5a
Merge pull request #7097 from Xreki/core_inference_example
7 years ago
Yu Yang a8b3996854
Merge pull request #7219 from reyoung/feature/correctly_handle_lod_information_for_image_operators
7 years ago
Yang Yu 040dc59b0f Correctly handle image operators
7 years ago
Yu Yang 564dba1753
Merge pull request #7196 from reyoung/feature/async_drop_kid
7 years ago
Liu Yiqun c7bd77792e Support the link of inference library on mac.
7 years ago
Tao Luo ee341efd79
Merge pull request #7183 from tensor-tang/use_mkl_packed
7 years ago
dzhwinter a4024a5f3d
"remove cudnn devicecontext" (#7207)
7 years ago
Yibing Liu 770aff2c4d Merge the update in profiling tool
7 years ago
peterzhang2029 c10d5d2aa1 refine code style
7 years ago
tensor-tang 3b5e4e0a83 default disable use_mkl_packed
7 years ago
peterzhang2029 f9ef6d1519 init
7 years ago
Yang Yu 7e10b81818 Fix style check
7 years ago
Yang Yu e138bcf450 Update cmake of scope
7 years ago
Yang Yu 24181fdc93 Merge branch 'develop' of github.com:baidu/Paddle into feature/async_drop_kid
7 years ago
Yu Yang cd5fad13ce
Merge pull request #7160 from reyoung/feature/expose_activations
7 years ago
Yang Yu 8ae84a5745 Async to drop kid
7 years ago
yangyaming e32103645e Only shrink for the first level LoD.
7 years ago
dzhwinter a893f15652
fix layout transform (#7149)
7 years ago
yangyaming f947c15378 Consider multiple levels of LoD.
7 years ago
Yu Yang dd8ffe1ea8
Merge pull request #7131 from reyoung/feature/tiny_enhance_of_while_op
7 years ago
tensor-tang 042f3524d2 add flag use_mkl_packed
7 years ago
Yang Yang 97dc451f4a clean up
7 years ago
Yang Yang fccbc2fc28 licence update
7 years ago
tensor-tang 880b2e8012 Merge remote-tracking branch 'upstream/develop' into context
7 years ago
tensor-tang 5bf5650dcf generate mkldnn dummy target for static deps
7 years ago
tensor-tang 0a8775cc5d fix shape_inference deps
7 years ago
yangyaming 66ae0a8cb2 Enhence shrink_rnn_memory_op.
7 years ago
tensor-tang b0ba2b061e fix typo
7 years ago
dzhwinter 19541468b6
"fix frigled test gradient of rnn" (#7166)
7 years ago
Tao Luo 231e2ee55e
Merge pull request #7148 from luotao1/op_make
7 years ago
Luo Tao 2d2b633282 add more comments in CMakelists.txt of operator
7 years ago
Yang Yang 7411df34b8 add multi thread
7 years ago
Yang Yang 8ee17e965f pass sgd at first iter
7 years ago
Yang Yang 9313233297 merge develop
7 years ago
QI JUN 907e6d04de
Fix bug in SetAttrDescVisitor (#7165)
7 years ago
Yibing Liu 0250e54c2d Enable batch input in edit_distance_op
7 years ago
Yang Yang 6004a2ed4f add copy skeleton
7 years ago
Yang Yu 5a4367bb16 Update
7 years ago
Liu Yiqun 5b3cf4ee61 Use gflags to parse arguments from command-line.
7 years ago
tensor-tang 31fda46cf4 fix mkldnn deps
7 years ago
Yang Yu 63e3150772 Update code
7 years ago
tensor-tang 03091ccbff add mkldnn_helper
7 years ago
tensor-tang 72652845b6 add MKLDNNDeviceContext
7 years ago
Yang Yu 2b3d94691b Update init.cc
7 years ago
sweetsky0901 cbe25b3345
Merge pull request #6881 from sweetsky0901/cross_channel_norm
7 years ago
Yang Yu 059096741e Add init glog
7 years ago
Tao Luo c10023006b
Merge pull request #6719 from tensor-tang/mkl_packed
7 years ago
Yang Yang cb0b81f97b add << lodtensor
7 years ago
Luo Tao 5974c1b76e refine comments in CMakelists.txt of operator
7 years ago
hedaoyuan cb4caa5ccb Bug fix.
7 years ago
hedaoyuan 3676789187 Add a capi interface of release middle layer's output memory.
7 years ago
Liu Yiqun 42a0603e6f Merge branch 'develop' into core_inference_example
7 years ago
tensor-tang 89cb3a249c follow comments, refine comment and function name
7 years ago
qingqing01 39502e6ecd
Merge pull request #6701 from qingqing01/profiler_tool
7 years ago
tensor-tang adf79faaca Merge remote-tracking branch 'upstream/develop' into mkl_packed
7 years ago
tensor-tang df2b054b13 follow comments refine code
7 years ago
Luo Tao e4e95beedc manually pybind some specific operators
7 years ago
Luo Tao f3851fe58d auto pybind when *_op.cc contains several operators
7 years ago