Commit Graph

2935 Commits (c44fb003715aab90d14f0d0fce020d0b65ec6fbf)

Author SHA1 Message Date
nhzlx 32a9e050bc mapping the variable name inside the subgraph
7 years ago
Wu Yi 2d036c47cd polish dist unit test code (#12512)
7 years ago
qiaolongfei 9331ba752f add WITH_INFERENCE flag
7 years ago
chengduo 97a77512b4 Fix the order of sum (#12562)
7 years ago
fengjiayi 7834b4a470 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_op_tensor_support
7 years ago
Luo Tao cf74473244 make inference_anakin_test SERIAL
7 years ago
Qiao Longfei 88a607c342 Merge pull request #12541 from jacquesqiao/optimize-profiler
7 years ago
sneaxiy e57bc4d745 Merge branch 'refine_elementwise_add' of https://github.com/sneaxiy/Paddle into refine_elementwise_add
7 years ago
qiaolongfei 954d680b40 fix test_parallel_do.py
7 years ago
sneaxiy 222fbbedfb Merge branch 'develop' into refine_elementwise_add
7 years ago
sneaxiy 4b83afff6e Merge branch 'develop' into refine_elementwise_add
7 years ago
sneaxiy b2d0ee5159 refine elementwise_add op
7 years ago
Tao Luo 0fd2f713a4 Merge pull request #12548 from Superjomn/bugfix/disable-anakin-test
7 years ago
fengjiayi 7c55e08c93 stash
7 years ago
superjomn ebe1920626 add comment
7 years ago
superjomn 3c5e15de03 disable anakin test
7 years ago
fengjiayi b656d97e86 Merge pull request #12485 from JiayiFeng/dev_ops_tensor_support
7 years ago
qiaolongfei 52576c5f38 revert inference
7 years ago
qiaolongfei 1623f1ba4f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into optimize-profiler
7 years ago
qiaolongfei 903b2c0162 optimize code
7 years ago
Xin Pan 99a77cfc62 Merge pull request #12468 from panyx0718/improve_profiler2
7 years ago
qiaolongfei 4c5bcd7859 add guard to profiler
7 years ago
qiaolongfei d553e2ff3f revert inference
7 years ago
qiaolongfei a3f9d6a38c optimize profiler
7 years ago
minqiyang a58dd3e557 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_python3_syntax
7 years ago
minqiyang f9ef0ee8a9 Polish code
7 years ago
minqiyang c4d000a990 Make code more efficient
7 years ago
minqiyang 9812bb8b48 Fix pserver can NOT start with DebugString problem
7 years ago
Zhaolong Xing d7dd0868db Merge pull request #12449 from NHZlX/add_tensorrt_elementwise_add
7 years ago
nhzlx d50f776b27 merge develop
7 years ago
Bai Yifan 900d61dd98 Clean python api (#12406)
7 years ago
dzhwinter 0c8fde7dce "cherry picked cpp tests" (#12182)
7 years ago
dzhwinter 595a2c83ae explicit gradient of elementwise_add/elementwise_sub (#11970)
7 years ago
nhzlx 64a08f840f increase the test batch
7 years ago
Zhaolong Xing f37f875f1f Merge pull request #12349 from NHZlX/add_tensorrt_conv2d_converter
7 years ago
Zhaolong Xing 7e6bac3ea6 Merge pull request #12479 from NHZlX/fix_gtest_test_eq_warning
7 years ago
fengjiayi e7d8e16a66 update softmax_mkldnn_op
7 years ago
nhzlx c7e6a11bc1 merge develop
7 years ago
nhzlx 0015df1b12 modify op converter for conv2d
7 years ago
Yu Yang 2567afa35d Merge pull request #12462 from reyoung/feature/fix_cudnn_deterministic
7 years ago
fengjiayi dc111d3476 update softmax_cudnn_op
7 years ago
nhzlx 66406619ec merge develop
7 years ago
nhzlx a2749adf5d fix warning
7 years ago
fengjiayi f7bd0b227b Add unittests for softmax_op
7 years ago
gongweibao 819ac3df0a Modify style (#12465)
7 years ago
cuichaowen 046de2acdb Improve anakin feature (#11961)
7 years ago
fengjiayi b314a69523 make softmax supporting tensors
7 years ago
fengjiayi b1af7e5d9b Add unittests for lookup_table_op
7 years ago
Xin Pan 486345551d clean
7 years ago
Xin Pan caf10b474f make profiler use thread_id from g_thread_id
7 years ago
nhzlx c13efe02d9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_elementwise_add
7 years ago
nhzlx a5c96af33c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_conv2d_converter
7 years ago
Yu Yang 040fc1c39b Fix bug in cudnn_determistic
7 years ago
fengjiayi 7efdf05ac2 make look_up_op supporting tensor ids
7 years ago
Yan Chunwei dcfbc6a661 inference analyzer as bin (#12450)
7 years ago
Yan Chunwei 31a2c87688 fea/lightly support lod (#12451)
7 years ago
fengjiayi 38863a2c9d Merge pull request #12454 from JiayiFeng/dev_exception_holder
7 years ago
Qiao Longfei 690625fe15 Merge pull request #12456 from jacquesqiao/add-profiler-to-pserver
7 years ago
dzhwinter 6d3da458a7 Fix/float16 style (#12446)
7 years ago
yuyang18 59c900e1e9 Update API.spec
7 years ago
fengjiayi bc1b7b96ec Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_exception_holder
7 years ago
qiaolongfei 7e46a8d172 fix logical bug, optimize code
7 years ago
qiaolongfei d04dca3798 revert cmakelist
7 years ago
qiaolongfei 0b62f61d29 add init flag in __init__.py for listen_and_serv_profile_period
7 years ago
dzhwinter 91fb0156ca Memory/reshape op (#12414)
7 years ago
qiaolongfei b4496ee442 Merge branch 'fix-mac-build-graph_executor' of ssh://github.com/jacquesqiao/Paddle into add-profiler-to-pserver
7 years ago
qiaolongfei c8c8c01a23 fix mac build of graph_executor
7 years ago
qiaolongfei 0b861bbca9 add profiler for listen_and_serv op
7 years ago
Zhaolong Xing 7ae73e33da Merge pull request #12432 from Superjomn/fea/analysis-ssa
7 years ago
fengjiayi 3e4083ed1f Make exception handling of threaded_ssa_graph_executor an independent class
7 years ago
tensor-tang 059b27840c Merge pull request #12408 from tensor-tang/refine/im2col
7 years ago
Superjomn 15c2f1abb3 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fea/analysis-ssa
7 years ago
nhzlx b241a47e8e merge develop
7 years ago
nhzlx 5fcdd81da7 tiny modify
7 years ago
minqiyang ce4eba3b0d Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_python3_syntax
7 years ago
qiaolongfei 236fc1bd38 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add-test-for-split-ids-op
7 years ago
qingqing01 f372f27e3f Hidden APIs for While, StaticRNN, ParallelDo. (#12332)
7 years ago
minqiyang 000ba1ac5f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_python3_syntax
7 years ago
Xin Pan 4b8ae523c4 Merge pull request #12367 from panyx0718/ir_pass
7 years ago
nhzlx f05c7fb8ae Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_conv2d_converter
7 years ago
nhzlx 6f6d552790 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_conv2d_converter
7 years ago
Qiao Longfei 297cbeb1c6 Merge pull request #12439 from jacquesqiao/CheckTensorNANOrInf-support-selectedrows
7 years ago
dzhwinter 39ac9e39c2 float16 type support enhance (#12181)
7 years ago
qiaolongfei 3033841b4a CheckTensorNANOrInf support checking SelectedRows
7 years ago
qiaolongfei 147bf00ffe clear mutable rows for the output of split_ids_op
7 years ago
qiaolongfei 91b114a787 change map to unordered_map
7 years ago
tensor-tang d8d2dbcfac further optimize im2col using variables
7 years ago
Superjomn 4d2405d851 inference analysis support ssa
7 years ago
qiaolongfei 91f63cd401 fix split_ids_op and add unit test
7 years ago
tensor-tang 5373fe29c2 Merge remote-tracking branch 'ups/develop' into refine/im2col
7 years ago
Xin Pan 02c31458bb Merge pull request #12417 from panyx0718/add_dist_deps
7 years ago
Xin Pan 25706d0868 properly set up dep of concat and fetch_bar
7 years ago
minqiyang e96fef2cf7 Fix inference api impl deps
7 years ago
Xin Pan 4abcb1b8e7 Merge pull request #12409 from panyx0718/add_dist_deps
7 years ago
Qiyang Min 7da453630e Merge pull request #12403 from velconia/fix_hang_up
7 years ago
Xin Pan 398cfb47b1 disable dist_se_resnext since it's not stable yet.
7 years ago
Tao Luo 5a634786af Merge pull request #12312 from luotao1/unify
7 years ago
Bai Yifan e12b1d1792 Add flatten op (#12341)
7 years ago
Luo Tao 062556f938 Merge branch 'develop' into unify
7 years ago
Xin Pan 5fff8d7a55 add distributed training deps.
7 years ago
nhzlx 98948b975e wrong added file
7 years ago
nhzlx 830aa12c1a add elementwise init code
7 years ago
chengduo 2409d0f710 Refine regularization for selected_rows (#12369)
7 years ago
Zhaolong Xing 85c4912755 Merge pull request #12355 from NHZlX/add_tensorrt_pooling_converter
7 years ago
tensor-tang 5bea9c148c Merge pull request #12397 from tensor-tang/refine/num_threads
7 years ago
tensor-tang 687a322267 Merge remote-tracking branch 'ups/develop' into refine/im2col
7 years ago
tensor-tang 65d418f060 complete im2col with padding==1 and speedup filter width==1
7 years ago
minqiyang 053540e199 Add volatile to stopped_ member
7 years ago
tensor-tang 4f0383f52e fix unknown flag
7 years ago
minqiyang 0c7d6eb8b2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_python3_syntax
7 years ago
minqiyang b78ffde6d5 Add stopped sign for grpc client
7 years ago
fengjiayi ec4c6e1f7c Merge pull request #12384 from JiayiFeng/dev_update_save_inference_model
7 years ago
tensor-tang 9788e5ab87 add flags to control num_threads
7 years ago
tensor-tang 10a1c2bb86 control omp num_threads
7 years ago
Xin Pan 99c0c20468 add pass test
7 years ago
tensor-tang 52eb86e30f refine im2col benchmark
7 years ago
tensor-tang 3017f46076 add more test cases
7 years ago
typhoonzero 54e9fd3f61 fix cudnn enforce
7 years ago
tensor-tang 8d6be4fb5f refine im2col test and add benchmark
7 years ago
minqiyang 559d36328c Apply 2to3 to current paddle main python code
7 years ago
tensor-tang 507c143047 im2col cfo cpu code clean
7 years ago
fengjiayi 604bd85a45 update inference_optimize()
7 years ago
Xin Pan 12e9bf6c17 clean up
7 years ago
Xin Pan ab72d28a5e clean up and correctness check
7 years ago
tensor-tang 4eeed0b5e4 refine width padding and enable core copy
7 years ago
Tao Luo 3ade95d0db Merge pull request #12379 from luotao1/demo_ci_fix
7 years ago
fengjiayi 0d43594d16 Merge pull request #12364 from JiayiFeng/dev_add_FLAG_free_idle_memory
7 years ago
Wu Yi 73fcfc06ec refine conv cudnn enforce (#12353)
7 years ago
Xin Pan aa1085ddc5 all passes
7 years ago
nhzlx fb204fbfbe Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_pooling_converter
7 years ago
nhzlx 4f71a3b12b fix a bug
7 years ago
Luo Tao 83e59257d0 fix manylinux1 Failed to publish artifacts
7 years ago
Xin Pan e4d7d7ae8f pass refactoring
7 years ago
tensor-tang e3131e2d73 enable width padding
7 years ago
Xin Pan 142e832d21 pass registration
7 years ago
Xin Pan 5b183557f3 graph viz pass
7 years ago
qiaolongfei 64e7902530 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into parallel-executor-support-prefetch
7 years ago
Xin Pan d7e08c53c2 Merge pull request #12169 from panyx0718/ir_graph_sort
7 years ago
tensor-tang 92518c519f reuse sizes saving time
7 years ago
tensor-tang 660df122ce enable padding!=0 and fill height padding with 0
7 years ago
tensor-tang d8e00facf7 reuse im_size
7 years ago
tensor-tang 179dd0cb8a Merge pull request #12337 from tensor-tang/refine/im2col
7 years ago
nhzlx c8adfb3451 add paddle_enforce
7 years ago
nhzlx 5533400720 fix comments
7 years ago
fengjiayi fd2d2c66e9 add flag to prevent unnessary memory free
7 years ago
qiaolongfei e7eeb19f90 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into parallel-executor-support-prefetch
7 years ago
Qiao Longfei 2d21aa76c7 Merge pull request #12331 from jacquesqiao/fix-mixed-tensor
7 years ago
Luo Tao 5ba4337698 unify libpaddle_inference_api into libpaddle_fluid
7 years ago
nhzlx 01566fb61b 1. support mutil batch utest 2. support pool op
7 years ago
qiaolongfei 754e96a30c distribute lookup table work with parallel executor
7 years ago