Commit Graph

2800 Commits (1f618c4ff9622259489546535c85309e4b619ebb)

Author SHA1 Message Date
qiaolongfei 147bf00ffe clear mutable rows for the output of split_ids_op
7 years ago
qiaolongfei 91b114a787 change map to unordered_map
7 years ago
tensor-tang d8d2dbcfac further optimize im2col using variables
7 years ago
Superjomn 4d2405d851 inference analysis support ssa
7 years ago
qiaolongfei 91f63cd401 fix split_ids_op and add unit test
7 years ago
tensor-tang 5373fe29c2 Merge remote-tracking branch 'ups/develop' into refine/im2col
7 years ago
Xin Pan 02c31458bb
Merge pull request #12417 from panyx0718/add_dist_deps
7 years ago
Xin Pan 25706d0868 properly set up dep of concat and fetch_bar
7 years ago
minqiyang e96fef2cf7 Fix inference api impl deps
7 years ago
Xin Pan 4abcb1b8e7
Merge pull request #12409 from panyx0718/add_dist_deps
7 years ago
Qiyang Min 7da453630e
Merge pull request #12403 from velconia/fix_hang_up
7 years ago
Xin Pan 398cfb47b1 disable dist_se_resnext since it's not stable yet.
7 years ago
Tao Luo 5a634786af
Merge pull request #12312 from luotao1/unify
7 years ago
Bai Yifan e12b1d1792 Add flatten op (#12341)
7 years ago
Luo Tao 062556f938 Merge branch 'develop' into unify
7 years ago
Xin Pan 5fff8d7a55 add distributed training deps.
7 years ago
nhzlx 98948b975e wrong added file
7 years ago
nhzlx 830aa12c1a add elementwise init code
7 years ago
chengduo 2409d0f710
Refine regularization for selected_rows (#12369)
7 years ago
Zhaolong Xing 85c4912755
Merge pull request #12355 from NHZlX/add_tensorrt_pooling_converter
7 years ago
tensor-tang 5bea9c148c
Merge pull request #12397 from tensor-tang/refine/num_threads
7 years ago
tensor-tang 687a322267 Merge remote-tracking branch 'ups/develop' into refine/im2col
7 years ago
tensor-tang 65d418f060 complete im2col with padding==1 and speedup filter width==1
7 years ago
minqiyang 053540e199 Add volatile to stopped_ member
7 years ago
tensor-tang 4f0383f52e fix unknown flag
7 years ago
minqiyang 0c7d6eb8b2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_python3_syntax
7 years ago
minqiyang b78ffde6d5 Add stopped sign for grpc client
7 years ago
fengjiayi ec4c6e1f7c
Merge pull request #12384 from JiayiFeng/dev_update_save_inference_model
7 years ago
tensor-tang 9788e5ab87 add flags to control num_threads
7 years ago
tensor-tang 10a1c2bb86 control omp num_threads
7 years ago
Xin Pan 99c0c20468 add pass test
7 years ago
tensor-tang 52eb86e30f refine im2col benchmark
7 years ago
tensor-tang 3017f46076 add more test cases
7 years ago
typhoonzero 54e9fd3f61 fix cudnn enforce
7 years ago
tensor-tang 8d6be4fb5f refine im2col test and add benchmark
7 years ago
minqiyang 559d36328c Apply 2to3 to current paddle main python code
7 years ago
tensor-tang 507c143047 im2col cfo cpu code clean
7 years ago
fengjiayi 604bd85a45 update inference_optimize()
7 years ago
Xin Pan 12e9bf6c17 clean up
7 years ago
Xin Pan ab72d28a5e clean up and correctness check
7 years ago
tensor-tang 4eeed0b5e4 refine width padding and enable core copy
7 years ago
Tao Luo 3ade95d0db
Merge pull request #12379 from luotao1/demo_ci_fix
7 years ago
fengjiayi 0d43594d16
Merge pull request #12364 from JiayiFeng/dev_add_FLAG_free_idle_memory
7 years ago
Wu Yi 73fcfc06ec
refine conv cudnn enforce (#12353)
7 years ago
Xin Pan aa1085ddc5 all passes
7 years ago
nhzlx fb204fbfbe Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_pooling_converter
7 years ago
nhzlx 4f71a3b12b fix a bug
7 years ago
Luo Tao 83e59257d0 fix manylinux1 Failed to publish artifacts
7 years ago
Xin Pan e4d7d7ae8f pass refactoring
7 years ago
tensor-tang e3131e2d73 enable width padding
7 years ago
Xin Pan 142e832d21 pass registration
7 years ago
Xin Pan 5b183557f3 graph viz pass
7 years ago
qiaolongfei 64e7902530 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into parallel-executor-support-prefetch
7 years ago
Xin Pan d7e08c53c2
Merge pull request #12169 from panyx0718/ir_graph_sort
7 years ago
tensor-tang 92518c519f reuse sizes saving time
7 years ago
tensor-tang 660df122ce enable padding!=0 and fill height padding with 0
7 years ago
tensor-tang d8e00facf7 reuse im_size
7 years ago
tensor-tang 179dd0cb8a
Merge pull request #12337 from tensor-tang/refine/im2col
7 years ago
nhzlx c8adfb3451 add paddle_enforce
7 years ago
nhzlx 5533400720 fix comments
7 years ago
fengjiayi fd2d2c66e9 add flag to prevent unnessary memory free
7 years ago
qiaolongfei e7eeb19f90 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into parallel-executor-support-prefetch
7 years ago
Qiao Longfei 2d21aa76c7
Merge pull request #12331 from jacquesqiao/fix-mixed-tensor
7 years ago
Luo Tao 5ba4337698 unify libpaddle_inference_api into libpaddle_fluid
7 years ago
nhzlx 01566fb61b 1. support mutil batch utest 2. support pool op
7 years ago
qiaolongfei 754e96a30c distribute lookup table work with parallel executor
7 years ago
qiaolongfei 65e5aebd43 fix mixed_vector_test
7 years ago
nhzlx 990741aa85 add weight's dim assert
7 years ago
nhzlx 21890ca0cf Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_tensorrt_pooling_converter
7 years ago
qiaolongfei da035fc674 remove explicit for compile problem
7 years ago
tensor-tang 7b63b85086 fix mismatch of infer api (#12342)
7 years ago
tensor-tang b72befc5cc reuse copy size
7 years ago
Yancey 6133efd9ed
Merge pull request #12218 from Yancey1989/rpc_complete_interface
7 years ago
qiaolongfei c6fb163571 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-mixed-tensor
7 years ago
nhzlx fc41eb40b1 add conv2d trt converter
7 years ago
qingqing01 24bea40116
Hiden some LoDTensor ralated ops' Python wrapper. (#12230)
7 years ago
Zhaolong Xing 6169d724b9
Merge pull request #12324 from NHZlX/enhance_for_tensorrt_infer
7 years ago
nhzlx 4d49e61ab8 fix comments
7 years ago
qiaolongfei 18d539e82a Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into fix-mixed-tensor
7 years ago
Wu Yi 9f0d9dffe6
hide variable API (#12307)
7 years ago
tensor-tang 6788af4bf1 refine test cases
7 years ago
tensor-tang b163e601b6 add gtest
7 years ago
Yu Yang 7c046ae772
Merge pull request #12323 from reyoung/feature/polish_reshape_and_lod_tensor_blocking_queue
7 years ago
nhzlx bcd67bdd71 add assert for GetOutput
7 years ago
qiaolongfei 5022b14de8 fix mixed tensor compile and add cpu unit test
7 years ago
tensor-tang aae994fd26 refine im2col no padding
7 years ago
Yancey1989 fb06ed7bdc Merge branch 'develop' of github.com:PaddlePaddle/Paddle into rpc_complete_interface
7 years ago
nhzlx 7382f98600 1. set ut batch > 1 2. readd the mul op(utest will be added later)
7 years ago
nhzlx bd64979fe9 the argument should not be a const one
7 years ago
Yu Yang 21387e3c2a Tiny refines for lod_tensor_blocking_queue and reshape_op
7 years ago
nhzlx f42ea48996 deal with conflict
7 years ago
nhzlx 940f5dbcac modify the tensorrt engine op to adapt to chage
7 years ago
nhzlx 82527696e7 1. we delelte mul op, 2.modify fc and action op 3. modify the test inferface
7 years ago
nhzlx 2372daff1d there is no batchsize concept in tensorrt's tensor
7 years ago
qiaolongfei 35d09abd01 add profiler for demo_trainer
7 years ago
Yan Chunwei 02cf54d331
bugfix lod cpu performance (#12297)
7 years ago
Qiao Longfei b41f8b9d42
Merge pull request #12295 from jacquesqiao/speedup-reduce-sum-grad-op
7 years ago
Xin Pan 5173a53c8a fix reorder issue.
7 years ago
fengjiayi eec412b230
Merge pull request #12273 from JiayiFeng/update_py_reader
7 years ago
Yan Chunwei 9e0a94f069
inference-api code clean (#12274)
7 years ago