Commit Graph

10111 Commits (698c926ce5f1666d18b00bdd12fe63803dc738fe)

Author SHA1 Message Date
Zhaolong Xing 310708726b Merge pull request #12761 from NHZlX/global_pooling_trt
7 years ago
tensor-tang b090479409 Merge remote-tracking branch 'ups/develop' into feature/op/fusion_lstm
7 years ago
nhzlx 1e92baf746 fix comments
7 years ago
Xin Pan 17b88811e0 fix ProgramToGraph
7 years ago
tangwei12 b4f52b01d0 bug fix when all inputs are empty
7 years ago
tangwei12 3efac174ea Merge branch 'develop' of github.com:PaddlePaddle/Paddle into sum_op_dim_fix
7 years ago
tangwei12 dbb4f0d35d Merge branch 'develop' of github.com:PaddlePaddle/Paddle into dis_ckpt_fix
7 years ago
Qiao Longfei fd10669ecb Add dependency to send recv (#12760)
7 years ago
nhzlx ce7f361a80 fix comments
7 years ago
Xin Pan a9217031ba small fix
7 years ago
nhzlx df9cbabcee add pool2d test for global_pooling true
7 years ago
dzhwinter 2673798ddb "fix float16 ShuffleDownSync Bug" (#12756)
7 years ago
Yan Chunwei 6fe5547db7 switch NodeAttr to boost::varient (#12539)
7 years ago
Chen Weihang 535a6e9206 Merge pull request #12509 from JiabinYang/scripts0802
7 years ago
nhzlx 133ec69625 add batch norm trt converter
7 years ago
tangwei12 7c12c0f865 add sync in load selectedrows
7 years ago
luotao1 413bf9d494 disable anakin when cuda < 8.0 or cudnn < 7.0
7 years ago
tangwei12 9f11db4080 add todo in impl
7 years ago
tangwei12 40febec402 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into dis_ckpt_fix
7 years ago
tangwei12 c24a9263ba Merge branch 'develop' of github.com:PaddlePaddle/Paddle into sum_op_dim_fix
7 years ago
Qiao Longfei 03d4c7efd3 add rw lock test (#12752)
7 years ago
dzhwinter f36818d532 "windows testing easier" (#12739)
7 years ago
nhzlx 2bdd20be22 add support for global pooling for trt
7 years ago
tangwei12 ac9ae97001 code fix
7 years ago
nhzlx f55e8901c8 merge develop
7 years ago
nhzlx 1600ba86f6 1. change tensorrt op from cpu to gpu
7 years ago
tangwei12 bb9f494740 merge develop
7 years ago
tangwei12 eba7177475 add unit test and code fix
7 years ago
dzhwinter 4069262f0e Revert ""cherry picked operators changes" (#12184)" (#12747)
7 years ago
Qiao Longfei 653fad08f8 Optimize selected rows for dist lookup table with pthread rwlock (#12635)
7 years ago
Qiao Longfei 64d48f4d6a fix mac compile (#12751)
7 years ago
tensor-tang 92890ac258 Merge remote-tracking branch 'ups/develop' into feature/op/fusion_lstm
7 years ago
tangwei12 0749c8822d Merge pull request #12556 from seiriosPlus/samplingIdOp
7 years ago
Qiyang Min 340a104c58 Merge pull request #12658 from velconia/port_pybind11
7 years ago
tensor-tang a56142c155 optimize elementwise_mul cpu forward
7 years ago
tensor-tang 6644ce79a5 add mklml vmul
7 years ago
tensor-tang ff92b6ba81 Merge pull request #12531 from tensor-tang/refine/op/gru
7 years ago
tangwei12 26b228e405 remove assignment and add vlog
7 years ago
Chen Weihang d4d8f83137 Merge pull request #12633 from chenwhql/demangle_type_name
7 years ago
Chen Weihang 1e961b145c Merge pull request #12591 from chenwhql/enforce_msg_polish
7 years ago
tangwei12 125e9166e1 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into sum_op_dim_fix
7 years ago
tensor-tang a72f68f223 Merge remote-tracking branch 'ups/develop' into feature/op/fusion_lstm
7 years ago
tensor-tang df28a3b452 fix lod and op test
7 years ago
Tao Luo 17da113c87 Merge pull request #12693 from luotao1/anakin_bug
7 years ago
Qingsheng Li 317e18abd2 Remove Data Sharing between input and output in scatter_op (#12672)
7 years ago
tensor-tang f3cd2612ae refine fc and use the fc compute in fusion_lstm
7 years ago
qingqing01 c44fb00371 Add name in relu and log API. (#12438)
7 years ago
luotao1 9f3789944c use latest anakin commit
7 years ago
tangwei12 822496f626 merge cpu and gpu
7 years ago
dzhwinter bf3c34960f "cherry picked operators changes" (#12184)
7 years ago
tensor-tang 40138c4cd6 add unit test of fusion lstm op
7 years ago
jerrywgz c108376506 Add three modes for prelu_op (#12630)
7 years ago
tangwei12 9f09d68678 add enforce
7 years ago
gongweibao d06849305a parameter dispather. (#12666)
7 years ago
tensor-tang 852bc6f4aa refine fusion lstm op doc
7 years ago
tensor-tang 8f9132959e fuse fc in lstm
7 years ago
tensor-tang ddb05dffb6 init fusion lstm op
7 years ago
tensor-tang efc5392d97 Merge pull request #12676 from tensor-tang/refine/op/fc
7 years ago
minqiyang a32ce8c444 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_pybind11
7 years ago
Yan Chunwei 5d2834fcf7 fea/ir support fuse, based on graph pattern detection helper (#12636)
7 years ago
tangwei12 470fb7c5c3 bug fix
7 years ago
minqiyang 0d7047ca79 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_pybind11
7 years ago
yuyang18 d1d825ee02 Hide unnecessary API
7 years ago
yuyang18 265302edea Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into feature/fast_executor
7 years ago
tangwei12 60dda7bf9f add gpu Implementation
7 years ago
tangwei12 4661f5589d random optimize
7 years ago
Wu Yi bd87f67f0e Dist transpile can pass startup program by argument (#12606)
7 years ago
Bai Yifan 9333a62792 Add flatten op interface and enhance APIs about detection to support variable-length image. (#12422)
7 years ago
tensor-tang eee38464dc refine fc op use cpu only
7 years ago
tangwei12 ed937bc6f8 merge
7 years ago
fengjiayi f276006f0c Merge pull request #12694 from JiayiFeng/dev_op_tensor_support
7 years ago
Yu Yang a197737c02 Merge pull request #12690 from reyoung/feature/better_exception_holder
7 years ago
Yan Chunwei e765dead86 add profiler to fluid inference (#12707)
7 years ago
tensor-tang d84a1a0010 fc op use cpu only
7 years ago
tensor-tang fbc164047d Merge remote-tracking branch 'ups/develop' into refine/op/fc
7 years ago
Xin Pan d96ee24f0b Merge pull request #12697 from panyx0718/ir2
7 years ago
minqiyang 77f12e000f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_pybind11
7 years ago
tangwei12 f56102505a add pserver_endpoints args in load_inference_model
7 years ago
tangwei12 478f73c188 merge header in cc
7 years ago
fengjiayi d6b5302bd6 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_op_tensor_support
7 years ago
Yan Chunwei 0a641ba326 add ratio to profiler (#12701)
7 years ago
tensor-tang c588c64a76 Merge remote-tracking branch 'ups/develop' into refine/op/gru
7 years ago
tensor-tang 0098a494a2 Merge remote-tracking branch 'ups/develop' into refine/op/fc
7 years ago
gongweibao 842fb021b3 Fix clone() bug. (#12583)
7 years ago
Qiao Longfei 5d579e1a96 add export_for_deployment flag to save_inference_model (#12582)
7 years ago
chenweihang 7797e55f42 use paddle::platform::demangle
7 years ago
minqiyang e0d5f8a820 Move compat module to python/paddle
7 years ago
chenweihang da39d84a48 refine by reviewer's advice
7 years ago
Xin Pan 891c3c0f9a test and doc IR Graph
7 years ago
minqiyang 7e0f66e99a Polish code
7 years ago
minqiyang 5338417b47 Polish code style
7 years ago
minqiyang ae39709e59 Polish code
7 years ago
minqiyang 55d7f55c63 Revert the changes to attribute.h
7 years ago
fengjiayi 5e7aa8c7e5 code clean
7 years ago
chenweihang 21d5b94228 error message refine: add demangle api to attribute type
7 years ago
minqiyang 1800fef142 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into port_pybind11
7 years ago
tensor-tang 742300baa8 fix unkown omp pragmas
7 years ago
yuyang18 05cadf1b24 Add FastExecutor
7 years ago
tensor-tang b9dbb7c5cb fix bias attri in mkldnn fc
7 years ago
yuyang18 c6eb7a89ff Merge branch 'feature/better_exception_holder' into feature/fast_executor
7 years ago