Commit Graph

201 Commits (17299b8d217c0872408cc9146a58f0769d8b05ba)

Author SHA1 Message Date
Yu Yang f1a392a5fe
Merge pull request #13804 from sneaxiy/rewrite_allocation
6 years ago
qingqing01 fd7e643153
Convolution fusion operator. (#14449)
6 years ago
Yu Yang 98bbfc17be Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
Wu Yi a2d9b34417
Refine operator cmake (#14413)
6 years ago
whs 1722678258
Make nce support more distribution. (#13549)
6 years ago
Wu Yi b32c13dc20
Add cudnn ctc loss (#12366)
6 years ago
Yu Yang c8f6e70ab4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
peizhilin 61fa5218b9 Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Yu Yang 8f9bfad246
perf(compile): speed up reduce_op compile by splitting files (#14294)
6 years ago
sneaxiy d231e55065 merge develop
6 years ago
peizhilin 7638f0afb3 simplify the logic
6 years ago
peizhilin d01a26280e Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
li099 688ed60116 Add lod tensor array to tensor op (#13990)
6 years ago
peizhilin 52f7644f53 Merge remote-tracking branch 'upstream/develop' into windows/build
6 years ago
Yu Yang fdc689142c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
chengduo c5b6573a5a
Fix input<tensor> (#14208)
6 years ago
Zhaolong Xing ba8b5619a3
Revert "cherry picked windows patches."
6 years ago
Yu Yang 057a682ee9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
peizhilin 9d67c1fb69 cpu build support
6 years ago
dzhwinter 60f70b174d test=develop
6 years ago
dzhwinter 316765839d add back jit simd instructions. stage.
6 years ago
dzhwinter bf2e4cb188 cleard. staged
6 years ago
dzhwinter ebfe5a02b3 merge develop branch
6 years ago
tensor-tang 3c957af139
Merge pull request #14080 from tensor-tang/refine/jit/crf2
6 years ago
Yu Yang c01696f8c2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
tensor-tang 21487d78bf add crf decode jit kernel
6 years ago
minqiyang 8a0f26f45f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into continue_hash_op
6 years ago
minqiyang d4f9aa0852 Add hash op implementation
6 years ago
chengduo a7497653d0
Refine Split op (#13967)
6 years ago
Yu Yang 461f71a90b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
6 years ago
tensor-tang 3c249283af init seqconv eltadd relu op
6 years ago
qingqing01 67a2b5215d
Add affine channel op to speed and save memory for faster-rcnn model. (#13919)
6 years ago
tensor-tang bcb8ea397d Merge remote-tracking branch 'ups/develop' into fea/jitkernel_peephole
6 years ago
Qiyang Min f99ea99e36
Merge pull request #13720 from velconia/fix_grad_clip
6 years ago
tensor-tang 9131a35676 replace the lstm compute with jitkernel
6 years ago
sneaxiy 4c672ab1a2 Merge reyoung:rewrite_allocation
6 years ago
minqiyang bcd8c2ccc3 Add unit test
6 years ago
minqiyang 67308822f8 Add selected_rows merge for clip_by_norm op
6 years ago
dzhwinter 26771f41ba
"fix compile error" (#13579)
7 years ago
Xin Pan 425a882165
Merge pull request #13643 from panyx0718/ir2
7 years ago
Yu Yang 5cf395beaf Fix bug in uts
7 years ago
Dun 161c3e31f7 Optimization of Kernels that related to DeepLabv3+ (#13534)
7 years ago
Xin Pan ddd60581b7 clean up channel
7 years ago
chengduo 6757a31552
[Accelerate] Refine seq_softmax_op (#13421)
7 years ago
tensor-tang 612ba41aee add simple lstm compute
7 years ago
dzhwinter c3e1fb5a3e add demo
7 years ago
dzhwinter 379b471ee2 squash commit
7 years ago
qingqing01 9bd933d3fb
Improve and fix fake_quantize_op (#13092)
7 years ago
dzhwinter 52d60f8f3e merge conclit
7 years ago
dzhwinter dbe90cc0f6 merge develop branch
7 years ago
dzhwinter b74af56bbc cpu compile is done
7 years ago
fengjiayi 7e0c9f50ae Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into dev_sequence_padding_op
7 years ago
dzhwinter cd8f3e9ed0 operator module is done
7 years ago
dzhwinter 0153c21d83 add unstack_op
7 years ago
dzhwinter 7dceb8a080 check some operators
7 years ago
dzhwinter 26dbe35c54 add msvc flags and copy lib done
7 years ago
dzhwinter eca4563e5d
operators module (#12938)
7 years ago
dzhwinter 488a2dd2e8 with ir node
7 years ago
dzhwinter cfbf1ba305 add source
7 years ago
dzhwinter 89f95ea25e merge develop branch
7 years ago
dzhwinter 34f8c9b6f5 windows port
7 years ago
dzhwinter e23ddf6ae4
status (#12764)
7 years ago
fengjiayi 34b209cffa Complete sequence_padding GPU kernel
7 years ago
dzhwinter 00463fdfe3
cudnn windows support (#12757)
7 years ago
dzhwinter 5c88cd2af5 remove werror in windows
7 years ago
dzhwinter 64ce1210aa "windows support"
7 years ago
nhzlx f55e8901c8 merge develop
7 years ago
nhzlx 1600ba86f6 1. change tensorrt op from cpu to gpu
7 years ago
tensor-tang eee38464dc refine fc op use cpu only
7 years ago
tensor-tang d84a1a0010 fc op use cpu only
7 years ago
tensor-tang 0098a494a2 Merge remote-tracking branch 'ups/develop' into refine/op/fc
7 years ago
tensor-tang 4b5986bb77 enable fc op in normal case
7 years ago
Yu Yang 8dda526a45
Merge pull request #12659 from sneaxiy/refine_softmax_with_cross_entropy
7 years ago
sneaxiy c50c537732 fix arithmetic error in backward kernel
7 years ago
sneaxiy 2c560623d1 fix dependency error
7 years ago
Bai Yifan e12b1d1792 Add flatten op (#12341)
7 years ago
chengduo 2409d0f710
Refine regularization for selected_rows (#12369)
7 years ago
Xin Pan 93355cc0d2 fix control deps
7 years ago
Yan Chunwei b42ced8eda
bugfix/tensorrt analysis fix subgraph trigger (#12266)
7 years ago
Guo Sheng da3f766821
Merge pull request #12088 from guoshengCS/complete-hsigmoid
7 years ago
chenweihang 938319bbd2
Merge branch 'develop' into unsqueeze_op
7 years ago
guosheng d695381677 Merge branch 'develop' of https://github.com/PaddlePaddle/paddle into complete-hsigmoid
7 years ago
chenweihang ca15779394 rewrite, use reshape op in unsqueeze op, test passed
7 years ago
yuyang18 1854814d49
Use reshape_op inside squeeze_op
7 years ago
gongweibao 66c91911cf
Improve brpccmake (#11842)
7 years ago
Yan Chunwei 5082642bdb
feature/analysis to support sub-graph for TRT engine (#11538)
7 years ago
tangwei12 e589005229 merge
7 years ago
Yancey1989 712adc786f polish dist cmake
7 years ago
Yancey1989 1ef6cdb60e move dist codes from operaotrs/detail to operators/distributed
7 years ago
tangwei12 1c2e9bdd49 fix cmakelist
7 years ago
weixing02 8bd148dc00 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into hsigmoid_op
7 years ago
gongweibao d9de6b8621
Add brpc surpport. (#11263)
7 years ago
gongweibao 627d7a64f8
Clean `sendop` `recv` operator. (#11309)
7 years ago
dzhwinter d48172f22a
split reduce op into multiple libraries, accelerate the compiling (#11029)
7 years ago
Yan Chunwei 4f95bc9463
feature/trt engine op test (#11182)
7 years ago
weixing02 3e46ec41a9 add hsigmoid
7 years ago
Luo Tao aa4f685b66 fix compiler error when do not have TensorRT library
7 years ago
Yan Chunwei 211e131525
feature/tensorrt engine op (#11001)
7 years ago
qingqing01 3a29821bd5
Develop a fake dequantized op for fixed-point quantization training framework. (#10965)
7 years ago
Yancey1989 0aa6f9e934 Merge branch 'develop' of github.com:PaddlePaddle/Paddle into overlap_send_op
7 years ago