159 Commits (d49990e447ff5837f79e7f1816a7e2150c88f24f)

Author SHA1 Message Date
peizhilin 19ebd8b4cf add ctc support for windows
6 years ago
nhzlx f75815b78c add prelu gpu inference
7 years ago
Qiao Longfei b9d3d75fc4 fix prefetch dependency test=develop
7 years ago
Qiao Longfei 47280ef8b4 lookup table op support prefetch
7 years ago
wopeizl d9a1f3e58e Windows/online (#14474)
7 years ago
Tao Luo 5d4d117edc Merge pull request #14502 from qingqing01/cudnn5_fix
7 years ago
Yu Yang 3edd32d070 fix(Compile): fix depends error when compile op using cub
7 years ago
Dang Qingqing cda60311f9 Fix compling with cuDNN v5
7 years ago
Yu Yang f1a392a5fe Merge pull request #13804 from sneaxiy/rewrite_allocation
7 years ago
qingqing01 fd7e643153 Convolution fusion operator. (#14449)
7 years ago
Yu Yang 98bbfc17be Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
Wu Yi a2d9b34417 Refine operator cmake (#14413)
7 years ago
whs 1722678258 Make nce support more distribution. (#13549)
7 years ago
Wu Yi b32c13dc20 Add cudnn ctc loss (#12366)
7 years ago
Yu Yang c8f6e70ab4 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
peizhilin 61fa5218b9 Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Yu Yang 8f9bfad246 perf(compile): speed up reduce_op compile by splitting files (#14294)
7 years ago
sneaxiy d231e55065 merge develop
7 years ago
peizhilin 7638f0afb3 simplify the logic
7 years ago
peizhilin d01a26280e Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
li099 688ed60116 Add lod tensor array to tensor op (#13990)
7 years ago
peizhilin 52f7644f53 Merge remote-tracking branch 'upstream/develop' into windows/build
7 years ago
Yu Yang fdc689142c Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
chengduo c5b6573a5a Fix input<tensor> (#14208)
7 years ago
Zhaolong Xing ba8b5619a3 Revert "cherry picked windows patches."
7 years ago
Yu Yang 057a682ee9 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
peizhilin 9d67c1fb69 cpu build support
7 years ago
dzhwinter 60f70b174d test=develop
7 years ago
dzhwinter 316765839d add back jit simd instructions. stage.
7 years ago
dzhwinter bf2e4cb188 cleard. staged
7 years ago
dzhwinter ebfe5a02b3 merge develop branch
7 years ago
tensor-tang 3c957af139 Merge pull request #14080 from tensor-tang/refine/jit/crf2
7 years ago
Yu Yang c01696f8c2 Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
tensor-tang 21487d78bf add crf decode jit kernel
7 years ago
minqiyang 8a0f26f45f Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into continue_hash_op
7 years ago
minqiyang d4f9aa0852 Add hash op implementation
7 years ago
chengduo a7497653d0 Refine Split op (#13967)
7 years ago
Yu Yang 461f71a90b Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into rewrite_allocation
7 years ago
tensor-tang 3c249283af init seqconv eltadd relu op
7 years ago
qingqing01 67a2b5215d Add affine channel op to speed and save memory for faster-rcnn model. (#13919)
7 years ago
tensor-tang bcb8ea397d Merge remote-tracking branch 'ups/develop' into fea/jitkernel_peephole
7 years ago
Qiyang Min f99ea99e36 Merge pull request #13720 from velconia/fix_grad_clip
7 years ago
tensor-tang 9131a35676 replace the lstm compute with jitkernel
7 years ago
sneaxiy 4c672ab1a2 Merge reyoung:rewrite_allocation
7 years ago
minqiyang bcd8c2ccc3 Add unit test
7 years ago
minqiyang 67308822f8 Add selected_rows merge for clip_by_norm op
7 years ago
dzhwinter 26771f41ba "fix compile error" (#13579)
7 years ago
Xin Pan 425a882165 Merge pull request #13643 from panyx0718/ir2
7 years ago
Yu Yang 5cf395beaf Fix bug in uts
7 years ago
Dun 161c3e31f7 Optimization of Kernels that related to DeepLabv3+ (#13534)
7 years ago