Commit Graph

199 Commits (fc02c2995e7cde4df837997efee02d31feeb411b)

Author SHA1 Message Date
Zeng Jinle 878a40f57d
Support NoNeedBufferVarsInference in dygraph backward (#20868)
5 years ago
Yiqun Liu 03ba0fdae6
Move the codes of fused operators to operators/fused directory. (#20881)
5 years ago
Aurelius84 aacd16dbb4 add pyramid_hash_op (#20698)
5 years ago
Tao Luo efbdad0596
make search_compute support avx default (#20779)
5 years ago
Zeng Jinle ab575de725 Fix op run log when memory optimization strategy is enabled (#20695)
5 years ago
zhaoyuchen2018 e867366805
Add multihead op for ernie opt (#19933)
5 years ago
Wilber 276b5e3440
fix compile paddle with anakin bug
5 years ago
chengjuntao 00efd1d8a9
add deformable conv v1 op and cpu version of deformable conv v2 (#18500)
5 years ago
Zeng Jinle 0daa5c9772
Make leaky relu inplacable (#19676)
6 years ago
Yiqun Liu a65c728e5d
Implement the GPU kernel of fc operator (#19687)
6 years ago
hutuxian 1ca6ea0318
fix cmakelist deps (#19668)
6 years ago
Aurelius84 78a3d837f8
Add match_matrix_tensor op (#18525)
6 years ago
Kevin e681d65515 Add var_conv_2d op (#18518)
6 years ago
chengduo fd3aad6cb3
Make fuse_optimizer_op_pass also work when the model contains sparse gradients. (#18664)
6 years ago
HaoRen b7128bac5f supports collective communicated training (#18175)
6 years ago
石晓伟 bce259e5bf
Update the Anakin interfaces for content-dnn and MLU (#17890)
6 years ago
Bai Yifan bba57cdd82
Add deformable conv v2 op,test=develop (#17145)
6 years ago
chengduo 5a6ab38013 Add record event And remove CSP (#17447)
6 years ago
gongweibao cbdb8a17b1
Polish DGC code (#16818)
6 years ago
gongweibao eb83abeac3
Add DGC(Deep Gradient Compression) interface. (#15841)
6 years ago
nhzlx f3a2e4b3d8 1. Add ANAKIN_ROOT compile option
6 years ago
nhzlx b21770a2aa cherry-pick from feature/anakin-engine: Add subgraph fuse support and anakin engine #16018
6 years ago
qingqing01 86e912c544 Fix windows compiling (#16230)
6 years ago
qingqing01 8ad672a287
Support sync batch norm. (#16121)
6 years ago
xuezhong 1dad36f6aa
Merge pull request #15609 from xuezhong/add_sample_logits_op
6 years ago
Yiqun Liu 7d96c74ab2
Initialize the benchmark tester for operator. (#15772)
6 years ago
xuezhong 58ad40cc15 add sample_logits op
6 years ago
baojun efce25673c Adding ngraph_engine_op (#14948)
6 years ago
Yiqun Liu 3008fa1261
Add the CUDA kernel for beam_search op (#15020)
6 years ago
zhaozhehao e2ba9668b4 Tree conv op (#15217)
6 years ago
peizhilin dba009dbbf fix script issue
6 years ago
peizhilin 01c00b07dd fix test issues on windows
6 years ago
tensor-tang 693e5e65ce
Merge pull request #14958 from tensor-tang/refine/jit
6 years ago
Zeng Jinle 95cbe07c40
Merge pull request #14836 from sneaxiy/feature/py_func
6 years ago
tensor-tang 20392be001 Merge remote-tracking branch 'ups/develop' into refine/jit
6 years ago
peizhilin 19ebd8b4cf add ctc support for windows
6 years ago
sneaxiy deb0d41cea fix cmake
6 years ago
sneaxiy 8760d23c7d featue/py_func
6 years ago
tensor-tang 53709e7e61 refine names
6 years ago
tensor-tang fab0ee8757 Merge remote-tracking branch 'ups/develop' into refine/jitkernel
6 years ago
tensor-tang 77236e33fc init jitkernel
6 years ago
nhzlx f75815b78c add prelu gpu inference
6 years ago
Qiao Longfei b9d3d75fc4 fix prefetch dependency test=develop
6 years ago
Qiao Longfei 47280ef8b4 lookup table op support prefetch
6 years ago
wopeizl d9a1f3e58e Windows/online (#14474)
6 years ago
Tao Luo 5d4d117edc
Merge pull request #14502 from qingqing01/cudnn5_fix
6 years ago
Yu Yang 3edd32d070 fix(Compile): fix depends error when compile op using cub
6 years ago
Dang Qingqing cda60311f9 Fix compling with cuDNN v5
6 years ago
Yu Yang f1a392a5fe
Merge pull request #13804 from sneaxiy/rewrite_allocation
6 years ago
qingqing01 fd7e643153
Convolution fusion operator. (#14449)
6 years ago