Commit Graph

212 Commits (9bf70ed75274bd4ca9d6fde6e171231ddb5dfe36)

Author SHA1 Message Date
Huihuang Zheng 8a1a2af82e
Add Assert Op (#24280)
5 years ago
Tao Luo d7850dcdc0
add noavx_axpy and noavx_axpy_noadd (#24207)
5 years ago
Yiqun Liu ecfddebbef
Add the implementation of inverse (#23310)
5 years ago
mozga-intel 3baaee9aab
Remove: NGraph engine from PDPD repository (#23545)
5 years ago
Kaipeng Deng d223a24904
Fix inplace_abn compile error on Windows (#23464)
5 years ago
Kaipeng Deng 21d95be0db
Add inplace abn op (#22806)
5 years ago
Jacek Czaja 012886df79
[DNNL] Softmax mkldnn op inplace support (#23197)
5 years ago
Zhaolong Xing 430b0099c9
[Paddle-TRT]: Ernie Dynamic shape support. (#23138)
5 years ago
Wilber 7bc4b09500
add WITH_NCCL option for cmake. (#22384)
5 years ago
石晓伟 e1b0d7cbb1
remove anakin from code, test=develop (#22420)
5 years ago
石晓伟 ad0dfb17c1
[Feature] Lite subgraph (#22114)
5 years ago
Zeng Jinle 89966525f1
Polish reference count pass (#21324)
5 years ago
Huihuang Zheng 1957192f05
Add select_input_op and select_output_op (#21016)
5 years ago
Zeng Jinle 878a40f57d
Support NoNeedBufferVarsInference in dygraph backward (#20868)
5 years ago
Yiqun Liu 03ba0fdae6
Move the codes of fused operators to operators/fused directory. (#20881)
5 years ago
Aurelius84 aacd16dbb4 add pyramid_hash_op (#20698)
5 years ago
Tao Luo efbdad0596
make search_compute support avx default (#20779)
5 years ago
Zeng Jinle ab575de725 Fix op run log when memory optimization strategy is enabled (#20695)
5 years ago
zhaoyuchen2018 e867366805
Add multihead op for ernie opt (#19933)
5 years ago
Wilber 276b5e3440
fix compile paddle with anakin bug
5 years ago
chengjuntao 00efd1d8a9
add deformable conv v1 op and cpu version of deformable conv v2 (#18500)
5 years ago
Zeng Jinle 0daa5c9772
Make leaky relu inplacable (#19676)
6 years ago
Yiqun Liu a65c728e5d
Implement the GPU kernel of fc operator (#19687)
6 years ago
hutuxian 1ca6ea0318
fix cmakelist deps (#19668)
6 years ago
Aurelius84 78a3d837f8
Add match_matrix_tensor op (#18525)
6 years ago
Kevin e681d65515 Add var_conv_2d op (#18518)
6 years ago
chengduo fd3aad6cb3
Make fuse_optimizer_op_pass also work when the model contains sparse gradients. (#18664)
6 years ago
HaoRen b7128bac5f supports collective communicated training (#18175)
6 years ago
石晓伟 bce259e5bf
Update the Anakin interfaces for content-dnn and MLU (#17890)
6 years ago
Bai Yifan bba57cdd82
Add deformable conv v2 op,test=develop (#17145)
6 years ago
chengduo 5a6ab38013 Add record event And remove CSP (#17447)
6 years ago
gongweibao cbdb8a17b1
Polish DGC code (#16818)
6 years ago
gongweibao eb83abeac3
Add DGC(Deep Gradient Compression) interface. (#15841)
6 years ago
nhzlx f3a2e4b3d8 1. Add ANAKIN_ROOT compile option
6 years ago
nhzlx b21770a2aa cherry-pick from feature/anakin-engine: Add subgraph fuse support and anakin engine #16018
6 years ago
qingqing01 86e912c544 Fix windows compiling (#16230)
6 years ago
qingqing01 8ad672a287
Support sync batch norm. (#16121)
6 years ago
xuezhong 1dad36f6aa
Merge pull request #15609 from xuezhong/add_sample_logits_op
6 years ago
Yiqun Liu 7d96c74ab2
Initialize the benchmark tester for operator. (#15772)
6 years ago
xuezhong 58ad40cc15 add sample_logits op
6 years ago
baojun efce25673c Adding ngraph_engine_op (#14948)
6 years ago
Yiqun Liu 3008fa1261
Add the CUDA kernel for beam_search op (#15020)
6 years ago
zhaozhehao e2ba9668b4 Tree conv op (#15217)
6 years ago
peizhilin dba009dbbf fix script issue
6 years ago
peizhilin 01c00b07dd fix test issues on windows
6 years ago
tensor-tang 693e5e65ce
Merge pull request #14958 from tensor-tang/refine/jit
6 years ago
Zeng Jinle 95cbe07c40
Merge pull request #14836 from sneaxiy/feature/py_func
6 years ago
tensor-tang 20392be001 Merge remote-tracking branch 'ups/develop' into refine/jit
6 years ago
peizhilin 19ebd8b4cf add ctc support for windows
6 years ago
sneaxiy deb0d41cea fix cmake
6 years ago