Commit Graph

500 Commits (7e68bc896bc6e01d22f00f26cfa9ec076e6c293a)

Author SHA1 Message Date
Zeng Jinle 89966525f1 Polish reference count pass (#21324)
6 years ago
GaoWei8 8493f20ebc Polish the codes of fc when needs padding (#21378)
6 years ago
Michał Gallus 5d7d548275 INT8 Fully-connected (#17641)
6 years ago
GaoWei8 234060f88f Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
6 years ago
zhouwei25 345b67b5e2 remove warning LNK4006 and warning LNK4221 (#21226)
6 years ago
Yiqun Liu c918788ba9 Disable fusion_group pass for windows and mac. We will do some experiments on Linux first. (#21310)
6 years ago
Chen Weihang 952508527a Polish some PE code details (#21274)
6 years ago
Yiqun Liu 6b1e1f0dda Enable generating code for a given subgraph. (#21126)
6 years ago
Zeng Jinle cdb3d27985 Fix warn of gcc8 (#21205)
6 years ago
Zhaolong Xing 65f7052554 TRT int8: refine trt int8 for dynamic range set (#21112)
6 years ago
Chen Weihang 8da0cd537a Add examples for error message writing specification - NotFound, OutOfRange, AlreadyExists, PermissionDenied (#21134)
6 years ago
Chen Weihang 8414575b78 Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137)
6 years ago
WangXi de5d3ff688 Fix dgc buffer illegal & reuse velocity (#21012)
6 years ago
Chen Weihang 826254f664 Add pre-condition check for fuse optimizer op pass (#21005)
6 years ago
Yiqun Liu 9091f8cdf9 Support generating code for grad_op (#21066)
6 years ago
joanna.wozna.intel 77c2083586 Add transpose2 INT8 for mkl-dnn (#19424)
6 years ago
Zeng Jinle 878a40f57d Support NoNeedBufferVarsInference in dygraph backward (#20868)
6 years ago
Wilber c534149642 fix squared_mat_sub_fuse_pass when elementwise_op input is from persistable param test=develop (#20960)
6 years ago
WangXi eec4fa9099 And Enforce to fuse pass for DGC doesn't support fuse for now, test=develop (#20935)
6 years ago
hong 8c4573a3cb GradMaker for dygraph (#19706)
6 years ago
Yiqun Liu b5f3be8330 Implement a pass detect fusion group of elementwise op (#19884)
6 years ago
wangchaochaohu ba45dce35d fix codetest for windows make test=develop (#20796)
6 years ago
石晓伟 48a774c713 fix ts_sort's bug, test=develop (#20720)
6 years ago
wopeizl 9e5948230e add support to gcc8, add docker env test=develop (#19807)
6 years ago
WangXi cadc6a9704 fix dgc test and bug when not set trainers_endpoints_, test=develop (#20617)
6 years ago
Pei Yang 443f604c3b add DisableGlogInfo() to AnalysisConfig, test=develop (#20581)
6 years ago
zhaoyuchen2018 b8333edef6 Add Multihead matmul fuse pass (#20167)
6 years ago
Adam 7faa3e9555 Add ConvTranspose + BatchNorm fuse pass (#20161)
6 years ago
bingyanghuang 9de6772510 Follow comment of Merged QAT PR 18970 (#19979)
6 years ago
joanna.wozna.intel f5221ac19f Disable conv requant squash (#20041)
6 years ago
wangchaochaohu c9ea317b36 codegen code for reconstruction (#19728)
6 years ago
chengduo 101a2b610a Add dtype for coalesce_tensor_op (#20016)
6 years ago
joanna.wozna.intel 3f1d0234ae Fix conv2d+dequantize squash for residual fusion (#19545)
6 years ago
Yiqun Liu 3cd985a669 Add a pass to fuse fc+elementwise_add+layernorm (#19776)
6 years ago
Zeng Jinle 3fd3b663a8 fix gc bug in controlflow ops, test=develop (#19827)
6 years ago
Zeng Jinle db26de8389 [Bug fix] Disable memory reuse on feeded variables (#19835)
6 years ago
chengduo 8281497030 Fix warning info of build_strategy (#19805)
6 years ago
Yiqun Liu c67c8758cb Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733)
6 years ago
chengduo 056fdedde3 Open fuse all reduce option (#19765)
6 years ago
chengduo e506c99c20 Open fuse broadcast option (#18833)
6 years ago
Yiqun Liu a65c728e5d Implement the GPU kernel of fc operator (#19687)
6 years ago
chengduo 5866a7a5fe Enable fused_all_reduce_op_handle support GPU and CPU Gradients (#19418)
6 years ago
wangchaochaohu ed8f44ea21 codegen for fused elementwise operation (#19520)
6 years ago
baojun a3a4b6e570 Enable ngraph through build_strategy (#19266)
6 years ago
Tao Luo 75d1571995 refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603)
6 years ago
Yiqun Liu c5548178b0 A a pass to enable the use of cudnn (#19346)
6 years ago
Yiqun Liu fcec365d29 Add a pass to replace dropout_op with scale_op when is_test is true (#19297)
6 years ago
tangwei12 65c7368400 Fix the correctness of async mode at distributed training (#18863)
6 years ago
joanna.wozna.intel 2e3ec66be0 Add conv dequant squash for int8 (#18905)
6 years ago
Tao Luo c82280e445 remove unused conv_elementwise_add2_act_fuse.cc (#19344)
6 years ago