Commit Graph

662 Commits (d038746e1c9c20ddda5c548a3c4d666b88238116)

Author SHA1 Message Date
Yiqun Liu 071a702060
Fix the error misjudgment when there are control nodes in graph. (#23943)
5 years ago
Zeng Jinle c49791362f
Correct reader device index (#23802)
5 years ago
joanna.wozna.intel 12ba05ce0c
Add scale-matmul fuse pass (#23734)
5 years ago
Chen Weihang 532079a222
API (CompiledProgram) error message enhancement (#23559)
5 years ago
chenhaoze 9b06dd8628
Add three passes and api reference of paddle_pass_builder. test=develop (#23741)
5 years ago
joanna.wozna.intel 5ee099ca57
Op-requant squash (#23665)
5 years ago
mozga-intel 3baaee9aab
Remove: NGraph engine from PDPD repository (#23545)
5 years ago
joanna.wozna.intel 3cb5623dad
Add matmul dequant squash (#23505)
5 years ago
wangchaochaohu c1187cd6f4
Fp16 refine for fusion group (#23472)
5 years ago
joanna.wozna.intel ce08fdcf2b
Add support for INT8 matmul in C-API quantization (#23463)
5 years ago
wangchaochaohu d085f79228
fix untime fail for output var stop_gradient=True for fusion group (#23317)
5 years ago
Kaipeng Deng d223a24904
Fix inplace_abn compile error on Windows (#23464)
5 years ago
wangchaochaohu 5c60778731
polish the code of fusion group test=develop (#23370)
5 years ago
Yiqun Liu bc2981e998
Disable test_code_generator and test_post_training_quantization_mobilenetv1 (#23440)
5 years ago
joanna.wozna.intel 8c463700e1
Add default pass attributes (#23042)
5 years ago
Kaipeng Deng 21d95be0db
Add inplace abn op (#22806)
5 years ago
Zeng Jinle 3a21980b78
add reader dependency pass, test=develop (#23301)
5 years ago
wangchaochaohu d280106007
Add support for attr type Op and add fill_constant Op and scale Op (#23163)
5 years ago
Jacek Czaja 2bb1b0e89e
[DNNL] Added MKL-DNN inplace pass for C-API inference (#23315)
5 years ago
Wojciech Uss f836c8aa8f
add check for scales and a message (#23119)
5 years ago
Tao Luo c00d427d52
simplify the cmake log of ir/CMakeLists.txt (#23262)
5 years ago
Zeng Jinle bae5930ba1
fix graph attr copy issues, test=develop (#23191)
5 years ago
Zeng Jinle acfc9b8a70
Reader sequential and inference partial feed (#22699)
5 years ago
Wilber 95b356a069
update embedding_eltwise_layernorm fuse and kernel. test=develop (#23114)
5 years ago
Yiqun Liu 3af4771122
Add the detection and code-generation of sqrt and square in fusion_group (#23095)
5 years ago
Sylwester Fraczek abee05a8c8
added mkldnn swish activation (#23041)
5 years ago
wangchaochaohu 3757e0687c
Add Unittest for backward of fusion group (#22932)
5 years ago
wangchaochaohu f0d193a23c
Cast fusion for fusion group (#22876)
5 years ago
Wilber ff3ddbb502
add skip_layernorm pass. test=develop (#22895)
5 years ago
Zhaolong Xing 8d6dc102fe
[Ernie GPU Optimize]: Embedding_eltwise_layernorm Fuse (#22494)
5 years ago
liu zhengxi 61fef9754b
Fix fc padding bug during inference fusion (#22860)
5 years ago
wangchaochaohu ca9e77a8d4
add sum op support for fusion group (#22771)
5 years ago
tianshuo78520a 433cef03e5
fix typo word (#22784)
5 years ago
GaoWei8 cdf5f6fb8c
Add an inference interface to disable FC padding (#22097)
5 years ago
tianshuo78520a d2ba91aad1
fix typo words (#22653)
5 years ago
Yiqun Liu 22bbd54719
Add the support of fp16 in fusion_group (#22239)
5 years ago
Wilber 9a8203aa25
fix fc_lstm_fuse when multi sub-graph use same fc_bias. test=develop (#22551)
5 years ago
Zhaolong Xing 8acd745c25
[Ernie GPU Optim]: Fuse three fc to multihtead matmul (#22486)
5 years ago
Yiqun Liu dcfb603897
Enable the detection of subgraph composed of grad ops (#21223)
5 years ago
joanna.wozna.intel 17f2c0899f
Add dequant-scale squash (#22409)
5 years ago
Wilber 7bc4b09500
add WITH_NCCL option for cmake. (#22384)
5 years ago
石晓伟 e1b0d7cbb1
remove anakin from code, test=develop (#22420)
5 years ago
Michał Gallus 269db0d1d1
[DNNL] Fix accuracy in INT8 FC (#22404)
5 years ago
joanna.wozna.intel 3099d9d47c Restore requantize squash (#22399)
5 years ago
Yiqun Liu b7cac50b64
Implement a common python unittest to test the ir passes. (#22209)
5 years ago
lidanqing 895f8da7d6 change std::cout to log(INFO), vlog (#22316)
5 years ago
Zhen Wang e40cfb1010
fix the bug of assert_is_op_output. test=develop (#22262)
5 years ago
Wojciech Uss d3a6647372 improve placement pass tests code coverage (#22197)
5 years ago
Zhen Wang 46189b166d Add bn and relu fuse pass (#22048)
5 years ago
joanna.wozna.intel 5b2e98aa17 Add multiple quantize operators fuse (#22062)
5 years ago
Yiqun Liu 96980c2244
Polish the PADDLE_ENFORCE in fusion_group pass related codes. (#22144)
5 years ago
liu zhengxi 724b13e459
fix xception precision problem, test=develop (#22124)
5 years ago
Yiqun Liu b1401fb74d Remove subgraph_detector from inference/analysis to the common framework/ir directory. (#22094)
5 years ago
Yiqun Liu d48320777e
Add the first implememtation of fusion_group op (#19621)
5 years ago
Michał Gallus 6192108408 [DNNL] 3D Fully-Connected (#21746)
5 years ago
liu zhengxi 196e20dfbb
Fix multi-threads memory out of bounds error for passes (#21920)
5 years ago
石晓伟 03479469a7
fix multi-thread error of fc_gru_fuse_pass.cc, test=develop (#21841)
5 years ago
Pei Yang 3e5008ad01
fix trt calib not working bug, test=develop (#21934)
5 years ago
Aurelius84 51a86d2b6b Optimize adam speed (#21777)
5 years ago
lidanqing d3a96632fa Add fc-dequantize squash in cpu_quantize_squash_pass for ernie model (#21714)
5 years ago
joanna.wozna.intel d419b859c0 Add reshape int8 mkldnn op (#21428)
5 years ago
Tao Luo 01fa4ead61
fix -Wno-error=sign-compare warning in gcc8 (#21434)
5 years ago
Zeng Jinle 89966525f1
Polish reference count pass (#21324)
5 years ago
GaoWei8 8493f20ebc Polish the codes of fc when needs padding (#21378)
5 years ago
Michał Gallus 5d7d548275 INT8 Fully-connected (#17641)
5 years ago
GaoWei8 234060f88f Add fc padding to improve mkl GEMM's performance when N and K are multiple of 128. (#20972)
5 years ago
zhouwei25 345b67b5e2 remove warning LNK4006 and warning LNK4221 (#21226)
5 years ago
Yiqun Liu c918788ba9 Disable fusion_group pass for windows and mac. We will do some experiments on Linux first. (#21310)
5 years ago
Chen Weihang 952508527a
Polish some PE code details (#21274)
5 years ago
Yiqun Liu 6b1e1f0dda
Enable generating code for a given subgraph. (#21126)
5 years ago
Zeng Jinle cdb3d27985
Fix warn of gcc8 (#21205)
5 years ago
Zhaolong Xing 65f7052554
TRT int8: refine trt int8 for dynamic range set (#21112)
5 years ago
Chen Weihang 8da0cd537a
Add examples for error message writing specification - NotFound, OutOfRange, AlreadyExists, PermissionDenied (#21134)
5 years ago
Chen Weihang 8414575b78
Add examples for error message writing specification - PreconditionNotMet, Unimplemented, Unavailable (#21137)
5 years ago
WangXi de5d3ff688 Fix dgc buffer illegal & reuse velocity (#21012)
5 years ago
Chen Weihang 826254f664
Add pre-condition check for fuse optimizer op pass (#21005)
5 years ago
Yiqun Liu 9091f8cdf9
Support generating code for grad_op (#21066)
5 years ago
joanna.wozna.intel 77c2083586 Add transpose2 INT8 for mkl-dnn (#19424)
5 years ago
Zeng Jinle 878a40f57d
Support NoNeedBufferVarsInference in dygraph backward (#20868)
5 years ago
Wilber c534149642
fix squared_mat_sub_fuse_pass when elementwise_op input is from persistable param test=develop (#20960)
5 years ago
WangXi eec4fa9099 And Enforce to fuse pass for DGC doesn't support fuse for now, test=develop (#20935)
5 years ago
hong 8c4573a3cb
GradMaker for dygraph (#19706)
5 years ago
Yiqun Liu b5f3be8330
Implement a pass detect fusion group of elementwise op (#19884)
5 years ago
wangchaochaohu ba45dce35d
fix codetest for windows make test=develop (#20796)
5 years ago
石晓伟 48a774c713
fix ts_sort's bug, test=develop (#20720)
5 years ago
wopeizl 9e5948230e
add support to gcc8, add docker env test=develop (#19807)
5 years ago
WangXi cadc6a9704 fix dgc test and bug when not set trainers_endpoints_, test=develop (#20617)
5 years ago
Pei Yang 443f604c3b
add DisableGlogInfo() to AnalysisConfig, test=develop (#20581)
5 years ago
zhaoyuchen2018 b8333edef6
Add Multihead matmul fuse pass (#20167)
5 years ago
Adam 7faa3e9555 Add ConvTranspose + BatchNorm fuse pass (#20161)
5 years ago
bingyanghuang 9de6772510 Follow comment of Merged QAT PR 18970 (#19979)
5 years ago
joanna.wozna.intel f5221ac19f Disable conv requant squash (#20041)
5 years ago
wangchaochaohu c9ea317b36
codegen code for reconstruction (#19728)
5 years ago
chengduo 101a2b610a Add dtype for coalesce_tensor_op (#20016)
5 years ago
joanna.wozna.intel 3f1d0234ae Fix conv2d+dequantize squash for residual fusion (#19545)
6 years ago
Yiqun Liu 3cd985a669
Add a pass to fuse fc+elementwise_add+layernorm (#19776)
6 years ago
Zeng Jinle 3fd3b663a8
fix gc bug in controlflow ops, test=develop (#19827)
6 years ago
Zeng Jinle db26de8389
[Bug fix] Disable memory reuse on feeded variables (#19835)
6 years ago
chengduo 8281497030
Fix warning info of build_strategy (#19805)
6 years ago
Yiqun Liu c67c8758cb
Enhance fc_fuse_pass to enable fusing relu to fc_op (#19733)
6 years ago
chengduo 056fdedde3
Open fuse all reduce option (#19765)
6 years ago
chengduo e506c99c20
Open fuse broadcast option (#18833)
6 years ago
Yiqun Liu a65c728e5d
Implement the GPU kernel of fc operator (#19687)
6 years ago
chengduo 5866a7a5fe
Enable fused_all_reduce_op_handle support GPU and CPU Gradients (#19418)
6 years ago
wangchaochaohu ed8f44ea21
codegen for fused elementwise operation (#19520)
6 years ago
baojun a3a4b6e570 Enable ngraph through build_strategy (#19266)
6 years ago
Tao Luo 75d1571995
refine PADDLE_ENFORCE codes for unify PADDLE_ASSERT_MSG (#19603)
6 years ago
Yiqun Liu c5548178b0
A a pass to enable the use of cudnn (#19346)
6 years ago
Yiqun Liu fcec365d29
Add a pass to replace dropout_op with scale_op when is_test is true (#19297)
6 years ago
tangwei12 65c7368400
Fix the correctness of async mode at distributed training (#18863)
6 years ago
joanna.wozna.intel 2e3ec66be0 Add conv dequant squash for int8 (#18905)
6 years ago
Tao Luo c82280e445
remove unused conv_elementwise_add2_act_fuse.cc (#19344)
6 years ago
Adam 97d1db1874 Add generalized Conv+Activation MKLDNN fuse pass creation Part2 (#19237)
6 years ago
Zhaolong Xing 76c95af000
Fix BUG: Mask RCNN inference diff When using AnalysisPredictor. (#19213)
6 years ago
liuwei1031 50582071dc
fix compilation issue in windows vs2017 (#19183)
6 years ago
juncaipeng 5368b36512 remove the warning for reminding user to avoid using the OriginProgram method, test=develop (#19244)
6 years ago
Adam b837689e97 Add generalized Conv+Activation MKLDNN fuse pass creation (#19072)
6 years ago
joanna.wozna.intel 492a00f53e Add conv reqantize squash (#18754)
6 years ago
joanna.wozna.intel bce72c7fea Replace Relu with bounded Relu in MobileNetV2 quantization (#18988)
6 years ago
chengduo e044e84264
open fuse_all_optimizer_ops (#19087)
6 years ago
chengduo 17d62ab220
Enhance fuse optimization op pass (#19010)
6 years ago
Zeng Jinle 2175d19993
fix memory_reuse_pass memory_size calculation error, test=develop (#19020)
6 years ago
Zeng Jinle 7ac748adb4
Open gc by default (#18836)
6 years ago
石晓伟 ee2f296ef8
Fusion: seqpool_cvm_concat (#18471)
6 years ago
Zeng Jinle 8008ab4e6b
Remove legacy C++ memory optimization codes (#18834)
6 years ago
chengduo 4140fe11a4
Open fuse optimization ops (#18741)
6 years ago
Zeng Jinle a802da650b
Feature/mem opt pass refactor (#18735)
6 years ago
Zhaolong Xing 26ae6d49e4
Update trt5 for paddle-trt (#18645)
6 years ago
chengduo fd3aad6cb3
Make fuse_optimizer_op_pass also work when the model contains sparse gradients. (#18664)
6 years ago
Huihuang Zheng 89bc3fd841
Support memory eager deletion on recurrent OP (#17710)
6 years ago
Zeng Jinle d3003a1620
Feature/buffer_shared_inplace (#17911)
6 years ago
Zhaolong Xing 88b52a27fe
Inference: fix mask rcnn model diff, optim memory usage, memory leak. (#18532)
6 years ago
gongweibao 160ddc980c
Regroup fusion by date type. (#18496)
6 years ago
chengduo 7453857324 Make fuse_all_reduce_op_pass support mix_precision (#17652)
6 years ago
Michał Gallus 7023a86c3a Fix Pooling output scale (#18186)
6 years ago
Sylwester Fraczek 9252e8fa08 add int8 mkldnn prior_box (#17242)
6 years ago
chengduo 14e1e165df
update alloc_continuous_space_for_grad_pass (#18287)
6 years ago
gongweibao f5caf3443c
Fix reinitialized ncclid error! (#18025)
6 years ago
gongweibao da9143c1cc
Polish codes of old prs. (#17938)
6 years ago
石晓伟 bce259e5bf
Update the Anakin interfaces for content-dnn and MLU (#17890)
6 years ago
Zeng Jinle 3ece61f71e
Remove attribute in Allocator::Allocate (#17878)
6 years ago
gongweibao 972c54cd70
Fix FLAGS_fuse_parameter_memory_size unit from Bytes to MBytes. (#17924)
6 years ago
gongweibao fbbdc9ccad
Add backward and optimizer operator dependency pass. (#17746)
6 years ago
Yiqun Liu 8fd39f3e99
Enhance fused_elementwise_activation op and add python api in contrib.layers (#17236)
6 years ago
mozga-intel 5eb81fe595 Capi for a ngraph engine (#17037)
6 years ago
Jacek Czaja 6d8075ecef [MKL-DNN] conv_transpose mkldnn bias pass (#17644)
6 years ago
Sylwester Fraczek 96845d2168 add Concat quantization (#17448)
6 years ago
gongweibao 65bbf950ee
Add multi-ncclcomm and 2D ncclallreduce support. (#17263)
6 years ago
Zhaolong Xing 61221ebc28
TRT: Support set dynamic range in int8 mode. (#17524)
6 years ago
Michał Gallus 0c39b97b4e [MKL-DNN] Add Fully Connected Op for inference only(#15226)
6 years ago