Commit Graph

57 Commits (87852616aaf2517567a68d6b7dd5a61ab3857380)

Author SHA1 Message Date
Jacek Czaja 23d96cf221
[oneDNN] bumpup onednn 2.2 fixup version (#31473)
4 years ago
jakpiase 2f1165342b
OneDNN hardswish integration (#30211)
4 years ago
Jacek Czaja d3f09ad702
Update of onednn to 2.2 (#31067)
4 years ago
joanna.wozna.intel caf9d39839
Add Conv Transpose BF16 (#30877)
4 years ago
Jacek Czaja 173660be7b
[oneDNN] Cache oneDNN stream not to recreate in each oneDNN op (#30358)
4 years ago
Jacek Czaja f6cca62575
[oneDNN] Making ThreadID info in caching key optional (#29272)
5 years ago
jakpiase 57a4f16d9e
added internal and external reorders to profiler (#29443)
5 years ago
Adam Osewski 4096ff94dc
Small optimizations for conv2d kernel subroutines. (#29188)
5 years ago
Tao Luo e1e666a05f
fix conv mkldnn build error (#28288)
5 years ago
Jacek Czaja c11d9b3035
[oneDNN ] conv2d fwd&bwd optimization (#27871)
5 years ago
lidanqing 7cb4a8b8f2
[oneDNN] Conv dilation support (#27914)
5 years ago
joanna.wozna.intel b0ee1405f7
Add conv2d bfloat16 support (#27325)
5 years ago
wanghuancoder df43905f12
use iwyu clean include (#27267)
5 years ago
Jacek Czaja f6653c71e9
[oneDNN] Fix to conv2d grad with groups (#27006)
5 years ago
Jacek Czaja 5e874cc333
- Cosmetic fixes to align with PADDLE_ENFORCE guidelines (#26891)
5 years ago
joanna.wozna.intel 559e43eee4
Small change in conv2d and quantize pass (#26671)
5 years ago
Jacek Czaja 7dbc441eab
[oneDNN] cache cosmetics improvement (#25576)
5 years ago
Adam bd0b38e671
Refactor of conv fp32 oneDNN operator (#25137)
5 years ago
lidanqing c3c61d34c1
Update PADDLE_ENFORCE in DNNL related ops (#24333)
5 years ago
pawelpiotrowicz db2b6b6568
Hide globals & redesign restore PR (#24279)
5 years ago
FDInSky 526a211774
update conv error info (#24430)
5 years ago
Adam 5842ae6785
Revert "Change ShareDataWith() to TensorCopy() in conv_mkldnn (#22695)" (#22985)
5 years ago
Adam 056edf3929
Change ShareDataWith() to TensorCopy() in conv_mkldnn (#22695)
5 years ago
Adam e81f0228df MKL-DNN 1.0 Update (#20162)
6 years ago
Jacek Czaja 8f5a93a07b - Fix to regression in performance of ResNet-50 training (#21588)
6 years ago
Jacek Czaja 18a5d30754 [MKL-DNN] Conv2d and Conv2d transpose MKL-DNN NHWC support (#21466)
6 years ago
hong ac8546701d
Add dygraph execution context (#20157)
6 years ago
Adam 3fda695bb0 Add support for asymetric padding in MKLDNN pool, conv and conv_transpose (#21062)
6 years ago
Adam 67b59ddb38 Minor MKL-DNN conv int8 performance fixes (#20753)
6 years ago
lidanqing 46e93f7c86 Revert "Refactor conv computeINT8" (#20640)
6 years ago
Jacek Czaja a1cd27f13f [MKL-DNN] Added mkl-dnn cache clearing when creating Executor instance (#20241)
6 years ago
lidanqing 2c32c2d649 Refactor conv computeINT8 (#19574)
6 years ago
Adam d4413a54bc Add common CreateKey for mkldnn handlers (#19767)
6 years ago
Adam 428b2b9e17 MKLDNN handler cleanup (#19713)
6 years ago
Adam 8d6d95cc2b paddle::framework::vectorize() templatization (#19611)
6 years ago
Adam e94b26daf5 using MKLDNNMemoryFormat = mkldnn::memory::format changes (#19568)
6 years ago
lidanqing ba368bf696 clean up intel labeled TODOs (#19476)
6 years ago
Adam 97d1db1874 Add generalized Conv+Activation MKLDNN fuse pass creation Part2 (#19237)
6 years ago
Adam b837689e97 Add generalized Conv+Activation MKLDNN fuse pass creation (#19072)
6 years ago
joanna.wozna.intel bce72c7fea Replace Relu with bounded Relu in MobileNetV2 quantization (#18988)
6 years ago
Jacek Czaja cfcb96d2df [MKL-DNN] Fix int8 performance regression (#18758)
6 years ago
lidanqing 9ecd8ee789 change ComputeINT8 to template version to remove checking dst_datatype code (#18756)
6 years ago
Jiabin Yang 667f88f9a6
Fix/gcc 4.8 ubt link error (#18558)
6 years ago
Leo Zhao 681d3553f1 Fix potential mkldnn concat/pool/conv kernel issues (#18393)
6 years ago
lidanqing f8ecc3de89 refactor the function ConvFwdPrimitiveDesc (#17897)
6 years ago
Zeng Jinle 3ece61f71e
Remove attribute in Allocator::Allocate (#17878)
6 years ago
Yihua Xu 14a32bf0c4 Fix the accuracy issue while using float precision to get the scale. (#17884)
6 years ago
lidanqing 04b6c29ee0 Improve mobilenetv2 INT8 performance by using INT8 relu as post-op (#17570)
6 years ago
guomingz 2281ebf0f3 Enable the convolution/relu6(bounded_relu) fusion for FP32 on Intel platform. (#17130)
6 years ago
Jacek Czaja 87a44b1149 [MKL-DNN] Added reusing of primitive descriptors (fp32) (#16667)
6 years ago