You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
mindspore/mindspore/lite/nnacl/assembly/arm32
zhanyuan f2256aca38
[MSLITE] Fix the arm v7a assembly's bug
5 years ago
..
ConvDw3x3Int8BorderPixel.S [MSLITE][Develop] optimize arm cpu int8 depthwise: 3x3 support perchannel 5 years ago
ConvDwFp32Border.S add arm32 fp32 op 5 years ago
ConvDwFp32Center.S add arm32 fp32 op 5 years ago
ConvDwFp32Row.S [MSLITE][Develop] fix bug of arm32 fp32 cpu depthwise 5 years ago
ConvDwInt8Center.S fix arm32 ConvDwInt8Center.S 5 years ago
ConvDwInt8PostAlign4.S [MSLITE][Develop] arm cpu int8 conv depthwise support arm32 5 years ago
ConvDwInt8PostAlign4PerChannel.S [MSLITE][Develop] support conv_depthwise arm32 int8 weight perchannel 5 years ago
ConvDwInt8Row.S [MSLITE][Develop] arm cpu int8 conv depthwise support arm32 5 years ago
DeconvDwFp32Center.S move nnacl to lite/ 5 years ago
DeconvDwInt8Center.S [MSLITE][Develop] reopen arm cpu int8 deconv depthwise assembly 5 years ago
DeconvDwInt8Post.S [MSLITE][Develop] optimize arm cpu int8 op deconv dw, add post assembly 5 years ago
IndirectGemmInt8_2x4.S enable int8 kernel on arm32 5 years ago
IndirectGemmInt16to32_8x4.S move nnacl to lite/ 5 years ago
MatVecMulFp32.S [MSLITE] Support GEMV for ARM v7a 5 years ago
MatmulFp32.S syc op_base act type with schema 5 years ago
MatmulFp32Opt.S optimization for fp32 winograd on arm32 5 years ago
MatmulFp32Opt12x4.S [MSLITE] Fix the arm v7a assembly's bug 5 years ago
MatmulInt8.S Support per-channel quantization of int8 matmul for arm32 platform 5 years ago
MatmulInt8Opt.S optimization for int8 matmul kernel on arm32 5 years ago
MatmulWinogradFp32.S arm32 winograd init optimize 5 years ago
PostFuncBiasReluC4.S [MSLITE] fp16 run test with settable accuracy 5 years ago
PostFuncBiasReluC8.S post deconv assembly for arm32 5 years ago
PreSum4x16Int8Peroc.S [MSLITE] conv1x1 int8 peroc inputsum bug 5 years ago
PreSum4x16Int8Pert.S [MSLITE][Develop]Conv1x1 preTrasn neon code -> .S 5 years ago
TiledC4MatmulFp32.S add fp32 deconv kernels 5 years ago
WinogradTransLeft.S add fp32 deconv kernels 5 years ago
WinogradTransRight.S add fp32 deconv kernels 5 years ago