LibShalom: Optimizing Small and Irregular-shaped Matrix Multiplications on ARMv8 Multi-Core

Published in SC, 2021

Recommended citation: Weiling Yang, Jianbin Fang, Dezun Dong, Xing Su, Zheng Wang. " LibShalom: Optimizing Small and Irregular-shaped Matrix Multiplications on ARMv8 Multi-Core." SC. 2021. http://jianbinfang.github.io/files/2021-06-22-sc.pdf

This article presents LibShalom, an open-source libraryfor optimizing small and irregular-shaped GEMMs, explicitly targeting the ARMv8 architecture. LibShalom builds upon the classical Goto algorithm but tailors it to minimize the expensive memory accessing overhead for data packing and processing small matrices. It uses analytic methods to determine GEMM kernel optimization parameters, enhancing the computation and parallelization efficiencyof the GEMM kernels. Download paper here

Recommended citation: Weiling Yang, Jianbin Fang, Dezun Dong, Xing Su, Zheng Wang. (2021). “LibShalom: Optimizing Small and Irregular-shaped Matrix Multiplications on ARMv8 Multi-Core.” SC. 2021.