LibShalom: Optimizing Small and Irregular-shaped Matrix Multiplications on ARMv8 Multi-Core
Published in SC, 2021
Recommended citation: Weiling Yang, Jianbin Fang, Dezun Dong, Xing Su, Zheng Wang. "LibShalom: Optimizing Small and Irregular-shaped Matrix Multiplications on ARMv8 Multi-Core." SC. 2021. http://jianbinfang.github.io/files/2021-06-22-sc.pdf
This article presents LibShalom, an open-source library for optimizing small and irregular-shaped GEMMs, explicitly targeting the ARMv8 architecture. LibShalom builds upon the classical Goto algorithm but tailors it to minimize the expensive memory access overhead of data packing and of processing small matrices. It uses analytic methods to determine GEMM kernel optimization parameters, enhancing the computation and parallelization efficiency of the GEMM kernels.