Sampling-based Sparse Format Selection on GPUs

Gangyi Zhu, Gagan Agrawal

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

Sparse Matrix-Vector Multiplication (SpMV) is an important kernel in numerous computational disciplines. The overall performance of SpMV is highly dependent on the storage format of the sparse matrix. This has led to much interest in recent years on automatically choosing the appropriate format, typically using machine learning techniques and training a model using a large number of matrices. However, these methods have limitations in practice-besides the dependency on obtaining a large number of sparse matrices of training and expensive overheads of the training, they usually have limited prediction ability across architectures. In this paper, we take a very distinct approach to the same problem. This approach involves obtaining samples from the original matrix, executing the kernel using these samples, and selecting the best format. However, our approach requires obtaining representative samples that can help understand performance associated with using a specific format on the full matrix, which turns out to be challenging. Based on the storage properties and processing granularity associated with different formats, we develop three novel sampling schemes: Row Cropping sampling, Random Warp sampling, and Diagonal Aligning sampling. These sampling methods are designed by observing that certain factors tend to be critical for performance associated with a particular format, and thus preserving that factor through sampling. Experimental results using nearly 2000 matrices demonstrate that our approach delivers high efficiency without the expensive training process, and it is easy to migrate across architectures. At the same time, our approach achieves comparable prediction accuracy with the state-of-art methodologies, and even outperforms them in certain cases (especially for predicting on some of the largest matrices we use). Through our work, we also offer new insights into the performance achieved using different formats on GPUs.

Original languageEnglish (US)
Title of host publicationProceedings - 2021 IEEE 33rd International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2021
PublisherIEEE Computer Society
Pages198-208
Number of pages11
ISBN (Electronic)9781665443012
DOIs
StatePublished - 2021
Event33rd IEEE International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2021 - Virtual, Online, Brazil
Duration: Oct 26 2021Oct 29 2021

Publication series

NameProceedings - Symposium on Computer Architecture and High Performance Computing
ISSN (Print)1550-6533

Conference

Conference33rd IEEE International Symposium on Computer Architecture and High Performance Computing, SBAC-PAD 2021
Country/TerritoryBrazil
CityVirtual, Online
Period10/26/2110/29/21

Keywords

  • GPUs Sampling
  • Sparse Computations

ASJC Scopus subject areas

  • Hardware and Architecture
  • Software

Fingerprint

Dive into the research topics of 'Sampling-based Sparse Format Selection on GPUs'. Together they form a unique fingerprint.

Cite this