14th International Conference on Computer and Knowledge Engineering (ICCKE), 2024-11-19

Title: Improve The Utility Of Tensor Cores By Compacting Sparse Matrix Technique

Authors: Mohammad Sediq Abazari, Mahsa Zahedi, Abdorreza Savadi


Citation: BibTeX | EndNote

Abstract

Neural networks have demanding computational requirements, particularly for matrix multiplication. To address this challenge, we propose a model that combines network pruning with matrix compression. Our approach leverages NVIDIA's tensor cores, which excel at efficient matrix operations. We compress the network weights to match the tensor core structure and perform convolutions on the tensor cores using the compressed weight matrix. The model incorporates neural network pruning, mixed-precision training, and compression of the network weight tensors using the im2col algorithm and the CSR (compressed sparse row) format. We also use tensor kernels with a block size of 16x16 for multiplication. We evaluate several models: a pruned model, an AMP-optimized (automatic mixed precision) model, a model combining pruning and AMP, and our proposed model. Our evaluation shows a significant performance improvement over a simple baseline model. Through an extensive analysis of related work, we establish foundational concepts, present our proposed model, and share the obtained results.
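
The pipeline named in the abstract (im2col unfolding, CSR compression of pruned weights, then a matrix product) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names `im2col`, `to_csr`, and `csr_matmul` are hypothetical, the example uses a single-channel image with stride 1 and no padding, and it models neither the 16x16 tensor core tiling nor mixed precision.

```python
def im2col(image, kh, kw):
    """Unfold each kh x kw patch of a 2D image (stride 1, no padding) into a column."""
    h, w = len(image), len(image[0])
    out_h, out_w = h - kh + 1, w - kw + 1
    cols = [[0.0] * (out_h * out_w) for _ in range(kh * kw)]
    for i in range(out_h):
        for j in range(out_w):
            for di in range(kh):
                for dj in range(kw):
                    cols[di * kw + dj][i * out_w + j] = image[i + di][j + dj]
    return cols


def to_csr(dense):
    """Convert a dense matrix to CSR: (values, column indices, row pointers)."""
    values, col_idx, row_ptr = [], [], [0]
    for row in dense:
        for c, v in enumerate(row):
            if v != 0:
                values.append(v)
                col_idx.append(c)
        row_ptr.append(len(values))
    return values, col_idx, row_ptr


def csr_matmul(csr, dense, n_cols):
    """Multiply a CSR matrix by a dense matrix, visiting only nonzero weights."""
    values, col_idx, row_ptr = csr
    out = []
    for r in range(len(row_ptr) - 1):
        acc = [0.0] * n_cols
        for k in range(row_ptr[r], row_ptr[r + 1]):
            w, src = values[k], dense[col_idx[k]]
            for j in range(n_cols):
                acc[j] += w * src[j]
        out.append(acc)
    return out


# Illustrative data (assumed): two pruned 2x2 filters, each flattened to one row.
weights = [[1.0, 0.0, 0.0, -1.0],
           [0.0, 2.0, 0.0, 0.0]]
image = [[1.0, 2.0, 3.0],
         [4.0, 5.0, 6.0],
         [7.0, 8.0, 9.0]]

cols = im2col(image, 2, 2)   # 4 x 4 patch matrix: one column per output pixel
csr = to_csr(weights)        # only the 3 surviving weights are stored
result = csr_matmul(csr, cols, len(cols[0]))  # one row of outputs per filter
```

In this sketch the convolution reduces to a sparse-times-dense product, so pruned weights cost neither storage nor arithmetic. A tensor core version would additionally need the compacted weights packed into dense 16x16 tiles, which is outside the scope of this example.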

Keywords

Tensor Cores, Neural Networks, Convolution Operations, Graphics Processing Unit

@inproceedings{paperid:1102983,
author = {Abazari, Mohammad Sediq and Zahedi, Mahsa and Savadi, Abdorreza},
title = {Improve The Utility Of Tensor Cores By Compacting Sparse Matrix Technique},
booktitle = {14th International Conference on Computer and Knowledge Engineering (ICCKE)},
year = {2024},
location = {Mashhad, Iran},
keywords = {Tensor Cores; Neural Networks; Convolution Operations; Graphics Processing Unit},
}


%0 Conference Proceedings
%T Improve The Utility Of Tensor Cores By Compacting Sparse Matrix Technique
%A Abazari, Mohammad Sediq
%A Zahedi, Mahsa
%A Savadi, Abdorreza
%B 14th International Conference on Computer and Knowledge Engineering (ICCKE)
%D 2024
