Publication: Efficient Hardware Implementation of Convolution Layers Using Multiply-Accumulate Blocks
Loading...
Date
Advisor
Journal Title
Journal ISSN
Volume Title
Publisher
IEEE
Type
Abstract
In this paper, we propose an efficient method to realize a convolution layer of the convolution neural networks (CNNs). Inspired by the fully-connected neural network architecture, we introduce an efficient computation approach to implement convolution operations. Also, to reduce hardware complexity, we implement convolutional layers under the time-multiplexed architecture where computing resources are re-used in the multiply-accumulate (MAC) blocks. A comprehensive evaluation of convolution layers shows using our proposed method when compared to the conventional MAC-based method results up to 97% and 50% reduction in dissipated power and computation time, respectively.