Journal of Supercomputing, ( ISI ), Volume (60), No (2), Year (2012-5) , Pages (196-222)

Title : ( Improving performance and energy efficiency of embedded processors via post-fabrication instruction set customization )

Authors: Hamid Noori , Farhad Mehdipour , Koji Inoue , Kazuaki Murakami ,

Access to full-text not allowed by authors

Citation: BibTeX | EndNote

Abstract

Encapsulating critical computation subgraphs as application-specific instruction set extensions is an effective technique to enhance the performance and energy efficiency of embedded processors. However, the addition of custom functional units to the base processor is required to support the execution of custom instructions. Although automated tools have been developed to reduce the long design time needed to produce a new extensible processor for each application, short time-to-market, significant non-recurring engineering and design costs are issues. To address these concerns, we introduce an adaptive extensible processor in which custom instructions are generated and added after chip-fabrication. To support this feature, custom functional units (CFUs) are replaced by a reconfigurable functional unit (RFU). The proposed RFU is based on a matrix of functional units which is multi-cycle with the capability of conditional execution. To generate more effective custom instructions, they are extended over basic blocks and hence, multiple-exits custom instruction and intuition behind it are introduced. Conditional execution capability has been added to the RFU to support the multi-exit feature of custom instructions. Because the proposed RFU has limitations on hardware resources (i.e., connections and processing elements), an integrated mapping-temporal partitioning framework is proposed to guarantee that the generated custom instructions can be mapped on the RFU (mappable custom instructions). Experimental results show that multi-exit custom instructions enhance the performance and energy efficiency by an average of 32% and 3% compared to custom instructions limited to one basic block, respectively. A maximum speedup of 4.9, compared to a single-issue embedded processor, and an average speedup of 1.9 was achieved on MiBench benchmark suite. The maximum and average energy saving are 56% and 22%, respectively. These performance and energy efficiency are obtained at the cost of 30% area overhead.

Keywords

, Reconfigurable processor · Custom instruction · High, performance low, power embedded processors · Reconfigurable functional unit · Conditional execution
برای دانلود از شناسه و رمز عبور پرتال پویا استفاده کنید.

@article{paperid:1023907,
author = {Noori, Hamid and Farhad Mehdipour and Koji Inoue and Kazuaki Murakami},
title = {Improving performance and energy efficiency of embedded processors via post-fabrication instruction set customization},
journal = {Journal of Supercomputing},
year = {2012},
volume = {60},
number = {2},
month = {May},
issn = {0920-8542},
pages = {196--222},
numpages = {26},
keywords = {Reconfigurable processor · Custom instruction · High-performance low-power embedded processors · Reconfigurable functional unit · Conditional execution},
}

[Download]

%0 Journal Article
%T Improving performance and energy efficiency of embedded processors via post-fabrication instruction set customization
%A Noori, Hamid
%A Farhad Mehdipour
%A Koji Inoue
%A Kazuaki Murakami
%J Journal of Supercomputing
%@ 0920-8542
%D 2012

[Download]