Você está em: Start > Publications > View > Generation of Customized Accelerators for Loop Pipelining of Binary Instruction Traces

Map of Premises

Publication

Publication Search

Publications

Generation of Customized Accelerators for Loop Pipelining of Binary Instruction Traces

Title

Generation of Customized Accelerators for Loop Pipelining of Binary Instruction TracesExport publication in the APA format Export publication in the EXCEL format Export publication in the RIS format

Type

Article in International Scientific Journal

Date

2017

Title

Generation of Customized Accelerators for Loop Pipelining of Binary Instruction Traces

Type

Article in International Scientific Journal

Year

2017

Authors

Nuno Paulino

(Author)

FEUP

View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page

João Canas Ferreira

(Author)

FEUP

View Personal Page Send message Search for Participant Publications View Authenticus page View ORCID page

João M. P. Cardoso

(Author)

FEUP

View Personal Page Send message Search for Participant Publications View Authenticus page View ORCID page

Journal

Title: IEEE Transactions on Very Large Scale Integration (VLSI) SystemsImported from Authenticus Search for Journal Publications

Vol. 25

Pages: 21-34

ISSN: 1063-8210

Publisher: IEEE

Indexing

ISI Web of Knowledge - 6 Citations

Scopus - 6 Citations

Other information

Authenticus ID: P-00M-AM7

DOI: 10.1109/tvlsi.2016.2573640

Abstract (EN): Many embedded applications process large amounts of data using regular computational kernels, amenable to acceleration by specialized hardware coprocessors. To reduce the significant design effort, the dedicated hardware may be automatically generated, usually starting from the application's source or binary code. This paper presents a moduloscheduled loop accelerator capable of executing multiple loops and a supporting toolchain. A generation/scheduling procedure, which fully relies on MicroBlaze instruction traces, produces accelerator instances, customized in terms of functional units and interconnections. The accelerators support integer and single-precision floating-point arithmetic, and exploit instruction-level parallelism, loop pipelining, and memory access parallelism via two read/write ports. A complete implementation of the proposed architecture is evaluated in a Virtex-7 device. Augmenting a MicroBlaze processor with a tailored accelerator achieves a geometric mean speedup, over software-only execution, of 6.61x for 13 floating-point kernels from the Livermore Loops set, and of 4.08x for 11 integer kernels from Texas Instruments' IMGLIB. The proposed customized accelerators are compared with ALU-based ones. The average specialized accelerator requires only 0.47x the number of field-programmable gate array slices of an accelerator with four ALUs. A geometric mean speedup of 1.78x over a four-issue very long instruction word (without floating-point support) was obtained for the integer kernels.

Language: English

Type (Professor's evaluation): Scientific

No. of pages: 14

Documents

We could not find any documents associated to the publication.

Related Publications

Of the same authors

Transparent Trace-Based Binary Acceleration for Reconfigurable HW/SW Systems (2013)
Article in International Scientific Journal
João Bispo; Nuno Paulino; João Cardoso; João Canas Ferreira

Transparent runtime migration of loop-based traces of processor instructions to reconfigurable processing units (2013)
Article in International Scientific Journal
João Bispo; Nuno Paulino; João Cardoso; João Canas Ferreira

Optimizing OpenCL Code for Performance on FPGA: k-Means Case Study With Integer Data Sets (2020)
Article in International Scientific Journal
Nuno Paulino; João Canas Ferreira; João M. P. Cardoso

Improving Performance and Energy Consumption in Embedded Systems via Binary Acceleration: A Survey (2020)
Article in International Scientific Journal
Nuno Paulino; João Canas Ferreira; João M. P. Cardoso

Dynamic Partial Reconfiguration of Customized Single-Row Accelerators (2019)
Article in International Scientific Journal
Nuno Paulino; João Canas Ferreira; João M. P. Cardoso

See all (12)

Of the same journal

Reliability and availability in reconfigurable computing: A basis for a common solution (2008)
Article in International Scientific Journal
Manuel G. Gericota; Gustavo R. Alves; Miguel L. Silva; José M. Ferreira

Dynamic Partial Reconfiguration of Customized Single-Row Accelerators (2019)
Article in International Scientific Journal
Nuno Paulino; João Canas Ferreira; João M. P. Cardoso

Recommend this page Top

Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto I Terms and Conditions I Acessibility I Index A-Z
Page created on: 2025-10-23 at 00:00:54 | Privacy Policy | Personal Data Protection Policy | Whistleblowing | Electronic Yellow Book