Você está em: Início > Publicações > Visualização > Generation of Customized Accelerators for Loop Pipelining of Binary Instruction Traces

Mapa das Instalações

Publicação

Pesquisa de Publicações

Publicações

Generation of Customized Accelerators for Loop Pipelining of Binary Instruction Traces

Título

Generation of Customized Accelerators for Loop Pipelining of Binary Instruction TracesExportar publicação no formato APA Exportar publicação no formato EXCEL Exportar publicação no formato RIS

Tipo

Artigo em Revista Científica Internacional

Data

2017

Título

Generation of Customized Accelerators for Loop Pipelining of Binary Instruction Traces

Tipo

Artigo em Revista Científica Internacional

Ano

2017

Autores

Nuno Paulino

(Autor)

FEUP

Ver página pessoal Sem permissões para visualizar e-mail institucional Pesquisar Publicações do Participante Ver página do Authenticus Ver página ORCID

João Canas Ferreira

(Autor)

FEUP

Ver página pessoal Enviar mensagem Pesquisar Publicações do Participante Ver página do Authenticus Ver página ORCID

João M. P. Cardoso

(Autor)

FEUP

Ver página pessoal Enviar mensagem Pesquisar Publicações do Participante Ver página do Authenticus Ver página ORCID

Revista

Título: IEEE Transactions on Very Large Scale Integration (VLSI) SystemsImportada do Authenticus Pesquisar Publicações da Revista

Vol. 25

Páginas: 21-34

ISSN: 1063-8210

Editora: IEEE

Indexação

ISI Web of Knowledge - 6 Citações

Scopus - 6 Citações

Outras Informações

ID Authenticus: P-00M-AM7

DOI: 10.1109/tvlsi.2016.2573640

Abstract (EN): Many embedded applications process large amounts of data using regular computational kernels, amenable to acceleration by specialized hardware coprocessors. To reduce the significant design effort, the dedicated hardware may be automatically generated, usually starting from the application's source or binary code. This paper presents a moduloscheduled loop accelerator capable of executing multiple loops and a supporting toolchain. A generation/scheduling procedure, which fully relies on MicroBlaze instruction traces, produces accelerator instances, customized in terms of functional units and interconnections. The accelerators support integer and single-precision floating-point arithmetic, and exploit instruction-level parallelism, loop pipelining, and memory access parallelism via two read/write ports. A complete implementation of the proposed architecture is evaluated in a Virtex-7 device. Augmenting a MicroBlaze processor with a tailored accelerator achieves a geometric mean speedup, over software-only execution, of 6.61x for 13 floating-point kernels from the Livermore Loops set, and of 4.08x for 11 integer kernels from Texas Instruments' IMGLIB. The proposed customized accelerators are compared with ALU-based ones. The average specialized accelerator requires only 0.47x the number of field-programmable gate array slices of an accelerator with four ALUs. A geometric mean speedup of 1.78x over a four-issue very long instruction word (without floating-point support) was obtained for the integer kernels.

Idioma: Inglês

Tipo (Avaliação Docente): Científica

Nº de páginas: 14

Documentos

Não foi encontrado nenhum documento associado à publicação.

Publicações Relacionadas

Dos mesmos autores

Transparent Trace-Based Binary Acceleration for Reconfigurable HW/SW Systems (2013)
Artigo em Revista Científica Internacional
João Bispo; Nuno Paulino; João Cardoso; João Canas Ferreira

Transparent runtime migration of loop-based traces of processor instructions to reconfigurable processing units (2013)
Artigo em Revista Científica Internacional
João Bispo; Nuno Paulino; João Cardoso; João Canas Ferreira

Optimizing OpenCL Code for Performance on FPGA: k-Means Case Study With Integer Data Sets (2020)
Artigo em Revista Científica Internacional
Nuno Paulino; João Canas Ferreira; João M. P. Cardoso

Improving Performance and Energy Consumption in Embedded Systems via Binary Acceleration: A Survey (2020)
Artigo em Revista Científica Internacional
Nuno Paulino; João Canas Ferreira; João M. P. Cardoso

Dynamic Partial Reconfiguration of Customized Single-Row Accelerators (2019)
Artigo em Revista Científica Internacional
Nuno Paulino; João Canas Ferreira; João M. P. Cardoso

Ver todas (12)

Da mesma revista

Reliability and availability in reconfigurable computing: A basis for a common solution (2008)
Artigo em Revista Científica Internacional
Manuel G. Gericota; Gustavo R. Alves; Miguel L. Silva; José M. Ferreira

Dynamic Partial Reconfiguration of Customized Single-Row Accelerators (2019)
Artigo em Revista Científica Internacional
Nuno Paulino; João Canas Ferreira; João M. P. Cardoso

Recomendar Página Voltar ao Topo

Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto I Termos e Condições I Acessibilidade I Índice A-Z
Página gerada em: 2025-11-18 às 14:44:39 | Política de Privacidade | Política de Proteção de Dados Pessoais | Denúncias | Livro Amarelo Eletrónico