Go to:
Logótipo
Você está em: Start > Publications > View > Couillard: Parallel programming via coarse-grained Data-flow Compilation
Publication

Couillard: Parallel programming via coarse-grained Data-flow Compilation

Title
Couillard: Parallel programming via coarse-grained Data-flow Compilation
Type
Article in International Scientific Journal
Year
2014
Authors
Leandro A J Marzulo
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Tiago A O Alves
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Felipe M G Franca
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Vitor Santos Costa
(Author)
FCUP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page
Journal
Title: Parallel ComputingImported from Authenticus Search for Journal Publications
Vol. 40
Pages: 661-680
ISSN: 0167-8191
Publisher: Elsevier
Scientific classification
FOS: Natural sciences > Computer and information sciences
Other information
Authenticus ID: P-009-ZC7
Abstract (EN): Data-flow is a natural approach to parallelism. However, describing dependencies and control between fine-grained data-flow tasks can be complex and present unwanted overheads. TALM (TALM is an Architecture and Language for Multi-threading) introduces a user-defined coarse-grained parallel data-flow model, where programmers identify code blocks, called super-instructions, to be run in parallel and connect them in a data-flow graph. TALM has been implemented as a hybrid Von Neumann/data-flow execution system: the Trebuchet. We have observed that TALM's usefulness largely depends on how programmers specify and connect super-instructions. Thus, we present Couillard, a full compiler that creates, based on an annotated C-program, a data-flow graph and C-code corresponding to each super-instruction. We show that our toolchain allows one to benefit from data-flow execution and explore sophisticated parallel programming techniques, with small effort. To evaluate our system we have executed a set of real applications on a large multi-core machine. Comparison with popular parallel programming methods shows competitive speedups, while providing an easier parallel programing approach. More specifically, for an application that follows the wavefront method, running with big inputs, Trebuchet achieved up to 4.7% speedup over Intel (R) TBB novel flow-graph approach and up to 44% over OpenMP.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 20
Documents
We could not find any documents associated to the publication.
Related Publications

Of the same journal

Dynamic scheduling of a batch of parallel task jobs on heterogeneous clusters (2011)
Article in International Scientific Journal
Jorge G. Barbosa; Belmiro Moreira
A sleek lock-free hash map in an ERA of safe memory reclamation methods (2025)
Article in International Scientific Journal
Moreno, P; Miguel Areias; Ricardo Rocha
Recommend this page Top
Copyright 1996-2026 © Faculdade de Farmácia da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2026-02-13 at 06:50:39 | Privacy Policy | Personal Data Protection Policy | Whistleblowing | Electronic Yellow Book