
Adéquation Algorithme Architecture et modèle de programmation pour l'implémentation d'algorithmes de traitement du signal et de l'image sur cluster multi-GPU

Abstract : Originally designed to relieve the CPU from graphics rendering tasks, the GPU has become a massively parallel architecture suitable for processing large amounts of data. While it has won a significant market share in the High Performance Computing domain, an Algorithm-Architecture Matching approach is still necessary to efficiently implement an algorithm on a GPU.

The contribution of this thesis is twofold. Firstly, we present the significant gain provided by the implementation of an optimized granulometry algorithm (computation time decreases from several hours to less than a minute for a volume of 1024³ voxels). An analytical model establishing the performance variations of the granulometry application is also presented. We believe it can be extended to other regular algorithms.

Secondly, deploying Signal and Image processing applications on a multi-GPU cluster can be a tedious task for the programmer. To ease this task, we developed a library that reduces the scope of the programmer's contribution to the development. The programmer's remaining tasks are decomposing the application into a Data Flow Graph and giving mapping annotations so that the tool can automatically dispatch tasks to the processing elements (GPP or GPU). The throughput of a visual saliency streaming application is then improved thanks to the efficient implementation brought by our tool on a multi-GPU cluster. To permit dynamic load balancing, a task migration method has also been incorporated into it.
Document type :
Theses

Cited literature [109 references]

https://tel.archives-ouvertes.fr/tel-00876668
Contributor : Abes Star
Submitted on : Friday, October 25, 2013 - 11:12:12 AM
Last modification on : Thursday, November 19, 2020 - 12:59:58 PM
Long-term archiving on : Monday, January 27, 2014 - 12:56:59 PM

File

BOULOS_2013_archivage.pdf
Version validated by the jury (STAR)

Identifiers

  • HAL Id : tel-00876668, version 1

Citation

Vincent Boulos. Adéquation Algorithme Architecture et modèle de programmation pour l'implémentation d'algorithmes de traitement du signal et de l'image sur cluster multi-GPU. Other. Université de Grenoble, 2012. French. ⟨NNT : 2012GRENT099⟩. ⟨tel-00876668⟩

Metrics

Record views : 1292
File downloads : 4012