Skip to Main content Skip to Navigation

Tolérance aux fautes et reconfiguration dynamique pour les applications distribuées à grande échelle

Xavier Besseron 1 
1 MOAIS - PrograMming and scheduling design fOr Applications in Interactive Simulation
Inria Grenoble - Rhône-Alpes, LIG - Laboratoire d'Informatique de Grenoble
Abstract : This work deals with high performance computing on large scale platforms like computing grids. Computing grids are characterized by (1) frequent changes in execution context and, especially, by (2) a high failure probability caused by the large number of components. Running an application efficiently in such an environment requires to consider these parameters. Our research work is based on the abstract representation of the application as a data flow graph from the parallel and distributed programming model Athapascan/Kaapi. This abstract representation is used to provide solutions for (1) dynamic reconfiguration and (2) fault tolerance issues. - First, we propose a dynamic reconfiguration mechanism that manages, transparently for the reconfiguration programmer, concurrent operations on the application state and mutual consistency of states for distributed reconfiguration. - Secondly, we present an original fault tolerance protocol that allows partial rollback of the application in case of failure. For this purpose, the set of strictly required computation tasks to recover is computed. These contributions are evaluated through the Kaapi and X-Kaapi software on the Grid'5000 computing platform.
Document type :
Complete list of metadata
Contributor : Xavier Besseron Connect in order to contact the contributor
Submitted on : Thursday, May 27, 2010 - 1:32:38 PM
Last modification on : Wednesday, July 6, 2022 - 4:17:32 AM
Long-term archiving on: : Friday, October 19, 2012 - 3:05:31 PM


  • HAL Id : tel-00486939, version 1


Xavier Besseron. Tolérance aux fautes et reconfiguration dynamique pour les applications distribuées à grande échelle. Informatique [cs]. Institut National Polytechnique de Grenoble - INPG, 2010. Français. ⟨tel-00486939⟩



Record views


Files downloads