. Bibliographie, sourceforge.net/. [2] MICO, an OpenSource CORBA implementation

A. Agbaria and R. Friedman, Starfish: fault-tolerant dynamic MPI programs on clusters of workstations, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469), pp.167-176, 1999.
DOI : 10.1109/HPDC.1999.805295

L. Amar, A. Barak, and A. Shiloh, The MOSIX Parallel I/O System for Scalable I/O Performance, Proc. 14-th IASTED International Conference on Parallel and Distributed Computing and Systems, pp.495-500, 2002.

G. Antoniu, L. Boug, and R. Namyst, An efficient and transparent thread migration scheme in the PM2 runtime system, Parallel and Distributed Processing. Proc. 3rd Workshop on Runtime Systems for Parallel Programming (RTSPP '99), volume 1586 of Lect. Notes in Comp. Science, pp.496-510, 1999.
DOI : 10.1007/BFb0097934

URL : https://hal.archives-ouvertes.fr/inria-00073068

Y. Aridor, M. Factor, and A. Teperman, cJVM: a single system image of a JVM on a cluster, Proceedings of the 1999 International Conference on Parallel Processing, pp.4-11, 1999.
DOI : 10.1109/ICPP.1999.797382

O. Aumage, Madeleine : une interface de communication performante et portable pour exploiter les interconnexions hétérogènes de grappes, 2002.

O. Aumage, L. Bougé, A. Denis, J. Méhaut, G. Mercier et al., Madeleine II: a portable and efficient communication library for high-performance cluster computing, Proceedings IEEE International Conference on Cluster Computing. CLUSTER 2000, pp.78-87, 2000.
DOI : 10.1109/CLUSTR.2000.889004

R. Badrinath, C. Morin, and G. Vallée, Checkpointing and recovery of shared memory parallel applications in a cluster, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings., pp.471-477, 2003.
DOI : 10.1109/CCGRID.2003.1199403

URL : https://hal.archives-ouvertes.fr/hal-01272453

R. Badrinath, C. Morin, and G. Vallée, Checkpointing and recovery of shared memory parallel applications in a cluster, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings., 2003.
DOI : 10.1109/CCGRID.2003.1199403

URL : https://hal.archives-ouvertes.fr/hal-01272453

J. Basney and M. Livny, Managing network resources in Condor, Proceedings the Ninth International Symposium on High-Performance Distributed Computing, pp.298-299, 2000.
DOI : 10.1109/HPDC.2000.868666

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.111.9638

R. Batchu, J. P. Neelamegam, Z. Cui, M. Beddhu, A. Skjellum et al., MPI/FT/sup TM/: architecture and taxonomies for fault-tolerant, message-passing middleware for performance-portable parallel computing, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid, pp.26-33, 2001.
DOI : 10.1109/CCGRID.2001.923171

A. D. Birrell and B. J. Nelson, Implementing remote procedure calls, Proceedings of the ACM Symposium on Operating System Principles, 1983.

N. J. Boden, D. Cohen, R. E. Felderman, A. E. Kulawik, C. L. Seitz et al., Myrinet: a gigabit-per-second local area network, IEEE Micro, vol.15, issue.1, pp.29-36, 1995.
DOI : 10.1109/40.342015

G. Bosilca, A. Bouteiller, F. Cappello, G. Djilali, C. Fedak et al., MPICH-V: Toward a Scalable Fault Tolerant MPI for Volatile Nodes, ACM/IEEE SC 2002 Conference (SC'02), 2002.
DOI : 10.1109/SC.2002.10048

URL : https://hal.archives-ouvertes.fr/in2p3-00457138

A. Bouteiller, P. Lemarinier, G. Krawezik, and F. Cappello, Coordinated checkpoint versus message log for fault tolerant MPI, Proceedings IEEE International Conference on Cluster Computing CLUSTR-03, 2003.
DOI : 10.1109/CLUSTR.2003.1253321

G. Burns, R. Daoud, and J. Vaigl, LAM : An Open Cluster Environment for MPI, Proceedings of Supercomputing Symposium, pp.379-386, 1994.

R. Butler and E. Lusk, User's guide to the p4 parallel programming system, 1992.

D. Cameron and G. Regnier, The Virtual Interface Architecture, 2002.

J. Casas, D. L. Clark, R. Konuru, S. W. Otto, R. M. Prouty et al., MPVM : A migration transparent version of PVM, Usenix Computing Systems, vol.8, issue.2, pp.171-216, 1995.

J. Chapin, M. Rosenblum, S. Devine, T. Lahiri, D. Teodosiu et al., Hive : fault containment for shared-memory multiprocessors, Proceedings of the fifteenth ACM symposium on Operating systems principles, pp.12-25, 1995.

G. Ciaccio, Optimal communication performance on fast ethernet with GAMMA, IPPS/SPDP Workshops, volume LNCS 1388, pp.534-548
DOI : 10.1007/3-540-64359-1_726

G. Ciaccio and G. Chiola, Low-cost gigabit ethernet at work, International Conference on Cluster Computing (CLUSTER'00), pp.359-360, 2000.

A. F. Díaz, J. Ferreira, J. Ortega, A. Cañas, and A. Prieto, CLIC : Fast Communication on Linux Clusters, International Conference on Cluster Computing (CLUSTER'00), pp.365-366, 2000.

R. Friedman, M. Goldin, A. Itzkovitz, and A. Schuster, MILLIPEDE : Easy parallel programming in available distributed environments. Software Practice and Experience, pp.929-965, 1997.

P. Douglas, D. Ghormley, S. H. Petrou, A. M. Rodrigues, T. E. Vahdat et al., GLUnix : A Global Layer Unix for a network of workstations. Software ? Practice and Experience, pp.929-961, 1998.

F. Giacomini, T. Amundsen, A. Bogaerts, R. Hauser, B. D. Johnsen et al., Low-Level SCI software functional specification

R. Goeckelmann, M. Schoettner, S. Frenz, and P. Schulthess, A kernel running in DSM - design aspects of a distributed operating system, Proceedings IEEE International Conference on Cluster Computing CLUSTR-03, pp.478-482, 2003.
DOI : 10.1109/CLUSTR.2003.1253353

J. Gosling, B. Joy, G. Steele, and G. Bracha, The Java Language Specification, 2000.

J. Yvon, Implementation of page management in Mome, a user-level DSM, Proc. Intl. Workshop on Distributed Shared Memory on Clusters (DSM 2003), pp.479-486, 2003.

W. Gropp, E. Lusk, N. Doss, and A. Skjellum, A highperformance , portable implementation of the mpi message passing interface standard, 2004.

R. Große, R. Butenuth, and H. Heiß, IP over SCI, Proceeding of the IEEE International Conference on Cluster Computing (CLUSTER'00), pp.73-77, 2000.

M. Hayden, The ensemble system, 1998.

C. Huang, O. Lawlor, and L. V. Kalé, Adaptive MPI, Proceedings of the 16th International Workshop on Languages and Compilers for Parallel Computing (LCPC 03), College Station, 2003.
DOI : 10.1007/978-3-540-24644-2_20

M. Itoh, T. Ishizaki, and M. , Accelerated Socket Communication in System Area Networks, Proceeding of the IEEE International Conference on Cluster Computing (CLUSTER'00), pp.357-358, 2000.

R. Jain, Fddi : Current issues and future trends, IEEE Communications Magazine, pp.98-105, 1993.

T. Povl, J. S. Koch, E. Hansen, X. Cecchet, . Rousset et al., Scios : An scibased software distributed shared memory, Proceedings of the 1st Workshop on Software Distributed Shared Memory, pp.20-25, 1999.

R. B. Konuru, S. W. Otto, and J. Walpole, A Migratable User-Level Process Package for PVM, Journal of Parallel and Distributed Computing, vol.40, issue.1, pp.81-102, 1997.
DOI : 10.1006/jpdc.1996.1270

M. E. Kounavis, A. T. Campbell, S. Chou, F. Modoux, J. Vicente et al., The Genesis Kernel: a programming system for spawning network architectures, IEEE Journal on Selected Areas in Communications, vol.19, issue.3, pp.49-73, 2001.
DOI : 10.1109/49.917711

O. S. Lawlor and L. V. Kalé, Supporting dynamic parallel object arrays. Concurrency and Computation : Practice and Experience, pp.371-393, 2003.
DOI : 10.1002/cpe.665

URL : http://aspen.ucs.indiana.edu/CandCPandE/jg2001/C572lawlor/c572lawlor_arrays.pdf

J. Boudec, The Asynchronous Transfer Mode: a tutorial, Computer Networks and ISDN Systems, vol.24, issue.4, pp.279-309, 1992.
DOI : 10.1016/0169-7552(92)90114-6

J. Leon, A. L. Fisher, and P. Steenkiste, Fail-safe PVM : A Portable Package for Distributed Programming with Transparent Recovery

K. Li, Shared Virtual Memory on Loosely Coupled Multiprocessors, 1986.

M. Litzkow and M. Livny, Supporting checkpointing and process migration outside the UNIX kernel, Proceedings of the Winter 1992 USENIX Conference, pp.283-290, 1992.

M. Litzkow, T. Tannenbaum, J. Basney, and M. Livny, Checkpoint and Migration of UNIX Processes in the Condor distributed processing system, 1997.

R. Lottiaux, B. Boissinot, and C. Morin, Openmosix, openssi and kerrighed : a comparative study Rapport de recherche PI-1656, 2004.

R. Lottiaux, Gestion globale de la mémoire physique d'une grappe pour un systèmè a image unique : mise en oeuvre dans le système Gobelins, 2001.

D. Margery, G. Vall-e, R. Lottiaux, C. Morin, and J. Berthou, Kerrighed : a SSI cluster OS running OpenMP, Proc. 5th European Workshop on OpenMP, 2003.
URL : https://hal.archives-ouvertes.fr/hal-01272452

D. Margery, G. Vall-e, R. Lottiaux, C. Morin, and J. Berthou, Kerrighed : a SSI cluster OS running OpenMP, 2003.
URL : https://hal.archives-ouvertes.fr/hal-01272452

C. Morin and R. Lottiaux, Global resource management for high availability and performance in a dsm-based cluster, Proc. of 1st workshop on Software Distributed Shared memory, 1999.
URL : https://hal.archives-ouvertes.fr/inria-00072975

. Myricom, The GM-1 Message Passing System

. Myricom, The GM-2 Message Passing System. http ://www.myri.com/scs. [61] Myricom. Myrinet Express (MX) : A High Performance, Low-Level, Message-Passing Interface for Myrinet

. Myricom, Performance of mpich-gm 1.2.4..8a uniprocessor (up) case (one process per node), 2004.

J. K. Ousterhout, A. R. Cherenson, F. Douglis, M. N. Nelson, and B. B. Welch, The Sprite network operating system, Computer, vol.21, issue.2, pp.23-36, 1988.
DOI : 10.1109/2.16

A. Ross, J. Overbeek, and . Boyle, Portable Programs for Parallel Processors, 1987.

F. Petrini, A. Wu-chun-feng, S. Hoisie, E. Coll, and . Frachtenberg, The Quadrics network: high-performance clustering technology, IEEE Micro, vol.22, issue.1, pp.46-57, 2002.
DOI : 10.1109/40.988689

J. S. Plank, M. Beck, G. Kingsley, and K. Li, Libckpt : Transparent checkpointing under Unix, Usenix Winter Technical Conference, pp.213-223, 1995.

. Linux-infiniband and . Project, Linux system software for the infiniband architecture, 2002.

L. Prylli and B. Tourancheau, BIP: A new protocol designed for high performance networking on Myrinet, 1st Workshop on Personnal Computer based Networks Of Workstations (PC-NOW'98), pp.472-485, 1998.
DOI : 10.1007/3-540-64359-1_721

L. Prylli, B. Tourancheau, and R. Westrelin, An Improved NIC program for high-performance MPI, Workshop on Cluster-Based Computing, Internation Conference in SuperComputing, pp.26-30, 1999.

D. J. Scales and K. Gharachorloo, Towards transparent and efficient software distributed shared memory, Proc. of the 16th ACM Symp. on Operating Systems Principles (SOSP-16), 1997.
DOI : 10.1145/269005.266673

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.78.7410

D. Schmidt, A. Gokale, T. Harrison, and G. Parulkar, A high-performance endsystem architecture for real-time COR BA, IEEE Communication Magazine, vol.14, issue.2, 1997.

I. Schoinas, B. Falsafi, A. R. Lebeck, S. K. Reinhardt, J. R. Larus et al., Fine-grain access control for distributed shared memory, Proc. of the 6th Symp. on Architectural Support for Programming Languages and Operating Systems (ASPLOSVI), pp.297-307, 1994.

A. Selikhov, G. Bosilca, C. Germain, G. Fédak, and F. Cappello, MPICH-CM: A Communication Library Design for a P2P MPI Implementation, EuroPVM/MPI 2002, 2002.
DOI : 10.1007/3-540-45825-5_49

T. Shanley, PCI-X System Architecture, 2003.

T. Shanley and D. Anderson, PCI System Architecture, 2003.

F. John and . Shoch, An introduction to the ethernet specification, SIGCOMM Comput. Commun. Rev, vol.11, issue.3, pp.17-19, 1981.

Q. Snell, A. Mikler, and J. Gustafson, NetPIPE : A Network Protocol Independent Performace Evaluator, IASTED International Conference on Intelligent Information Management and Systems, 1996.

G. Stellner and . Cocheck, Checkpointing and Process Migration for MPI, Proceedings of the 10th International Parallel Processing Symposium (IPPS'96), 1996.

T. Sterling, D. Savarese, D. J. Becker, J. E. Dorband, U. A. Ranawake et al., BEOWULF : A parallel workstation for scientific computation, Proceedings of the 24th International Conference on Parallel Pr ocessing, pp.11-14, 1995.

R. Stevens, Unix Network Programming Networking APIs : Sockets and XTI, 1998.

R. Stevens, Unix network programming, ACM SIGCOMM Computer Communication Review, vol.20, issue.2, 1999.
DOI : 10.1145/378570.378600

F. Sultan, A. Bohra, P. Gallard, S. Neamtiu, Y. Smaldone et al., Recovering Internet Service Sessions from Operating System Failures, IEEE Internet Computing, vol.9, issue.2, 2005.
DOI : 10.1109/MIC.2005.45

F. Sultan, A. Bohra, I. Neamtiu, and L. Iftode, Nonintrusive remote healing using backdoors, Proceedings of First Workshop on Algorithms and Architectures for Self-Managing Systems, in conjunction with ISCA '03, 2003.

F. Sultan, System Support for Service Availability, Remote Healing and Fault Tolerance using Lazy State Propagation, 2004.

S. Inc and . Microsystems, Nfs : Network file system protocol specification, 1989.

V. S. Sunderam, PVM: A framework for parallel distributed computing, Concurrency, Practice and Experience, pp.315-340, 1990.
DOI : 10.1002/cpe.4330020404

T. E. Anderson, D. E. Culler, and D. A. Patterson, A case for NOW (Networks of Workstations), IEEE Micro, vol.15, issue.1, pp.54-64, 1995.
DOI : 10.1109/40.342018

B. Tourancheau and R. Westrelin, Study of Medium Message Performance of BIP/Myrinet, Proceeding of the IEEE International Conference on Cluster Computing (CLUSTER'00), pp.65-72, 2000.

G. Vallée, C. Morin, J. Berthou, and L. Rilling, A new approach to configurable dynamic scheduling in clusters based on single system image technologies, Proceedings International Parallel and Distributed Processing Symposium, p.91, 2003.
DOI : 10.1109/IPDPS.2003.1213198

G. Vallée, Conception dun ordonnanceur de processus adaptable pour la gestion globale des ressources dans les grappes de calculateurs : mise en oeuvre 150 BIBLIOGRAPHIE dans le système d'exploitation, 2004.

T. Von-eicken, A. Basu, V. Buch, and W. Vogels, U-net : a user-level network interface for parallel and distributed computing, Proceedings of the fifteenth ACM symposium on Operating systems principles, pp.40-53, 1995.

T. Von-eiken, D. Culler, S. Goldstein, and K. Schauser, Active Messages : A Mechanism for Integrated Communication and Computation, Proceeding of the 19th International Symposium on Computer Architecture, 1992.

M. Welsh, A. Basu, and T. Von-eicken, Low-latency communication over Fast Ethernet, In Euro-Par, vol.I, pp.187-194, 1996.
DOI : 10.1007/3-540-61626-8_24

M. Wende, M. Schoettner, O. Schirpf, and P. Schulthess, Network design for the DSM Operating System Plurix, Proceedings of the 3rd Workshop, 1999.