sourceforge.net/. [2] MICO, an OpenSource CORBA implementation ,
Starfish: fault-tolerant dynamic MPI programs on clusters of workstations, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469), pp.167-176, 1999. ,
DOI : 10.1109/HPDC.1999.805295
The MOSIX Parallel I/O System for Scalable I/O Performance, Proc. 14-th IASTED International Conference on Parallel and Distributed Computing and Systems, pp.495-500, 2002. ,
An efficient and transparent thread migration scheme in the PM2 runtime system, Parallel and Distributed Processing. Proc. 3rd Workshop on Runtime Systems for Parallel Programming (RTSPP '99), volume 1586 of Lect. Notes in Comp. Science, pp.496-510, 1999. ,
DOI : 10.1007/BFb0097934
URL : https://hal.archives-ouvertes.fr/inria-00073068
cJVM: a single system image of a JVM on a cluster, Proceedings of the 1999 International Conference on Parallel Processing, pp.4-11, 1999. ,
DOI : 10.1109/ICPP.1999.797382
Madeleine : une interface de communication performante et portable pour exploiter les interconnexions hétérogènes de grappes, 2002. ,
Madeleine II: a portable and efficient communication library for high-performance cluster computing, Proceedings IEEE International Conference on Cluster Computing. CLUSTER 2000, pp.78-87, 2000. ,
DOI : 10.1109/CLUSTR.2000.889004
Checkpointing and recovery of shared memory parallel applications in a cluster, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings., pp.471-477, 2003. ,
DOI : 10.1109/CCGRID.2003.1199403
URL : https://hal.archives-ouvertes.fr/hal-01272453
Checkpointing and recovery of shared memory parallel applications in a cluster, CCGrid 2003. 3rd IEEE/ACM International Symposium on Cluster Computing and the Grid, 2003. Proceedings., 2003. ,
DOI : 10.1109/CCGRID.2003.1199403
URL : https://hal.archives-ouvertes.fr/hal-01272453
Managing network resources in Condor, Proceedings the Ninth International Symposium on High-Performance Distributed Computing, pp.298-299, 2000. ,
DOI : 10.1109/HPDC.2000.868666
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.111.9638
MPI/FT/sup TM/: architecture and taxonomies for fault-tolerant, message-passing middleware for performance-portable parallel computing, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid, pp.26-33, 2001. ,
DOI : 10.1109/CCGRID.2001.923171
Implementing remote procedure calls, Proceedings of the ACM Symposium on Operating System Principles, 1983. ,
Myrinet: a gigabit-per-second local area network, IEEE Micro, vol.15, issue.1, pp.29-36, 1995. ,
DOI : 10.1109/40.342015
MPICH-V: Toward a Scalable Fault Tolerant MPI for Volatile Nodes, ACM/IEEE SC 2002 Conference (SC'02), 2002. ,
DOI : 10.1109/SC.2002.10048
URL : https://hal.archives-ouvertes.fr/in2p3-00457138
Coordinated checkpoint versus message log for fault tolerant MPI, Proceedings IEEE International Conference on Cluster Computing CLUSTR-03, 2003. ,
DOI : 10.1109/CLUSTR.2003.1253321
LAM : An Open Cluster Environment for MPI, Proceedings of Supercomputing Symposium, pp.379-386, 1994. ,
User's guide to the p4 parallel programming system, 1992. ,
The Virtual Interface Architecture, 2002. ,
MPVM : A migration transparent version of PVM, Usenix Computing Systems, vol.8, issue.2, pp.171-216, 1995. ,
Hive : fault containment for shared-memory multiprocessors, Proceedings of the fifteenth ACM symposium on Operating systems principles, pp.12-25, 1995. ,
Optimal communication performance on fast ethernet with GAMMA, IPPS/SPDP Workshops, volume LNCS 1388, pp.534-548 ,
DOI : 10.1007/3-540-64359-1_726
Low-cost gigabit ethernet at work, International Conference on Cluster Computing (CLUSTER'00), pp.359-360, 2000. ,
CLIC : Fast Communication on Linux Clusters, International Conference on Cluster Computing (CLUSTER'00), pp.365-366, 2000. ,
MILLIPEDE : Easy parallel programming in available distributed environments. Software Practice and Experience, pp.929-965, 1997. ,
GLUnix : A Global Layer Unix for a network of workstations. Software ? Practice and Experience, pp.929-961, 1998. ,
Low-Level SCI software functional specification ,
A kernel running in DSM - design aspects of a distributed operating system, Proceedings IEEE International Conference on Cluster Computing CLUSTR-03, pp.478-482, 2003. ,
DOI : 10.1109/CLUSTR.2003.1253353
The Java Language Specification, 2000. ,
Implementation of page management in Mome, a user-level DSM, Proc. Intl. Workshop on Distributed Shared Memory on Clusters (DSM 2003), pp.479-486, 2003. ,
A highperformance , portable implementation of the mpi message passing interface standard, 2004. ,
IP over SCI, Proceeding of the IEEE International Conference on Cluster Computing (CLUSTER'00), pp.73-77, 2000. ,
The ensemble system, 1998. ,
Adaptive MPI, Proceedings of the 16th International Workshop on Languages and Compilers for Parallel Computing (LCPC 03), College Station, 2003. ,
DOI : 10.1007/978-3-540-24644-2_20
Accelerated Socket Communication in System Area Networks, Proceeding of the IEEE International Conference on Cluster Computing (CLUSTER'00), pp.357-358, 2000. ,
Fddi : Current issues and future trends, IEEE Communications Magazine, pp.98-105, 1993. ,
Scios : An scibased software distributed shared memory, Proceedings of the 1st Workshop on Software Distributed Shared Memory, pp.20-25, 1999. ,
A Migratable User-Level Process Package for PVM, Journal of Parallel and Distributed Computing, vol.40, issue.1, pp.81-102, 1997. ,
DOI : 10.1006/jpdc.1996.1270
The Genesis Kernel: a programming system for spawning network architectures, IEEE Journal on Selected Areas in Communications, vol.19, issue.3, pp.49-73, 2001. ,
DOI : 10.1109/49.917711
Supporting dynamic parallel object arrays. Concurrency and Computation : Practice and Experience, pp.371-393, 2003. ,
DOI : 10.1002/cpe.665
URL : http://aspen.ucs.indiana.edu/CandCPandE/jg2001/C572lawlor/c572lawlor_arrays.pdf
The Asynchronous Transfer Mode: a tutorial, Computer Networks and ISDN Systems, vol.24, issue.4, pp.279-309, 1992. ,
DOI : 10.1016/0169-7552(92)90114-6
Fail-safe PVM : A Portable Package for Distributed Programming with Transparent Recovery ,
Shared Virtual Memory on Loosely Coupled Multiprocessors, 1986. ,
Supporting checkpointing and process migration outside the UNIX kernel, Proceedings of the Winter 1992 USENIX Conference, pp.283-290, 1992. ,
Checkpoint and Migration of UNIX Processes in the Condor distributed processing system, 1997. ,
Openmosix, openssi and kerrighed : a comparative study Rapport de recherche PI-1656, 2004. ,
Gestion globale de la mémoire physique d'une grappe pour un systèmè a image unique : mise en oeuvre dans le système Gobelins, 2001. ,
Kerrighed : a SSI cluster OS running OpenMP, Proc. 5th European Workshop on OpenMP, 2003. ,
URL : https://hal.archives-ouvertes.fr/hal-01272452
Kerrighed : a SSI cluster OS running OpenMP, 2003. ,
URL : https://hal.archives-ouvertes.fr/hal-01272452
Global resource management for high availability and performance in a dsm-based cluster, Proc. of 1st workshop on Software Distributed Shared memory, 1999. ,
URL : https://hal.archives-ouvertes.fr/inria-00072975
The GM-1 Message Passing System ,
The GM-2 Message Passing System. http ://www.myri.com/scs. [61] Myricom. Myrinet Express (MX) : A High Performance, Low-Level, Message-Passing Interface for Myrinet ,
Performance of mpich-gm 1.2.4..8a uniprocessor (up) case (one process per node), 2004. ,
The Sprite network operating system, Computer, vol.21, issue.2, pp.23-36, 1988. ,
DOI : 10.1109/2.16
Portable Programs for Parallel Processors, 1987. ,
The Quadrics network: high-performance clustering technology, IEEE Micro, vol.22, issue.1, pp.46-57, 2002. ,
DOI : 10.1109/40.988689
Libckpt : Transparent checkpointing under Unix, Usenix Winter Technical Conference, pp.213-223, 1995. ,
Linux system software for the infiniband architecture, 2002. ,
BIP: A new protocol designed for high performance networking on Myrinet, 1st Workshop on Personnal Computer based Networks Of Workstations (PC-NOW'98), pp.472-485, 1998. ,
DOI : 10.1007/3-540-64359-1_721
An Improved NIC program for high-performance MPI, Workshop on Cluster-Based Computing, Internation Conference in SuperComputing, pp.26-30, 1999. ,
Towards transparent and efficient software distributed shared memory, Proc. of the 16th ACM Symp. on Operating Systems Principles (SOSP-16), 1997. ,
DOI : 10.1145/269005.266673
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.78.7410
A high-performance endsystem architecture for real-time COR BA, IEEE Communication Magazine, vol.14, issue.2, 1997. ,
Fine-grain access control for distributed shared memory, Proc. of the 6th Symp. on Architectural Support for Programming Languages and Operating Systems (ASPLOSVI), pp.297-307, 1994. ,
MPICH-CM: A Communication Library Design for a P2P MPI Implementation, EuroPVM/MPI 2002, 2002. ,
DOI : 10.1007/3-540-45825-5_49
PCI-X System Architecture, 2003. ,
PCI System Architecture, 2003. ,
An introduction to the ethernet specification, SIGCOMM Comput. Commun. Rev, vol.11, issue.3, pp.17-19, 1981. ,
NetPIPE : A Network Protocol Independent Performace Evaluator, IASTED International Conference on Intelligent Information Management and Systems, 1996. ,
Checkpointing and Process Migration for MPI, Proceedings of the 10th International Parallel Processing Symposium (IPPS'96), 1996. ,
BEOWULF : A parallel workstation for scientific computation, Proceedings of the 24th International Conference on Parallel Pr ocessing, pp.11-14, 1995. ,
Unix Network Programming Networking APIs : Sockets and XTI, 1998. ,
Unix network programming, ACM SIGCOMM Computer Communication Review, vol.20, issue.2, 1999. ,
DOI : 10.1145/378570.378600
Recovering Internet Service Sessions from Operating System Failures, IEEE Internet Computing, vol.9, issue.2, 2005. ,
DOI : 10.1109/MIC.2005.45
Nonintrusive remote healing using backdoors, Proceedings of First Workshop on Algorithms and Architectures for Self-Managing Systems, in conjunction with ISCA '03, 2003. ,
System Support for Service Availability, Remote Healing and Fault Tolerance using Lazy State Propagation, 2004. ,
Nfs : Network file system protocol specification, 1989. ,
PVM: A framework for parallel distributed computing, Concurrency, Practice and Experience, pp.315-340, 1990. ,
DOI : 10.1002/cpe.4330020404
A case for NOW (Networks of Workstations), IEEE Micro, vol.15, issue.1, pp.54-64, 1995. ,
DOI : 10.1109/40.342018
Study of Medium Message Performance of BIP/Myrinet, Proceeding of the IEEE International Conference on Cluster Computing (CLUSTER'00), pp.65-72, 2000. ,
A new approach to configurable dynamic scheduling in clusters based on single system image technologies, Proceedings International Parallel and Distributed Processing Symposium, p.91, 2003. ,
DOI : 10.1109/IPDPS.2003.1213198
Conception dun ordonnanceur de processus adaptable pour la gestion globale des ressources dans les grappes de calculateurs : mise en oeuvre 150 BIBLIOGRAPHIE dans le système d'exploitation, 2004. ,
U-net : a user-level network interface for parallel and distributed computing, Proceedings of the fifteenth ACM symposium on Operating systems principles, pp.40-53, 1995. ,
Active Messages : A Mechanism for Integrated Communication and Computation, Proceeding of the 19th International Symposium on Computer Architecture, 1992. ,
Low-latency communication over Fast Ethernet, In Euro-Par, vol.I, pp.187-194, 1996. ,
DOI : 10.1007/3-540-61626-8_24
Network design for the DSM Operating System Plurix, Proceedings of the 3rd Workshop, 1999. ,