O. Aumage, L. Bougé, J. Méhaut, and R. Namyst, Madeleine II: a portable and efficient communication library for high-performance cluster computing, Proceedings IEEE International Conference on Cluster Computing. CLUSTER 2000, pp.607-626, 2002.
DOI : 10.1109/CLUSTR.2000.889004

M. Acacio, O. Canovas, J. M. Garcia, P. E. Lopez-de-teruelagh+93, ]. C. Addison et al., The Performance of MPI Parallel Jacobi Implementation on Workstation Clusters IX Jornadas de Paralelismo The {Genesis}Distributed-memory Benchmarks, Journal Concurrency: Practice and Experience, vol.5, issue.1, pp.1-22, 1993.

D. Ashton, W. Gropp, E. Lusk, R. Ross, and B. Toonen, MPICH2 Design Document 0ct-2002, http://www-unix.mcs.anl.gov/mpi/mpich/adi3/mpich2/mpich2, LogGP: Incorporating Long Messages into the LogP Model for Parallel Computation Journal of Parallel and Distributed Computing, pp.171-79, 1997.

G. R. Andrews and F. B. Schneider, Concepts and Notations for Concurrent Programming, ACM Computing Surveys, vol.15, issue.1, pp.3-43, 1983.
DOI : 10.1145/356901.356903

]. O. Aum02 and . Aumage, Heterogeneous multi-cluster networking with the Madeleine III communication library, Proc. 16th Intl. Parallel and Distributed Processing Symposium, 11th Heterogeneous Computing Workshop, 2002.

S. B. Baden, G. T. Balls, and P. , Colella Applications, scalability and technological change p.50 http://www.psc, 2004.

D. P. Bovet and M. , Cesati Understading the Linux Kernel Chapter, pp.145-146, 2000.

S. Bolis, E. G. Economou, D. Mouzakis, and G. Philokyprou, SBP-Net: an integrated voice/data token ring LAN, Computer Communications, vol.16, issue.8, pp.494-500, 1993.
DOI : 10.1016/0140-3664(93)90064-Y

]. Ber97 and . Bernard, Parallélisation et multiprogrammation pour une application irrégulière de dynamique moléculaire opérationnelle, Thèse de doctorat en mathématiques appliquées, 1997.

T. [. Briat and . Gautier, What is INUKTITUT ? http://www-id.imag.fr/logiciels/inuktitut Athapascan Runtime: Efficiency for Irregular Problems, Proc. of EuroPar'97, 1997.

M. Bernaschi and G. Iannello, Collective Communication Operation : Experimental Results vs Theory. Concurrency : Practical and Experience, 1999.

R. Bariuso and A. Knies, SHMEM's User's Guide, Cray Research, p.2516, 1994.

L. Bougé, J. Méhaut, and R. Namyst, MADELEINE: an efficient and portable communication interface for RPC-based multithreaded environments, Proceedings. 1998 International Conference on Parallel Architectures and Compilation Techniques (Cat. No.98EX192), 1999.
DOI : 10.1109/PACT.1998.727257

A. D. Birrel, B. J. Nelsonbrad02a-]-e, E. G. Bradford, . G. Bradfordbrad02c-]-e, . G. Bradfordbrad02d-]-e et al., Implementing remote procedure calls, Context switching, Part 2 RunTime: High-performance programming techniques on Linux and Windows _IBM developerWorks :: Linux articles Septembre, pp.39-59, 1984.
DOI : 10.1145/2080.357392

H. E. Bal, J. G. Steiner, and A. S. Tanenbaum, Programming Languages for Distributed Systems, IR-147, 1988.

J. [. Bertsekas and . Tsitsiklis, Parallel and Distributed Computation, chapter 5, 6.5 6, High Performance Cluster Computing: Architectures and Systems, p.14, 1989.

R. Buyya, High Performance Cluster Computing: Architectures and Systems, pp.235-238, 1999.

A. S. Carissimi, Athapascan-0: Exploitation de la multiprogrammation légère sur grappes de multiprocesseurs, Thèse de doctorat en informatique, 1999.

G. G. Cavalheiro, Athapascan-1: Interface générique pour l'ordonnancement dans un environnement d'exécution parallèle, Thèse de doctorat en informatique, 1999.

A. S. Charão, I. Charpentier, and B. Plateau, Un environnement modulaire pour l'exploitation des processus légers dans les méthodes de décomposition de domaine, 11ème Rencontres francophones du parallélisme, des architectures et des systèmes, 1999.

. Reuse, CI] Confidence Interval http://www.iut.dk/bibliotek/encyclopedia CC++: A declarative concurrent objectoriented programming notation, Research Directions in Concurrent Object-Oriented Programming, 1993.

D. E. Culler, D. A. Karp, A. Patterson, K. E. Sahay, E. Shauser et al., LogP : Towards a realistic model of parallel computation, Procs. of the 4 th SIGPLAN Symp. on Principles and Practices of Parallel Programming ACM, 1993.

C. C. Neira, J. Leigh, M. Papka, and C. Barnes, Scientists in wonderland: A report on visualization applications in the CAVE virtual reality environment, Proceedings of the Symposium on Research Frontiers in Virtual Reality, pp.59-66, 1993.

A. Denis, PadicoTM : un environnement ouvert pour l'intégration d'exécutifs communiquants, 14èmes Rencontres Francophones du Parallélisme (RenPar'14), pp.99-106, 2002.

I. [. Drepper, . Molnardmn99-]-y, J. Denneulin, R. V. Méhaut, R. Danjean et al., The Native POSIX Thread Library for Linux http Namyst Customizable Thread Scheduling directed by Priorities Controling Kernel Scheduling from User Space: an Approach to Enhancing Applications' Reactivity to I/O Events, Proceedings of the 2003 International Conference on High Performance Computing (HiPC '03), 2003.

M. Doreille, Athapascan-1 Vers un modèle de programmation parallèle adapté au calcul scientifique, Thèse de doctorat en informatique, 1999.

A. Denis, C. Pérez, T. Prioldpp02, ]. A. Denis, C. Pérez et al., Portable parallèle CORBA objects : an approach to combine parallel and distributed programming for grid computing PadicoTM: An Open Integration Framework for Communication Middleware and Runtimes, Proceedings of the Intl. EuroPar'01 conf IEEE International Symposium on Cluster Computing and the Grid (CCGrid2002) Pérez, T. Priol, and A. Ribes. Padico: A Component-Based Software Infrastructure for Grid Computing 17th International Parallel and Distributed Processing Symposium (IPDPS2003), pp.835-844, 2001.

C. [. Dowd and . P. Severanceds03-]-r, High Performance Computing Chapitre 1, Chapitre 6 O'Reilly & Associates Second EditionDimitrov, a.Skjellum A Theoretical Framework for Overlapping of Communication and Computation and Early Binding, part I : BOUM Model and Overlapping Metrics Preprint submitted to Elsevier Science, pp.66-76, 1998.

D. [. Eicken, S. C. Culler, K. E. Goldstein, . R. Schauserekb+92-]-j, S. R. Eykholt et al., Active Message A Mechanism for Integrated Communication and Computation Williams Beyond multiprocessing : multithreading the SunOS kernel, Proceedings of the 19th International Symposium on Computer Architecture Proceedings of the Summer 1992 USENIX Technical Conference and Exhibition, pp.11-18, 1992.

M. Feeley, J. Chase, and E. Lazowska, User-level threads and interprocess communication, 1993.

M. [. Fahringer, P. Haines, and . Mehrotra, On the utility of threads for data parallèle programming, ACM Conference proceedings of the 9 th International Conference on Supercomputing, pp.51-59, 1995.

I. Foster and C. Kesselman, Globus: A metacomputing infrastructure toolkit. The International Journal of Supercomputer Applications and High Performance Computing, pp.115-128, 1997.

I. Foster, C. Kesselman, and S. Tuecke, The Nexus Approach to Integrating Multithreading and Communication, Journal of Parallel and Distributed Computing, vol.37, issue.1, pp.70-82, 1996.
DOI : 10.1006/jpdc.1996.0108

I. Foster and R. Olson, A guide to parallel and distributed programming in nperl, p.95

I. Foster, Designing and Building parallel programs, 1995.

I. Foster, R. Olson, and S. Tuecke, Programming in fortran M, p.93
DOI : 10.2172/10189341

G. Folino, G. Spezzano, and D. Talia, Performance evaluation and modeling of MPI communications on the meiko CS-2, HPCN Europe, pp.932-936, 1998.
DOI : 10.1007/BFb0037237

F. Galilée, Athapascan-1 : interprétation distribuée du flot de données d'un programme parallèle, Thèse de doctorat en informatique, 1999.

D. Grove, P. Coddingtongf00-]-f, J. Garcia, and . Fernandez, Precise MPI performance measurement using MPIBench POSIX Thread Libraries?sid=3184 [Gin97] I. Ginzburg. Athapascan-0b: Intégration efficace et portable de multiprogrammation légère et de communications, Proceedings of HPC Asia Thèse de doctorat en informatique, 1997.

W. Gropp, E. Lusk, N. Doss, and A. Skjellum, A high-performance, portable implementation of the MPI message passing interface standard, Parallel Computing, vol.22, issue.6, pp.789-828, 1996.
DOI : 10.1016/0167-8191(96)00024-5

W. Goralski, TCP/IP applications and protocols, 6N. Atlantic Wharf, 1995.

F. Galilée, J. Roch, G. Cavalheiro, and M. Doreille, Athapascan-1: Online building data flow graph in a parallel language, IEEE, pp.88-95, 1998.

R. Hockney and M. , Berry Public International Benchmarks for Parallel Computers, 71994-02.

M. Haines, D. Cronk, and P. Mehrotra, On the design of Chant, Proceedings of the 1994 ACM/IEEE conference on Supercomputing , Supercomputing '94, pp.350-359157, 1994.
DOI : 10.1145/602770.602834

E. Hermann, Portage de la bibliothèque de communication INUKTITUT sur architecture Itanium et réseau rapide Myrinet, 2004.

L. P. Huse, Collective Communication on Dedicated Cluster of Workstations Proceedings of the 6 th European PVM/MPI User's Group Meeting, pp.469-476, 1999.

G. Iannello, Efficient Algorithms for the Reduce-Scatter Operation in LogGP [Iclus2] I-Cluster2 Presentation http://I-cluster2.inrialpes.fr [IEEE93] IEEE. IEEE Standard for Scalable Coherent Interface (SCI), IEEE Transactions on Parallel and Distributed Systems, vol.8, issue.9, 1993.

G. Iannello, M. Lauria, and S. Mercolino, Cross-platform analysis of fast messages for Myrinet, Proc. Workshop CANPC'98, number 1362 in Lecture Notes in Computer Science, pp.217-231, 1998.
DOI : 10.1007/BFb0052219

T. Kielmann, H. E. Bal, and S. Gorlatch, Bandwidth-efficient collective communication for clustered wide area systems, Proceedings 14th International Parallel and Distributed Processing Symposium. IPDPS 2000, pp.492-499, 2000.
DOI : 10.1109/IPDPS.2000.846026

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.147.859

T. Kielmann, H. E. Bal, and K. Verstoep, Fast Measurement of LogP Parameters for Message Passing Platforms, Lecture Notes in Computer Science, vol.1800, p.1176, 2000.
DOI : 10.1007/3-540-45591-4_162

J. A. Kohl, G. A. Geist, P. M. Papadopoulos, and S. Scott, Beyond pvm 3.4 : What we've learned, what's next, and why, Proceedings of Euro PVM-MPI 97, 1997.

J. Kim, K. Kim, S. M. Jungksss93-]-r, A. Karp, E. E. Sahay et al., Building a high-performance communication layer over virtual interface architecture on Linux clusters, Proceedings of the 15th international conference on Supercomputing , ICS '01, pp.142-153, 1993.
DOI : 10.1145/377792.377858

]. B. Lewis and D. J. , Installation Guide http://www.lam-mpi.org/6.5/install.php [LabID] Laboratoire Informatique et Distribution http, LAM 6.5.9, 2000.

J. Liu, B. Chandrasekaran, J. Wu, W. Jiang, S. Kini et al., Performance comparison of MPI implementations over In_niBand, Myrinet and Quadrics, The International Conference for High Performance Computing and Communications (SC2003), 2003.

]. B. Les93 and . Lester, NJ 1993 [Ler96] X. Leroy GNU LinuxThreads http://pauillac.inria.fr/~xleroy/linuxthreads/index.html [Lin] The Linux documentation Project http://www.ibiblio.org/mdw/index.html [LinP] Performance Measurement of Linux Kernel http, 2003.

M. Mehaut and R. Namyst, Marcel : Une bibliothèque de processus légers [mpich] MPICH -A portable MPI Implementation http, 1995.

C. Martin and O. Richard, Parallal launcher for cluster of pc, Proceedings ParCo, pp.473-480, 2001.

R. Namyst and J. Méhaut, PM2 : Parallel multithreaded machine. A computing environment for distributed architectures

L. [. Nupairoj and . Ni, Performance evaluation of some MPI implementations on workstation clusters, Proceedings Scalable Parallel Libraries Conference, pp.98-105, 1994.
DOI : 10.1109/SPLC.1994.376999

N. Nupairoj and L. M. Ni, Performance Metrics and Measurement Techniques of Collective Communication Services First International Workshop on Communication and Architectural Support for Network-Based Parallel Computing (CANPC'97), pp.212-226, 1997.

M. Pasin, Institut National Polytechnique de Grenoble, France Novembre 1999 [Pig03] L. Pigeon. Conception d'une bibliothèque pour les opérations de communication collective pour le langage de haut niveau Athapascan, 2003.

M. L. Powell, S. R. Kleiman, S. Barton, D. Shah, D. Stein et al., SunOS Multi-Thread Architecture, Proceedings of the Winter 1991 USENIX Technical Conference and Exhibition, pp.65-80, 1991.
DOI : 10.1007/978-1-4612-3192-9_24

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.42.2100

]. L. Prylli and B. Tourancheau, BIP: A new protocol designed for high performance networking on Myrinet, Lecture Notes in Computer Science, vol.1388, pp.472-80, 1998.
DOI : 10.1007/3-540-64359-1_721

]. R. Rab97 and . Rabenseifner, A new optimized MPI reduce algorithm, 1997.

B. Richard, P. Augerat, N. Maillard, S. Derr, S. Martin et al., I- Cluster: Reaching TOP500 performance using mainstream hardwareHPL-2001-206.html [Red] RedHat http://www.redhat.com [RedG] http://graal.ens-lyon.fr/~desprez Revire Ordonnancement de grappe dynamique de tâche sur architecture de grande taille. Régulation par dégénération séquentielle et distribuée Athapascan: API for asynchronous parallel programming, Thèse de doctorat en informatique Santhanam Toward Linux 2.6 A look into the workings of the next new kernelEvaluation of the Communication Performance on a Parallel Processing System" 4th European PVM-MPI Users Group Meeting ---Recent Advances in Parallel Virtual Machine and Message Passing Interface, 1332.

A. Silberschatz, P. B. Galvinshe96, ]. J. Shemitz, and P. K. Sinha, Operating Systems Concepts Using RDTSC for benchmarking code on Pentium computers Distributed Operating Systems Concepts and Design, p.96, 1991.

P. K. Sinha, Distributed Operating Systems Concepts and Design, Sec, vol.8, p.406, 1997.

B. R. De-supinski and J. May, Benchmarking Pthreads Performance, Parallel and Distributed Processing Techniques and Applications PDPTA, 1999.

M. Snir, S. W. Otto, S. H. Lederman, D. W. Walker, and J. Dongarra, MPI: the complete reference, Schlansker, B.R. Rau EPIC: Explicitly Parallel Instruction Computing Computer IEEE, vol.32, issue.2, pp.37-45, 1996.

B. Stroustrup, The C++ Programming Language, 2000.

A. S. Tanenbaum, Architecture de l'ordinateur Chapitre 5 p.257 InterEdition, Tan92a] A.S. Tanenbaum. Modern Operating Systems, 1987.

A. S. Tanenbaum, Modern Operating Systems Prentice Hall Chapter

A. Tanenbaum, Modern Operating Systems, 2001.

A. Tanenbaum, Modern Operating Systems, 2001.

M. Tomassini, Programming with sockets Top 500 supercomputer sites http://www.top500.org Automatically tuned collective communication, Proceedings of SuperComputing2000, pp.39-48, 1991.

S. S. Vadhiyar, G. E. Fagg, and J. Dongarra, Performance modeling for self adapting collective communication for mpi, LACSI Symposium, 2001.

S. Vinoski, New features for CORBA 3.0, Communications of the ACM, vol.41, issue.10, 1998.
DOI : 10.1145/286238.286247

Z. Xu and K. Hwang, Modeling Communication Overhead: MPI and MPL Performance on the IBM SP2, IEEE Parallel & Distributed Technology, pp.9-23, 1996.