S. Agostinelli, Geant4 - a simulation toolkit, Nuclear Instruments and Methods in Physics Research Section A: Accelerators, Spectrometers, Detectors and Associated Equipment, vol.506, issue.3, 2003.
DOI : 10.1016/S0168-9002(03)01368-8

URL : https://hal.archives-ouvertes.fr/in2p3-00020246

M. Aldinucci, FastFlow : High-Level and Efficient Streaming on Multi-core, In : Programming Multicore and Many-core Computing Systems, 2014.

M. Aldinucci, STKM on SCA: A Unified Framework with Components, Workflows and Algorithmic Skeletons, Proceedings of the International Conference on Parallel and Distributed Computing, 2009.
DOI : 10.1007/978-0-387-09455-7_1

B. A. Allan, A Component Architecture for High-Performance Scientific Computing, International Journal of High Performance Computing Applications, 2006.

J. Allard, FlowVR: A Middleware for Large Scale Virtual Reality Applications, Proceedings of the International Conference on Parallel and Distributed Computing. Euro-Par, 2004.
DOI : 10.1007/978-3-540-27866-5_65

URL : https://hal.archives-ouvertes.fr/hal-00085302

S. F. Altschul, Basic Local Alignment Search Tool, Journal of Molecular Biology, 1990.

G. M. Amdahl, Validity of the single processor approach to achieving large scale computing capabilities, Proceedings of the April 18-20, 1967, spring joint computer conference, AFIPS '67 (Spring), 1967.
DOI : 10.1145/1465482.1465560

G. Antoniu, Enabling Transparent Data Sharing in Component Models, Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06), 2006.
DOI : 10.1109/CCGRID.2006.40

URL : https://hal.archives-ouvertes.fr/inria-00070227

C. Augonnet, StarPU : A Unified Platform for Task Scheduling on Heterogeneous Multicore Architectures, Concurrency and Computation : Practice and Experience, Special Issue : Euro-Par, 2009.

O. Aumage, Combining Both a Component Model and a Task-based Model for HPC Applications : A Feasibility Study on Gysela, Proceedings of the IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing. CCGrid'17, 2017.
URL : https://hal.archives-ouvertes.fr/hal-01518730

D. H. Bailey, The NAS Parallel Benchmarks, The International Journal of High Performance Computing Applications, 1991.

A. Basu, M. Bozga, and J. Sifakis, Modeling Heterogeneous Real-time Components in BIP, Fourth IEEE International Conference on Software Engineering and Formal Methods (SEFM'06), 2006.
DOI : 10.1109/SEFM.2006.27

URL : https://hal.archives-ouvertes.fr/hal-00375298

F. Baude, GCM: a grid extension to Fractal for autonomous distributed components, Annals of Telecommunications - annales des télécommunications, vol.36, issue.1, 2009.
DOI : 10.1145/383845.383872

URL : https://hal.archives-ouvertes.fr/inria-00323919

F. Baude, Interactive and descriptor-based deployment of object-oriented grid applications, Proceedings 11th IEEE International Symposium on High Performance Distributed Computing, 2002.
DOI : 10.1109/HPDC.2002.1029907

M. Bauer, Legion: Expressing locality and independence with logical regions, 2012 International Conference for High Performance Computing, Networking, Storage and Analysis, 2012.
DOI : 10.1109/SC.2012.71

S. Benkner, The PEPPHER Approach to Programmability and Performance Portability for Heterogeneous many-core Architectures, Proceedings of the International Conference on Parallel Computing. ParCo'2011. IOS press, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00661320

J. Bézivin, In Search of a Basic Principle for Model Driven Engineering, In : Novatica Journal (Special Issue), 2004.

J. Bigot, Du support générique d'opérateurs de composition dans les modèles de composants logiciels, application au calcul scientifique, Thèse de doct

J. Bigot and C. Pérez, High Performance Composition Operators in Component Models, High Performance Computing : From Grids and Clouds to Exascale. HPC'2011, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00692584

J. Bigot, A low level component model easing performance portability of HPC applications, Computing, vol.4, issue.5, 2014.
DOI : 10.1109/TSE.1976.233837

URL : https://hal.archives-ouvertes.fr/hal-00911231

J. Bigot, Building and Auto-Tuning Computing Kernels : Experimenting with BOAST and StarPU in the GYSELA code, ESAIM : Proceedings and Surveys, 2017.

J. Bigot, COMET : A High-Performance Model for Fine-Grain Composition, Inria research report. LIP laboratory

J. Bigot, Scaling GYSELA code beyond 32K-cores on Blue Gene, 2013.
DOI : 10.1051/proc/201343008

URL : https://hal.archives-ouvertes.fr/hal-01050322

R. D. Blumofe, Cilk : An Efficient Multithreaded Runtime System, Journal of parallel and distributed computing, 1996.
DOI : 10.1006/jpdc.1996.0107

B. Boehm, A view of 20th and 21st century software engineering, Proceeding of the 28th international conference on Software engineering, ICSE '06, 2006.
DOI : 10.1145/1134285.1134288

G. Bosilca, Scalable Dense Linear Algebra on Heterogeneous Hardware, 2013.

H. L. Bouziane, C. Pérez, and T. Priol, A Software Component Model with Spatial and Temporal Compositions for Grid Infrastructures, Proceedings of the International Conference on Parallel and Distributed Computing. Euro-Par, 2008.

H. L. Bouziane, C. Pérez, and T. Priol, Extending Software Component Models with the Master-worker Paradigm, In : Parallel Computing, 2010.

F. Broquedis, T. Gautier, and V. Danjean, libKOMP, an Efficient OpenMP Runtime System for Both Fork-Join and Data Flow Paradigms, Proceedings of the International Conference on OpenMP. IWOMP'2012, 2012.
DOI : 10.1007/978-3-642-30961-8_8

URL : https://hal.archives-ouvertes.fr/hal-00796253

E. Bruneton, The FRACTAL component model and its support in Java, Software: Practice and Experience, vol.6, issue.11-12, 2006.
DOI : 10.1002/spe.767

J. Bueno, Productive Programming of GPU Clusters with OmpSs, 2012 IEEE 26th International Parallel and Distributed Processing Symposium, 2012.
DOI : 10.1109/IPDPS.2012.58

B. L. Chamberlain, D. Callahan, and H. P. Zima, Parallel Programmability and the Chapel Language, The International Journal of High Performance Computing Applications, 2007.

P. Charles, X10 : An Object-oriented Approach to Non-uniform Cluster Computing, SIGPLAN Notices. OOPSLA'2005. ACM, 2005.

P. Ciechanowicz, M. Poldner, and H. Kuchen, The Münster Skeleton Library Muesli : A comprehensive overview, 2009.

M. E. Conway, A Multiprocessor System Design, Proceedings of the Fall Joint Computer Conference. AFIPS'1963 (Fall). ACM, 1963.

H. Coullon, C. Pérez, and J. Richard, Feasibility Study of a Runtime Component-based Model Integrating Task Graph Concept on a 1D Advection Case Study. Deliverable D3.2.1 of the ELCI PIA project. LIP laboratory
URL : https://hal.archives-ouvertes.fr/hal-01348204

O. Dahl, SIMULA 67 Common Base Language. Edited by Norwegian Computing Center, pp.7-9, 1968.

Z. DeVito, Liszt, Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, SC '11, 2011.
DOI : 10.1145/2063384.2063396

J. Dongarra, M. A. Heroux, and P. Luszczek, High-performance conjugate-gradient benchmark: A new metric for ranking high-performance computing systems, The International Journal of High Performance Computing Applications, vol.2, issue.4, 2016.
DOI : 10.1088/1742-6596/16/1/071

M. Dreher and T. Peterka, Bredala: Semantic Data Redistribution for In Situ Applications, 2016 IEEE International Conference on Cluster Computing (CLUSTER), 2016.
DOI : 10.1109/CLUSTER.2016.30

M. Dreher and T. Peterka, Decaf : Decoupled Dataflows for In Situ High-Performance Workflows. Tech. report, Argonne National Lab, Argonne, IL (United States), 2017.

D. F. D'Souza and A. C. Wills, Objects, Components, and Frameworks with UML : The Catalysis Approach, 1999.

A. Duran, J. Corbalán, and E. Ayguadé, An adaptive cut-off for task parallelism, 2008 SC, International Conference for High Performance Computing, Networking, Storage and Analysis, 2008.
DOI : 10.1109/SC.2008.5213927

A. Ernstsson, L. Li, and C. Kessler, SkePU 2: Flexible and Type-Safe Skeleton Programming for Heterogeneous Parallel Systems, International Journal of Parallel Programming, vol.42, issue.4, 2017.
DOI : 10.1145/2854038.2854041

J. Evans, A scalable concurrent malloc implementation for FreeBSD, Proceedings of the BSDCAN conference, 2006.

J. Favre, Towards a Basic Theory to Model Model Driven Engineering, Proceeding of the Workshop in Software Model Engineering. WiSME, 2004.

M. Frigo, C. E. Leiserson, and K. H. Randall, The Implementation of the Cilk-5 Multithreaded Language, In : SIGPLAN Notices, 1998.

F. Galilée, Athapascan-1: On-line building data flow graph in a parallel language, Proceedings. 1998 International Conference on Parallel Architectures and Compilation Techniques (Cat. No.98EX192), 1998.
DOI : 10.1109/PACT.1998.727176

T. Gautier, XKaapi: A Runtime System for Data-Flow Task Programming on Heterogeneous Architectures, 2013 IEEE 27th International Symposium on Parallel and Distributed Processing, 2013.
DOI : 10.1109/IPDPS.2013.66

URL : https://hal.archives-ouvertes.fr/hal-00799904

T. El-Ghazawi and L. Smith, UPC : Unified Parallel C, Proceedings of the Conference on Supercomputing. SC'2006, 2006.
DOI : 10.1002/0471478369

W. Gibbs, Software's Chronic Crisis, Scientific American, vol.271, issue.3, 1994.
DOI : 10.1038/scientificamerican0994-86

GNU OpenMP Runtime (GOMP) Implementation. GNU Project. url : https://gcc.gnu.org/projects

H. González-Vélez and M. Leyton, A survey of algorithmic skeleton frameworks: high-level structured parallel programming enablers, Software : Practice and Experience, 2010.
DOI : 10.1007/978-3-540-39425-9_37

V. Grandgirard, High-Q club : Highest Scaling Codes on JUQUEEN - GYSELA : GYrokinetic SEmi-LAgrangian code for plasma turbulence simulations, 2015.

V. Grandgirard, A 5D gyrokinetic full-f global semi-Lagrangian code for flux-driven ion turbulence simulations, Computer Physics Communications, vol.207, 2016.
DOI : 10.1016/j.cpc.2016.05.007

URL : https://hal.archives-ouvertes.fr/cea-01153011

J. L. Gustafson, Reevaluating Amdahl's law, Communications of the ACM, vol.31, issue.5, 1988.
DOI : 10.1145/42411.42415

URL : http://polaris.cs.uiuc.edu/~padua/cs420/2005/p532-gustafson.pdf

G. T. Heineman and W. T. Councill, Component-based Software Engineering : Putting the Pieces Together, 2001.

M. Herlihy and J. E. B. Moss, Transactional Memory : Architectural Support for Lock-free Data Structures, Proceedings of the 20th Annual International Symposium on Computer Architecture. ISCA'93, 1993.

B. Hess, GROMACS 4: Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation, Journal of Chemical Theory and Computation, vol.4, issue.3, 2008.
DOI : 10.1021/ct700301q

R. M. Hodur, The Naval Research Laboratory's Coupled Ocean/Atmosphere Mesoscale Prediction System (COAMPS), Monthly Weather Review, vol.125, issue.7, 1997.
DOI : 10.1175/1520-0493(1997)125<1414:TNRLSC>2.0.CO;2

A. Hugo, Composability of Parallel Codes on Heterogeneous Architectures, Thèse de doct
URL : https://hal.archives-ouvertes.fr/tel-01162975

B. Jacob, S. Ng, and D. Wang, Memory Systems : Cache, DRAM, Disk, ISBN 9780080553849, 2007.

J. Jézéquel, B. Combemale, and D. Vojtisek, Ingénierie Dirigée par les Modèles : des concepts à la pratique. Ellipses, p.13359162, 2012.

G. Kahn, The Semantics of a Simple Language for Parallel Programming, Information Processing, 1974.

L. V. Kale and S. Krishnan, CHARM++ : A Portable Concurrent Object Oriented System Based on C++, In : SIGPLAN Notices, 1993.

K. Kennedy, C. Koelbel, and H. Zima, The rise and fall of High Performance Fortran, Proceedings of the third ACM SIGPLAN conference on History of programming languages, HOPL III, 2007.
DOI : 10.1145/1238844.1238851

V. Kindratenko and P. Trancoso, Trends in High-Performance Computing, Computing in Science & Engineering, vol.13, issue.3, 2011.
DOI : 10.1109/MCSE.2011.52

C. H. Koelbel and M. E. Zosel, The High Performance FORTRAN Handbook, ISBN 0262111853, 1993.

J. Kurzak and J. Dongarra, Fully Dynamic Scheduler for Numerical Scheduling on Multicore Processors, Tech. report, LAWN (LAPACK Working Note), vol.220, 2009.

C. Lameter, NUMA (Non-Uniform Memory Access): An Overview, Queue, vol.11, issue.7, 2013.
DOI : 10.1145/2508834.2513149

V. Lanore, C. Perez, and J. Richard, Evaluating Component Assembly Specialization for 3D FFT. PRACE-2IP White Paper. LIP laboratory

V. Lanore, C. Perez, and J. Richard, Towards Application Variability Handling with Component Models: 3D-FFT Use Case Study, Euro-Par 2015 : Parallel Processing Workshops. UCHPC'2015, 2015.
DOI : 10.1007/978-3-319-27308-2_61

URL : https://hal.archives-ouvertes.fr/hal-01192732

K. Lau, A component model that is both control-driven and data-driven, Proceedings of the 14th international ACM Sigsoft symposium on Component based software engineering, CBSE '11, 2011.
DOI : 10.1145/2000229.2000236

J. Lee and M. Sato, Implementation and Performance Evaluation of XcalableMP: A Parallel Programming Language for Distributed Memory Systems, 2010 39th International Conference on Parallel Processing Workshops, 2010.
DOI : 10.1109/ICPPW.2010.62

J. Lesage and B. Raffin, A hierarchical component model for large parallel interactive applications, The Journal of Supercomputing, vol.14, issue.4, 2012.
DOI : 10.1109/38.291528

URL : https://hal.archives-ouvertes.fr/hal-00796249

B. H. Liskov and J. M. Wing, A Behavioral Notion of Subtyping, In : ACM Transactions on Programming Languages and Systems, 1994.

P. R. Luszczek, The HPC Challenge (HPCC) Benchmark Suite, Proceedings of the Conference on Supercomputing. SC'2006, 2006.

H. Markram, Reconstruction and Simulation of Neocortical Microcircuitry, Cell, vol.163, issue.2, 2015.
DOI : 10.1016/j.cell.2015.09.029

M. D. McIlroy, Mass-produced Software Components, Proc. NATO Conf. on Software Engineering, Garmisch, 1968.

S. A. McKee and R. W. Wisniewski, Memory Wall, In : Encyclopedia of Parallel Computing. Edited by David Padua, pp.1110-1116, 2011.

J. Montagnat, A data-driven workflow language for grids based on array programming principles, Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science, WORKS '09, 2009.
DOI : 10.1145/1645164.1645171

URL : https://hal.archives-ouvertes.fr/hal-00677806

G. E. Moore, Cramming More Components Onto Integrated Circuits, Electronics, 1965.

G. E. Moore, Progress in Digital Integrated Electronics, Electron Devices Meeting. IEDM'1975, 1975.

R. W. Numrich and J. Reid, Co-array Fortran for Parallel Programming, In : SIGPLAN Fortran Forum, 1998.

S. Olivier, OpenMP task scheduling strategies for multicore NUMA systems, The International Journal of High Performance Computing Applications, vol.44, issue.2, 2012.
DOI : 10.1145/1321211.1321241

R. van Ommering, The Koala Component Model for Consumer Electronics Software, In : Computer, 2000.

OpenMP Architecture Review Board, OpenMP Application Programming Interface Version 4.5, 2015.

D. A. Patterson, Latency Lags Bandwidth, Communications of the ACM, 2004.

C. Pérez, T. Priol, and A. Ribes, A Parallel CORBA Component Model for Numerical Code Coupling, International Journal of High Performance Computing Applications, 2003.

W. H. Press, Numerical Recipes : The Art of Scientific Computing. 3rd ed., ISBN 0521880688, 2007.

J. Reinders, Intel Threading Building Blocks : Outfitting C++ for Multicore Processor Parallelism. O'Reilly Media, ISBN 0596514808, 2007.

J. Richard, Vers un modèle de composants supportant l'ordonnancement de tâches pour le calcul de haute performance. 2015. url : https

M. C. Rinard, The Design, Implementation and Evaluation of Jade : A Portable, Implicitly Parallel Programming Language, PhD thesis, Stanford University (USA), 1994.

J. Rumbaugh, Object-Oriented Modeling and Design. Prentice-Hall, Englewood Cliffs, ISBN 0136298419, 1991.

J. Sanders and E. Kandrot, CUDA by Example : An Introduction to General-Purpose GPU Programming, Portable Documents, ISBN 0131387685, 2010.

M. Shaw, R. Deline, and G. Zelesnik, Abstractions and implementations for architectural connections, Proceedings of International Conference on Configurable Distributed Systems, 1996.
DOI : 10.1109/CDS.1996.509340

E. Slaughter, Regent, Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, SC '15, 2015.
DOI : 10.1002/(SICI)1096-9128(199809/11)10:11/13<825::AID-CPE383>3.0.CO;2-H

URL : http://dl.acm.org/ft_gateway.cfm?id=2807629&type=pdf

S. Krishnamoorthy, J. Ramanujam, and P. Sadayappan, Domain-Specific Languages and High-Level Frameworks for High-Performance Computing, Journal of Parallel and Distributed Computing, vol.74, issue.12, 2014.

J. E. Stone, D. Gohara, and G. Shi, OpenCL : A Parallel Programming Standard for Heterogeneous Computing Systems, In : Computing in Science & Engineering, 2010.

G. Strang, On the Construction and Comparison of Difference Schemes, SIAM Journal on Numerical Analysis, vol.5, issue.3, 1968.
DOI : 10.1137/0705041

C. Szyperski, Component Software : Beyond Object-Oriented Programming, ISBN 0201745720, 2002.

H. Tanno and H. Iwasaki, Parallel Skeletons for Variable-Length Lists in SkeTo Skeleton Library, Proceedings of the International Conference on Parallel and Distributed Computing, 2009.
DOI : 10.1145/1327171.1327180

R. Teyssier, Cosmological hydrodynamics with adaptive mesh refinement, Astronomy & Astrophysics, vol.535, issue.1, 2002.
DOI : 10.1086/312707

URL : https://www.aanda.org/articles/aa/pdf/2002/13/aa1593.pdf

P. Virouleau, Description, Implementation and Evaluation of an Affinity Clause for Task Directives, Proceedings of the International Conference on OpenMP : Memory, Devices, and Tasks, IWOMP 2012. IWOMP'2016, 2016.
DOI : 10.1007/978-3-319-11454-5_2

URL : https://hal.archives-ouvertes.fr/hal-01343442

A. Vlasov, On the Kinetic Theory of an Assembly of Particles with Collective Interaction, Journal of Physics USSR, 1945.

N. Wang, D. C. Schmidt, and D. Levine, Optimizing the CORBA component model for high-performance and real-time applications, In : Work-in-Progress session at Middleware, 2000.

S. Wienke, OpenACC - First Experiences with Real-World Applications, Proceedings of the International Conference on Parallel and Distributed Computing. Euro-Par'2012, 2012.
DOI : 10.1007/978-3-642-32820-6_85

N. Wirth, Modula : A Language for Modular Multiprogramming, Software : Practice and Experience, 1977.

W. Wu, Hierarchical DAG Scheduling for Hybrid Distributed Systems, 2015 IEEE International Parallel and Distributed Processing Symposium, 2015.
DOI : 10.1109/IPDPS.2015.56

URL : https://hal.archives-ouvertes.fr/hal-01078359

E. Yourdon and L. L. Constantine, Structured Design : Fundamentals of a Discipline of Computer Program and Systems Design, ISBN 0138544719, 1979.