A. Albrecht, Measuring Application Development Productivity, IBM Application Development Symp, pp.83-92, 1979.

M. Aldinucci, M. Danelutto, and P. Dazzi, Muskel: An expandable skeleton environment, Scientific International Journal for Parallel and Distributed Computing, vol.8, pp.325-341, 2007.

M. Aldinucci, M. Danelutto, and P. Teti, An advanced environment supporting structured parallel programming in Java, Future Generation Computer Systems, vol.19, issue.5, pp.611-626, 2002.
DOI : 10.1016/S0167-739X(02)00172-3

V. Aslot, M. Domeika, R. Eigenmann, G. Gaertner, W. Jones et al., SPEComp: A New Benchmark Suite for Measuring Parallel Computer Performance
DOI : 10.1007/3-540-44587-0_1

E. Ayguadé, N. Copty, A. Duran, J. Hoeflinger, Y. Lin et al., The Design of OpenMP Tasks, IEEE Transactions on Parallel and Distributed Systems, vol.20, issue.3, pp.404-418, 2009.
DOI : 10.1109/TPDS.2008.105

B. Bacci, M. Danelutto, S. Pelagatti, and M. Vanneschi, SkIE: A heterogeneous environment for HPC applications, Parallel Computing, vol.25, issue.13-14, pp.1827-1852, 1999.
DOI : 10.1016/S0167-8191(99)00072-1

D. H. Bailey, E. Barszcz, J. T. Barton, D. S. Browning, R. L. Carter et al., The NAS parallel benchmarks - summary and preliminary results, Proceedings of the 1991 ACM/IEEE conference on Supercomputing, Supercomputing '91, pp.158-165, 1991.
DOI : 10.1145/125826.125925

K. Bańczyk, T. Boiński, and H. Krawczyk, Object serialization and remote exception pattern for distributed C++/MPI application, PaCT, pp.188-193.

A. Benoit, M. Cole et al., Evaluating the performance of skeleton-based high level parallel programs, The International Conference on Computational Science, Part III, 2004.

A. Benoit, M. Cole, S. Gilmore, and J. Hillston, Flexible Skeletal Programming with eSkel, 11th International Euro-Par Conference, pp.761-770, 2005.
DOI : 10.1007/11549468_83

URL : https://hal.archives-ouvertes.fr/hal-00807021

M. Beran, Decomposable Bulk Synchronous Parallel Computers, Proceedings of the 26th Conference on Current Trends in Theory and Practice of Informatics on Theory and Practice of Informatics, SOFSEM '99, pp.349-359, 1999.
DOI : 10.1007/3-540-47849-3_22

O. Berkman, B. Schieber, and U. Vishkin, Optimal Doubly Logarithmic Parallel Algorithms Based On Finding All Nearest Smaller Values, Journal of Algorithms, vol.14, issue.3, pp.344-370, 1993.
DOI : 10.1006/jagm.1993.1018

M. Berry, Public international benchmarks for parallel computers: PARKBENCH Committee: Report-1, Scientific Programming, pp.100-146, 1994.

J. Berthold, M. Dieterle, and R. Loogen, Implementing parallel Google MapReduce in Eden, Proceedings of the 15th International Euro-Par Conference on Parallel Processing, Euro-Par'09, pp.990-1002, 2009.

C. Bienia, S. Kumar, J. P. Singh, and K. Li, The PARSEC benchmark suite, Proceedings of the 17th international conference on Parallel architectures and compilation techniques, PACT '08, pp.72-81, 2008.
DOI : 10.1145/1454115.1454128

R. Bird, An Introduction to the Theory of Lists, Logic of Programming and Calculi of Discrete Design, pp.5-42, 1987.
DOI : 10.1007/978-3-642-87374-4_1

R. Bisseling, Parallel Scientific Computation. A structured approach using BSP and MPI, 2004.
DOI : 10.1093/acprof:oso/9780198529392.001.0001

. Whaley, ScaLAPACK Users' Guide, Society for Industrial and Applied Mathematics, 1997.

R. D. Blumofe, C. F. Joerg, B. C. Kuszmaul, C. E. Leiserson, K. H. Randall et al., Cilk: An efficient multithreaded runtime system, Proceedings of the Fifth ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP), pp.207-216, 1995.
DOI : 10.1006/jpdc.1996.0107

G. H. Botorog and H. Kuchen, Efficient parallel programming with algorithmic skeletons, in Bougé et al. [24].
DOI : 10.1007/3-540-61626-8_95

G. Botorog and H. Kuchen, Skil: an imperative language with algorithmic skeletons for efficient distributed programming, Proceedings of 5th IEEE International Symposium on High Performance Distributed Computing HPDC-96, pp.243-252, 1996.
DOI : 10.1109/HPDC.1996.546194

G. Botorog and H. Kuchen, Efficient high-level parallel programming, Theoretical Computer Science, vol.196, issue.1-2, pp.71-107, 1998.
DOI : 10.1016/S0304-3975(97)00196-5

URL : http://doi.org/10.1016/s0304-3975(97)00196-5

W. Bousdira, F. Loulergue, and J. Tesson, A Verified Library of Algorithmic Skeletons on Evenly Distributed Arrays, ICA3PP 2012, pp.218-232
DOI : 10.1007/978-3-642-33078-0_16

URL : https://hal.archives-ouvertes.fr/hal-00708822

S. Breitinger, Design and Implementation of the Parallel Functional Language Eden, 1998.

S. Breitinger, R. Loogen, Y. Ortega-Mallén, and R. Peña-Marí, Eden - The paradise of functional concurrent programming, in Bougé et al. [24].
DOI : 10.1007/3-540-61626-8_93

D. Caromel and L. Henrio, A Theory of Distributed Objects, 2004.

D. Caromel and M. Leyton, Fine Tuning Algorithmic Skeletons, Euro-Par Parallel Processing, pp.72-81, 2007.
DOI : 10.1007/978-3-540-74466-5_9

D. Caromel and M. Leyton, A transparent non-invasive file data model for algorithmic skeletons, 2008 IEEE International Symposium on Parallel and Distributed Processing, pp.1-10, 2008.
DOI : 10.1109/IPDPS.2008.4536276

H. Chafi, A. K. Sujeeth, K. J. Brown, H. Lee, A. R. Atreya et al., A domain-specific approach to heterogeneous parallelism, Proceedings of the 16th ACM symposium on Principles and practice of parallel programming, pp.35-46, 2011.
DOI : 10.1145/2038037.1941561

E. Chailloux, P. Manoury, and B. Pagano, Développement d'applications avec Objective Caml, 2000.
DOI : 10.3166/tsi.24.1055-1080

P. Ciechanowicz and H. Kuchen, Enhancing Muesli's Data Parallel Skeletons for Multi-core Computer Architectures, 2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC), pp.108-113, 2010.
DOI : 10.1109/HPCC.2010.23

M. Cole, Algorithmic Skeletons, pp.2-23, 1989.
DOI : 10.1007/978-1-4471-0841-2_13

M. Cole, Parallel Programming with List Homomorphisms, Parallel Processing Letters, pp.191-203, 1995.

M. Cole, Bringing skeletons out of the closet: a pragmatic manifesto for skeletal parallel programming, Parallel Computing, vol.30, issue.3, pp.389-406, 2004.
DOI : 10.1016/j.parco.2003.12.002

S. Cook, CUDA Programming: A Developer's Guide to Parallel Computing with GPUs, 2012.

J. O. Coplien, Curiously recurring template patterns, C++ Report, vol.7, pp.24-27, 1995.

D. Coppersmith and S. Winograd, Matrix multiplication via arithmetic progressions, Proceedings of the nineteenth annual ACM conference on Theory of computing, STOC '87, pp.251-280, 1990.
DOI : 10.1145/28395.28396

URL : http://doi.org/10.1016/s0747-7171(08)80013-2

L. Dagum and R. Menon, OpenMP: an industry standard API for shared-memory programming, IEEE Computational Science and Engineering, vol.5, issue.1, pp.46-55, 1998.
DOI : 10.1109/99.660313

P. D'Alberto and A. Nicolau, Adaptive Strassen and ATLAS's DGEMM: A Fast Square-Matrix Multiply for Modern High-Performance Systems, Proceedings of the Eighth International Conference on High-Performance Computing in Asia-Pacific Region, HPCASIA, p.45, 2005.

M. Danelutto, R. Di Cosmo, X. Leroy, and S. Pelagatti, OcamlP3L, a functional parallel programming system, LIENS-98-1, ENS, 1998.

M. Danelutto and M. Stigliani, SKElib: Parallel Programming with Skeletons in C, Proceedings from the 6th International Euro-Par Conference on Parallel Processing, pp.1175-1184, 2000.
DOI : 10.1007/3-540-44520-X_166

M. Danelutto and P. Teti, Lithium: A Structured Parallel Programming Environment in Java, LNCS, vol.2330, pp.844-853, 2002.

J. Darlington, Y. Guo, H. W. To, and J. Yang, Parallel skeletons for structured composition, Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming, PPOPP '95, pp.19-28, 1995.
DOI : 10.1145/209936.209940

R. Di Cosmo and M. Danelutto, A "Minimal Disruption" Skeleton Experiment: Seamless Map & Reduce Embedding in OCaml, ICCS, volume 9 of Procedia Computer Science, pp.1837-1846, 2012.
DOI : 10.1016/j.procs.2012.04.202

URL : https://hal.archives-ouvertes.fr/hal-00692515

J. J. Dongarra, J. Du Croz, S. Hammarling, and I. S. Duff, A set of level 3 basic linear algebra subprograms, ACM Transactions on Mathematical Software, vol.16, issue.1, pp.1-17, 1990.
DOI : 10.1145/77626.79170

N. Drosinos and N. Koziris, Performance comparison of pure MPI vs hybrid MPI-OpenMP parallelization models on SMP clusters, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings., p.15, 2004.
DOI : 10.1109/IPDPS.2004.1302919

J. Enmyren and C. Kessler, SkePU, Proceedings of the fourth international workshop on High-level parallel programming and applications, HLPP '10, 2010.
DOI : 10.1145/1863482.1863487

S. Ernsting and H. Kuchen, Algorithmic skeletons for multi-core, multi-GPU systems and clusters, International Journal of High Performance Computing and Networking, vol.7, issue.2, pp.129-138, 2012.
DOI : 10.1504/IJHPCN.2012.046370

J. Falcou, J. Sérot, T. Chateau, and J. Lapresté, Quaff: efficient C++ design for parallel skeletons, Parallel Computing, vol.32, issue.7-8, pp.604-615, 2006.
DOI : 10.1016/j.parco.2006.06.001

URL : https://hal.archives-ouvertes.fr/hal-00167412

S. Fortune and J. Wyllie, Parallelism in random access machines, Proceedings of the tenth annual ACM symposium on Theory of computing, STOC '78, pp.114-118, 1978.
DOI : 10.1145/800133.804339

E. Gamma, R. Helm, R. Johnson, and J. Vlissides, Design patterns: elements of reusable object-oriented software, 1995.

L. Gesbert, Z. Hu, F. Loulergue, K. Matsuzaki, and J. Tesson, Systematic Development of Correct Bulk Synchronous Parallel Programs, 2010 International Conference on Parallel and Distributed Computing, Applications and Technologies, pp.334-340, 2010.
DOI : 10.1109/PDCAT.2010.86

URL : https://hal.archives-ouvertes.fr/hal-00512867

J. González-Domínguez, G. Taboada, B. Fraguela, M. Martín, and J. Touriño, Servet: A benchmark suite for autotuning on multicore clusters, 2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS), pp.1-9, 2010.
DOI : 10.1109/IPDPS.2010.5470358

J. B. Goodenough, Exception handling: issues and a proposed notation, Communications of the ACM, vol.18, issue.12, pp.683-696, 1975.
DOI : 10.1145/361227.361230

S. Gorlatch and H. Bischof, Formal Derivation of Divide-and-Conquer Programs: A Case Study in the Multidimensional FFT's, Formal Methods for Parallel Programming: Theory and Applications, pp.80-94, 1997.

P. Haller and M. Odersky, Scala Actors: Unifying thread-based and event-based programming, Theoretical Computer Science, vol.410, issue.2-3, pp.202-220, 2009.
DOI : 10.1016/j.tcs.2008.09.019

URL : http://doi.org/10.1016/j.tcs.2008.09.019

M. H. Halstead, Elements of Software Science, 1977.

K. Hamidouche, J. Falcou, and D. Etiemble, Hybrid bulk synchronous parallelism library for clustered smp architectures, Proceedings of the fourth international workshop on High-level parallel programming and applications, HLPP '10, pp.55-62, 2010.
DOI : 10.1145/1863482.1863494

P. H. Hargrove and J. C. Duell, Berkeley lab checkpoint/restart (BLCR) for Linux clusters, Journal of Physics: Conference Series, vol.46, issue.1, p.494, 2006.
DOI : 10.1088/1742-6596/46/1/067

X. He and C. Huang, Communication Efficient BSP Algorithm for All Nearest Smaller Values Problem, Journal of Parallel and Distributed Computing, vol.61, issue.10, pp.1425-1438, 2001.
DOI : 10.1006/jpdc.2001.1741

J. L. Henning, SPEC CPU2006 benchmark descriptions, ACM SIGARCH Computer Architecture News, vol.34, issue.4, pp.1-17, 2006.
DOI : 10.1145/1186736.1186737

C. A. Herrmann and C. Lengauer, HDC: A higher-order language for divide-and-conquer, Parallel Processing Letters, pp.239-250, 2000.

A. P. Dongarra and P. Luszczek, LINPACK Benchmark, pp.803-820, 2003.
DOI : 10.1007/978-0-387-09766-4_155

N. Javed, Meta-programmed Algorithmic Skeletons: Implementations, Performances and Semantics, 2011.

E. Johnson and D. Gannon, HPC++, Proceedings of the 11th international conference on Supercomputing, ICS '97, pp.124-131, 1997.
DOI : 10.1145/263580.263614

Y. Karasawa and H. Iwasaki, A Parallel Skeleton Library for Multi-core Clusters, 2009 International Conference on Parallel Processing, pp.84-91, 2009.
DOI : 10.1109/ICPP.2009.18

C. F. Kemerer, An empirical validation of software cost estimation models, Communications of the ACM, vol.30, issue.5, pp.416-429, 1987.
DOI : 10.1145/22899.22906

K. Kennedy, C. Koelbel, and H. Zima, The rise and fall of High Performance Fortran, Proceedings of the third ACM SIGPLAN conference on History of programming languages, HOPL III, pp.7-8, 2007.
DOI : 10.1145/1238844.1238851

H. Kuchen, A Skeleton Library, 8th International Euro-Par Conference, pp.620-629, 2002.
DOI : 10.1007/3-540-45706-2_86

C. L. Lawson, R. J. Hanson, D. R. Kincaid, and F. T. Krogh, Basic Linear Algebra Subprograms for Fortran Usage, ACM Transactions on Mathematical Software, vol.5, issue.3, pp.308-323, 1979.
DOI : 10.1145/355841.355847

J. Legaux, Z. Hu, F. Loulergue, K. Matsuzaki, and J. Tesson, Programming with BSP Homomorphisms, 2013.
DOI : 10.1007/978-3-642-40047-6_46

URL : https://hal.archives-ouvertes.fr/hal-00822992

J. Legaux, S. Jubertie, and F. Loulergue, Experiments in Parallel Matrix Multiplication on Multi-Core Systems, ICA3PP 2012, pp.362-376, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00708824

J. Legaux, F. Loulergue, and S. Jubertie, Managing Arbitrary Distributions of Arrays in Orléans Skeleton Library, International Conference on High Performance Computing and Simulation (HPCS), pp.437-444, 2013.

J. Legaux, F. Loulergue, and S. Jubertie, OSL: An Algorithmic Skeleton Library with Exceptions, International Conference on Computational Science (ICCS), volume 18 of Procedia Computer Science, pp.260-269, 2013.
DOI : 10.1016/j.procs.2013.05.189

URL : https://hal.archives-ouvertes.fr/hal-00822990

M. Leyton, Advanced features for algorithmic skeleton programming, 2008.

M. Leyton, L. Henrio, and J. M. Piquer, Exceptions for Algorithmic Skeletons, 16th International Euro-Par Conference, pp.14-25, 2010.
DOI : 10.1007/978-3-642-15291-7_3

URL : https://hal.archives-ouvertes.fr/hal-00486108

M. Leyton and J. M. Piquer, Skandium: Multi-core Programming with Algorithmic Skeletons, 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing, pp.289-296, 2010.
DOI : 10.1109/PDP.2010.26

F. Loulergue, F. Gava, and D. Billiet, Bulk Synchronous Parallel ML: Modular Implementation and Performance Prediction, LNCS, vol.3515, pp.1046-1054, 2005.
DOI : 10.1007/11428848_132

C. Makassikis, V. Galtier, and S. Vialle, A Skeletal-Based Approach for the Development of Fault-Tolerant SPMD Applications, 2010 International Conference on Parallel and Distributed Computing, Applications and Technologies, pp.239-248, 2010.
DOI : 10.1109/PDCAT.2010.89

URL : https://hal.archives-ouvertes.fr/inria-00548953

C. Makassikis, S. Vialle, and X. Warin, FT-GReLoSSS: A Skeletal-Based Approach towards Application Parallelization and Low-Overhead Fault Tolerance, 2012 20th Euromicro International Conference on Parallel, Distributed and Network-based Processing, pp.237-244
DOI : 10.1109/PDP.2012.18

URL : https://hal.archives-ouvertes.fr/hal-00640558

G. Martín, M. Marinescu, D. Singh, and J. Carretero, FLEX-MPI: An MPI Extension for Supporting Dynamic Load Balancing on Heterogeneous Non-dedicated Systems, Euro-Par 2013 Parallel Processing, pp.138-149, 2013.
DOI : 10.1007/978-3-642-40047-6_16

K. Matsuzaki, H. Iwasaki, K. Emoto, and Z. Hu, A library of constructive skeletons for sequential style of parallel programming, Proceedings of the 1st international conference on Scalable information systems, InfoScale '06, pp.29-112, 2006.
DOI : 10.1145/1146847.1146860

A. Mazouz, S. Touati, and D. Barthou, Study of Variations of Native Program Execution Times on Multi-Core Architectures, 2010 International Conference on Complex, Intelligent and Software Intensive Systems, pp.919-924, 2010.
DOI : 10.1109/CISIS.2010.96

URL : https://hal.archives-ouvertes.fr/hal-00643731

T. J. McCabe, A Complexity Measure, ICSE '76: Proceedings of the 2nd international conference on Software engineering, 1976.
DOI : 10.1109/TSE.1976.233837

W. F. McColl, Scalability, portability and predictability: The BSP approach to parallel programming, Future Generation Computer Systems, vol.12, issue.4, pp.265-272, 1996.
DOI : 10.1016/S0167-739X(96)00030-1

F. McMahon, The Livermore Fortran Kernels: A Computer Test of Numerical Performance Range, 1986.

K. Morita, A. Morihata, K. Matsuzaki, Z. Hu et al., Automatic Inversion Generates Divide-and-Conquer Parallel Programs, Conference on Programming Language Design and Implementation (PLDI), pp.146-155.

A. Munshi, B. Gaster, T. G. Mattson, J. Fung, and D. Ginsburg, OpenCL Programming Guide, p.17, 2011.

B. Nichols, D. Buttlar, and J. Farrell, Pthreads Programming: A POSIX Standard for Better Multiprocessing, 1996.

S. Pelagatti, A Methodology for the Development and the Support of Massively Parallel Programs, p.26, 1993.

S. Pelagatti, Structured Development of Parallel Programs, 1998.

A. Peleg and U. Weiser, MMX technology extension to the Intel architecture, IEEE Micro, vol.16, issue.4, pp.42-50, 1996.
DOI : 10.1109/40.526924

J. Reinders, Intel Threading Building Blocks: Outfitting C++ for Multi-core Processor Parallelism. O'Reilly, pp.16-29, 2007.

M. Rintala, Exceptions in remote procedure calls using C++ template metaprogramming, Software: Practice and Experience, vol.23, issue.3, pp.231-246, 2007.
DOI : 10.1002/spe.754

J. Serot and D. Ginhac, Skeletons for parallel image processing: an overview of the SKIPPER project, Parallel Computing, vol.28, issue.12, pp.1685-1708, 2002.
DOI : 10.1016/S0167-8191(02)00189-8

URL : https://hal.archives-ouvertes.fr/hal-00704336

J. P. Singh, W. Weber, and A. Gupta, SPLASH, ACM SIGARCH Computer Architecture News, vol.20, issue.1, 1992.
DOI : 10.1145/130823.130824

M. Snir and W. Gropp, MPI the Complete Reference, pp.16-100, 1998.

M. Steuwer, P. Kegel, and S. Gorlatch, SkelCL - A Portable Skeleton Library for High-Level GPU Programming, 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum, pp.1176-1182, 2011.
DOI : 10.1109/IPDPS.2011.269

V. Strassen, Gaussian elimination is not optimal, Numerische Mathematik, vol.13, issue.4, pp.354-356, 1969.
DOI : 10.1007/BF02165411

URL : http://www.digizeitschriften.de/download/PPN362160546_0013/PPN362160546_0013___log38.pdf

A. Strey and M. Bange, Performance Analysis of Intel's MMX and SSE: A Case Study, Proceedings of the 7th International Euro-Par Conference Manchester on Parallel Processing, Euro-Par '01, pp.142-147, 2001.
DOI : 10.1007/3-540-44681-8_21

J. Tesson, Environnement pour le développement et la preuve de correction systématiques de programmes parallèles fonctionnels, pp.115-117, 2011.

J. Tesson, H. Hashimoto, Z. Hu, F. Loulergue, and M. Takeichi, Program Calculation in Coq, Thirteenth International Conference on Algebraic Methodology And Software Technology (AMAST2010), pp.163-179, 2010.
DOI : 10.1145/1596614.1596616

URL : https://hal.archives-ouvertes.fr/inria-00448751

J. Tesson and F. Loulergue, A Verified Bulk Synchronous Parallel ML Heat Diffusion Simulation, International Conference on Computational Science (ICCS), Procedia Computer Science, pp.36-45, 2011.
DOI : 10.1016/j.procs.2011.04.005

URL : https://hal.archives-ouvertes.fr/hal-00588894

A. Tiskin, The Design and Analysis of Bulk-Synchronous Parallel Algorithms, pp.92-93, 1998.

S. Touati, J. Worms, and S. Briais, The speedup-test: a statistical methodology for programme speedup analysis and computation, Concurrency and Computation: Practice and Experience, pp.1410-1426

L. G. Valiant, A bridging model for parallel computation, Communications of the ACM, vol.33, issue.8, pp.20-60, 1990.
DOI : 10.1145/79173.79181

M. Vanneschi, The programming model of ASSIST, an environment for parallel and distributed portable applications, Parallel Computing, vol.28, issue.12, pp.1709-1732, 2002.
DOI : 10.1016/S0167-8191(02)00188-6

T. Veldhuizen, Techniques for Scientific C++, Computer Science Technical Report 542, pp.60-76, 2000.

T. L. Veldhuizen, C++ templates are turing complete, 2003.

T. L. Veldhuizen and D. Gannon, Active libraries: Rethinking the roles of compilers and libraries, Proceedings of the SIAM Workshop on Object Oriented Methods for Inter-operable Scientific and Engineering Computing (OO'98), 1998.

A. Wollrath, R. Riggs, and J. Waldo, A distributed object model for the Java system, Proceedings of the 2nd USENIX Conference on Object-Oriented Technologies (COOTS), p.17.

S. C. Woo, M. Ohara et al., The SPLASH-2 programs: characterization and methodological considerations, 1995.

G. Yaikhom, M. Cole, S. Gilmore, and J. Hillston, A Structural Approach for Modelling Performance of Systems Using Skeletons, Electronic Notes in Theoretical Computer Science, vol.190, issue.3, pp.167-183, 2007.
DOI : 10.1016/j.entcs.2007.07.010

K. Yotov, K. Pingali, and P. Stodghill, X-ray: a tool for automatic measurement of hardware parameters, Second International Conference on the Quantitative Evaluation of Systems (QEST'05), p.168, 2005.
DOI : 10.1109/QEST.2005.44