they all end up at the bottom of the ranking, even when only the compute cpp is taken into account. Paradoxically, the most efficient one is the one that ,
energy efficiency of these machines for smaller images: 300×300 (Table 5.8). In this case, all the machines are oversized: the dual Yorkfield reaches a processing rate of 26,315 images/s and the "small" Penryn U9300 a rate of 2,777 images/s. The performance ordering is now respected: the U9300 ,
the performance of the machines was evaluated for fixed image sizes. It can be interesting to turn the problem around and ask over which range of image sizes these processors perform well ,
Muskel : an expandable skeleton environment, Scalable Computing : Practice and Experience, pp.325-341, 2007. ,
An advanced environment supporting structured parallel programming in Java, Future Generation Computer Systems, vol.19, issue.5, pp.611-626, 2003. ,
DOI : 10.1016/S0167-739X(02)00172-3
Validity of the single processor approach to achieving large scale computing capabilities, spring joint computer conference, AFIPS '67 (Spring), Proceedings of the, pp.483-485, 1967. ,
SkIE: A heterogeneous environment for HPC applications, Parallel Computing, vol.25, issue.13-14, pp.1827-1852, 1999. ,
DOI : 10.1016/S0167-8191(99)00072-1
Summarising an experiment in parallel programming language design, Proceedings of the International Conference and Exhibition on High-Performance Computing and Networking, HPCN Europe '95, pp.7-13, 1995. ,
DOI : 10.1007/BFb0046602
Integration of GRID Superscalar and GridWay Metascheduler with the DRMAA OGF Standard, Euro-Par '08 : Proceedings of the 14th international Euro-Par conference on Parallel Processing, pp.445-455, 2008. ,
DOI : 10.1007/978-3-540-85451-7_49
The IBM RISC System/6000 processor: Hardware overview, IBM Journal of Research and Development, vol.34, issue.1, pp.12-22, 1990. ,
DOI : 10.1147/rd.341.0012
CellSs: a Programming Model for the Cell BE Architecture, ACM/IEEE SC 2006 Conference (SC'06), 2006. ,
DOI : 10.1109/SC.2006.17
Linux on Cell Broadband Engine status update, Proceedings of the Linux Symposium, pp.21-27, 2007. ,
Instruction scheduling in the TOBEY compiler, IBM Journal of Research and Development, vol.38, issue.5, pp.577-593, 1994. ,
DOI : 10.1147/rd.385.0577
Heterogeneous Multicore Parallel Programming for Graphics Processing Units, Scientific Programming, pp.325-336, 2009. ,
DOI : 10.1155/2009/784893
Skil : An imperative language with algorithmic skeletons for efficient distributed programming, Proceedings of the 5th IEEE International Symposium on High Performance Distributed Computing, HPDC '96, p.243, 1996. ,
Brook for gpus : stream computing on graphics hardware ,
Portable programming with the PARMACS message-passing library, Parallel Computing, vol.20, issue.4, pp.615-632, 1994. ,
DOI : 10.1016/0167-8191(94)90031-0
Fine Tuning Algorithmic Skeletons, Euro-Par, pp.72-81, 2007. ,
DOI : 10.1007/978-3-540-74466-5_9
The Münster Skeleton Library Muesli -A Comprehensive Overview, 2009. ,
Domain decomposition and skeleton programming with OCamlP3l, Parallel Computing, vol.32, issue.7-8, pp.539-550, 2006. ,
DOI : 10.1016/j.parco.2006.04.003
Bringing skeletons out of the closet: a pragmatic manifesto for skeletal parallel programming, Parallel Computing, vol.30, issue.3, pp.389-406, 2004. ,
DOI : 10.1016/j.parco.2003.12.002
Algorithmic Skeletons : Structured Management of Parallel Computation, 1989. ,
Parallélisation d'opérateurs de TI : multi-coeurs, Cell ou GPU ?, Actes de la Conférence du GRETSI, 2009. ,
Performance Analysis of Cell Broadband Engine for High Memory Bandwidth Applications, Proceedings of the IEEE International Symposium on Performance Analysis of Systems & Software, pp.210-219, 2007. ,
OpenMP: an industry standard API for shared-memory programming, IEEE Computational Science and Engineering, vol.5, issue.1, pp.46-55, 1998. ,
DOI : 10.1109/99.660313
SKElib: Parallel Programming with Skeletons in C, Proceedings from the 6th International Euro-Par Conference on Parallel Processing, Euro-Par '00, pp.1175-1184, 2000. ,
DOI : 10.1007/3-540-44520-X_166
Parallel programming using skeleton functions, Proceedings of the 5th International PARLE Conference on Parallel Architectures and Languages Europe, PARLE '93, pp.146-160, 1993. ,
DOI : 10.1007/3-540-56891-3_12
Abstract machine models for highly parallel computers, 1995. ,
AltiVec extension to PowerPC accelerates media processing, IEEE Micro, vol.20, issue.2, pp.85-95, 2000. ,
DOI : 10.1109/40.848475
Hmpp : A hybrid multi-core parallel, First Workshop on General Purpose Processing on Graphics Processing Units, pp.1-5, 2007. ,
A set of level 3 basic linear algebra subprograms, ACM Transactions on Mathematical Software, vol.16, issue.1, pp.1-17, 1990. ,
DOI : 10.1145/77626.79170
llc : a Parallel Skeletal Language, Parallel Processing Letters, pp.437-448, 2003. ,
Using advanced compiler technology to exploit the performance of the Cell Broadband Engine architecture, IBM Syst. J, vol.45, issue.1, pp.59-84, 2006. ,
Optimizing Compiler for the CELL Processor, 14th International Conference on Parallel Architectures and Compilation Techniques (PACT'05), pp.161-172, 2005. ,
DOI : 10.1109/PACT.2005.33
Quaff: efficient C++ design for parallel skeletons, Parallel Computing, vol.32, issue.7-8, pp.604-615, 2006. ,
DOI : 10.1016/j.parco.2006.06.001
URL : https://hal.archives-ouvertes.fr/hal-00167412
Un cluster pour la Vision Temps Réel Architecture, Outils et Applications, 2006. ,
High Level Parallel Programming EDSL -A BOOST Libraries Use Case, BOOST'CON 09, 2009. ,
Formal semantics applied to the implementation of a skeleton-based parallel programming library, PARCO, pp.243-252, 2007. ,
EVE, an Object Oriented SIMD Library, International Conference on Computational Science, pp.314-321, 2004. ,
DOI : 10.1007/978-3-540-24688-6_43
URL : https://hal.archives-ouvertes.fr/hal-00103176
Sequoia: Programming the Memory Hierarchy, ACM/IEEE SC 2006 Conference (SC'06), 2006. ,
DOI : 10.1109/SC.2006.55
JaSkel: a Java skeleton-based framework for structured cluster and grid computing, Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID'06), pp.301-304, 2006. ,
DOI : 10.1109/CCGRID.2006.65
Some computer organizations and their effectiveness, IEEE Transactions on Computers, vol.21, issue.9, pp.948-960, 1972. ,
Parallel architectures, ACM Comput. Surv, vol.28, issue.1, pp.67-70, 1996. ,
A survey of algorithmic skeleton frameworks: high-level structured parallel programming enablers, Software: Practice and Experience, vol.40, issue.12, pp.1135-1160, 2010. ,
DOI : 10.1002/spe.1026
Shared memory multiprocessor support for functional array processing in SAC, Journal of Functional Programming, vol.15, issue.3, pp.353-401, 2005. ,
DOI : 10.1017/S0956796805005538
An Open Source Environment for Cell Broadband Engine System Software, Computer, vol.40, issue.6, pp.37-47, 2007. ,
DOI : 10.1109/MC.2007.192
Reevaluating Amdahl's law, Communications of the ACM, vol.31, issue.5, pp.532-533, 1988. ,
DOI : 10.1145/42411.42415
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.509.6892
A Combined Corner and Edge Detector, Proceedings of the Alvey Vision Conference 1988, pp.147-151, 1988. ,
DOI : 10.5244/C.2.23
Hdc : A higher-order language for divide-and-conquer, Parallel Processing Letters, pp.239-250, 2000. ,
Power Efficient Processor Architecture and The Cell Processor, 11th International Symposium on High-Performance Computer Architecture, pp.258-262, 2005. ,
DOI : 10.1109/HPCA.2005.26
Parallel elementwise processable functions in concurrent clean, Mathematical and Computer Modelling, vol.38, issue.7-9, pp.865-875, 2003. ,
DOI : 10.1016/S0895-7177(03)90071-9
1003.1c-1995 thread extensions, 1995. ,
Programmation par squelettes algorithmiques pour le processeur cell, SYMPA '08 : SYMPosium en Architectures nouvelles de machines, 2008. ,
Introduction to the Cell Broadband Engine Architecture, IBM Journal of Research and Development, vol.51, issue.5, pp.503-519, 2007. ,
DOI : 10.1147/rd.515.0503
Introduction to the Cell multiprocessor, IBM Journal of Research and Development, vol.49, issue.4.5, pp.589-604, 2005. ,
DOI : 10.1147/rd.494.0589
The Imagine Stream Processor, Proceedings. IEEE International Conference on Computer Design: VLSI in Computers and Processors, pp.282-288, 2002. ,
DOI : 10.1109/ICCD.2002.1106783
Programmable stream processors, Computer, vol.36, issue.8, pp.54-62, 2003. ,
Measuring parallel processor performance, Communications of the ACM, vol.33, issue.5, pp.539-543, 1990. ,
DOI : 10.1145/78607.78614
Optimizing compilers for modern architectures : a dependence-based approach, 2002. ,
The rise and fall of High Performance Fortran, Proceedings of the third ACM SIGPLAN conference on History of programming languages, HOPL III, pp.7-8, 2007. ,
DOI : 10.1145/1238844.1238851
The OpenCL Specification, version 1.0, 2008. ,
Cell Multiprocessor Communication Network: Built for Speed, IEEE Micro, vol.26, issue.3, pp.10-23, 2006. ,
DOI : 10.1109/MM.2006.49
A Skeleton Library, Euro-Par, pp.620-629, 2002. ,
DOI : 10.1007/3-540-45706-2_86
A Buffered-Mode MPI Implementation for the Cell BE Processor, Computational Science - ICCS 2007, pp.603-610, 2007. ,
DOI : 10.1007/978-3-540-72584-8_80
Systolic Arrays for (VLSI). CMU-CS, Dept. of Computer Science, 1978. ,
OpenMP to GPGPU : a compiler framework for automatic translation and optimization, Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming, PPoPP '09, pp.101-110, 2009. ,
Skandium: Multi-core Programming with Algorithmic Skeletons, 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing, pp.289-296, 2010. ,
DOI : 10.1109/PDP.2010.26
Parallel functional programming in Eden, Journal of Functional Programming, vol.15, issue.3, pp.431-475, 2005. ,
DOI : 10.1017/S0956796805005526
A library of constructive skeletons for sequential style of parallel programming, Proceedings of the 1st international conference on Scalable information systems, InfoScale '06, 2006. ,
DOI : 10.1145/1146847.1146860
Data-Parallel Programming on the Cell BE and the GPU using the RapidMind Development Platform, Proceeding of GSPx Multicore Applications Conference, 2006. ,
When and how to develop domain-specific languages, ACM Computing Surveys, vol.37, issue.4, pp.316-344, 2005. ,
DOI : 10.1145/1118890.1118892
Nested algorithmic skeletons from higher order functions, Parallel Algorithms and Applications, pp.181-206, 2001. ,
Cramming More Components Onto Integrated Circuits, Proceedings of the IEEE, vol.86, issue.1, pp.114-117, 1965. ,
DOI : 10.1109/JPROC.1998.658762
Obstacle avoidance and navigation in the real world by a seeing robot rover, Robotics Institute, 1980. ,
Intel's Array Building Blocks: A retargetable, dynamic compiler and embedded language, International Symposium on Code Generation and Optimization (CGO 2011), pp.224-235, 2011. ,
DOI : 10.1109/CGO.2011.5764690
Scalable parallel programming with CUDA, Queue, pp.40-53, 2008. ,
Supporting OpenMP on Cell, International Journal of Parallel Programming, vol.36, issue.3, pp.289-311, 2008. ,
MPI microtask for programming the Cell Broadband Engine processor, IBM Systems Journal, vol.45, issue.1, pp.85-102, 2006. ,
DOI : 10.1147/sj.451.0085
A Survey of General-Purpose Computation on Graphics Hardware, Computer Graphics Forum, vol.26, issue.1, pp.80-113, 2007. ,
DOI : 10.1016/j.rti.2005.04.002
Parallélisation d'opérateurs de TI : multi-coeurs, Cell ou GPU ?, TS, pp.161-187, 2010. ,
Computational Discrete Mathematics : Combinatorics and Graph Theory with Mathematica, pp.336-337, 2003. ,
DOI : 10.1017/CBO9781139164849
Algorithmic Skeletons within an Embedded Domain Specific Language for the CELL Processor, 2009 18th International Conference on Parallel Architectures and Compilation Techniques, 2009. ,
DOI : 10.1109/PACT.2009.21
URL : https://hal.archives-ouvertes.fr/hal-00905054
Parallelization Strategies for the Points of Interests Algorithm on the Cell Processor, ISPA '07 : Proceedings of the 5th International Symposium on Parallel and Distributed Processing and Applications, 2007. ,
DOI : 10.1007/978-3-540-74742-0_12
Parallelization Schemes for Memory Optimization on the Cell Processor: A Case Study on the Harris Corner Detector, Transactions on High-Performance Embedded Architectures and Compilers III, pp.177-200, 2011. ,
DOI : 10.1007/s10766-007-0034-5
URL : https://hal.archives-ouvertes.fr/hal-00753708
Parallelization schemes for memory optimization on the cell processor, Proceedings of the 2007 workshop on MEmory performance DEaling with Applications, systems and architecture, MEDEA '07, 2007. ,
DOI : 10.1145/1327171.1327172
URL : https://hal.archives-ouvertes.fr/hal-00753708
Cell-MPI mastering the cell broadband engine architecture through a boost based parallel communication library, the 5th Annual Boost Libraries Conference, 2011. ,
Skeletons for parallel image processing: an overview of the SKIPPER project, Parallel Computing, vol.28, issue.12, pp.1685-1708, 2002. ,
DOI : 10.1016/S0167-8191(02)00189-8
PVM: A framework for parallel distributed computing, Concurrency: Practice and Experience, vol.2, issue.4, pp.315-339, 1990. ,
DOI : 10.1002/cpe.4330020404
The harris algorithm revisited on the cell processor, Proceedings of the 1st International Workshop on Highly Efficient Accelerators and Reconfigurable Technologies HEART 2010, 2010. ,
StreamIt: A Language for Streaming Applications, International Conference on Compiler Construction, 2002. ,
DOI : 10.1007/3-540-45937-5_14
C++ Templates : The Complete Guide ,
The programming model of ASSIST, an environment for parallel and distributed portable applications, Parallel Computing, vol.28, issue.12, pp.1709-1732, 2002. ,
DOI : 10.1016/S0167-8191(02)00188-6
Expression templates, 1995. ,
C++ templates as partial evaluation, PEPM, pp.13-18, 1999. ,
First draft of a report on the EDVAC, IEEE Annals of the History of Computing, vol.15, issue.4, pp.27-75, 1993. ,
DOI : 10.1109/85.238389
More iteration space tiling, Proceedings of the 1989 ACM/IEEE conference on Supercomputing, Supercomputing '89, 1989. ,
DOI : 10.1145/76263.76337