D. Extending, 40 3.6.1 The DDF Model, p.42

R. Support and .. , Monitoring The Global, tous droits réservés 5.2.1 Language Compiler And, p.72

P. Ddf-programs, 79 5.3.1 ORCC Extensions, p.79

M. Selva, L. Morel, K. Marquet, and S. Frénot, Extending dataflow programs with throughput properties, Proceedings of the First International Workshop on Many-core Embedded Systems, MES '13, pp.54-57
DOI : 10.1145/2489068.2489077

M. Selva, L. Morel, K. Marquet, and S. Frénot, A Monitoring System for Runtime Adaptations of Streaming Applications, 2015 23rd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, p.3, 2015.
DOI : 10.1109/PDP.2015.53

URL : https://hal.archives-ouvertes.fr/hal-01229025

E. A. Lee, The Problem with Threads, Computer, vol.39, issue.5, pp.33-42, 2006.
DOI : 10.1109/MC.2006.180

V. Pankratius, A. Jannesari, and W. F. Tichy, Parallelizing Bzip2: A Case Study in Multicore Software Engineering, IEEE Software, vol.26, issue.6, pp.70-77, 2009.
DOI : 10.1109/MS.2009.183

D. Q. Guthrie, B. Nguyen, C. F. Blaner, E. Marino, P. Retter et al., IBM POWER7 multicore server processor, IBM Journal of Research and Development, vol.551, issue.3, pp.1-1, 2011.

J. Yang, H. Cui, J. Wu, Y. Tang, and G. Hu, Determinism is not enough: Making parallel programs reliable with stable multithreading, Communications of the ACM, pp.2014-2020

J. E. Stone, D. Gohara, and G. Shi, OpenCL: A Parallel Programming Standard for Heterogeneous Computing Systems, Computing in Science & Engineering, vol.12, issue.3, pp.66-73, 2010.
DOI : 10.1109/MCSE.2010.69

G. Kahn, The Semantics of Simple Language for Parallel Programming, IFIP Congress, pp.471-475, 1974.

J. B. Dennis, First version of a data flow procedure language, Symposium on Programming, pp.241-271, 1974.
DOI : 10.1007/3-540-06859-7_145

. Intel, Intel Xeon Phi Coprocessor Instruction Set Architecture Reference Manual, p.93, 2012.

B. Dupont-de-dinechin, D. Van-amstel, M. Poulhiès, and G. Lager, Time-critical Computing on a Single-chip Massively Parallel Processor European Design and Automation Association, Proceedings of the Conference on Design, pp.1-97, 2014.

D. J. Sorin, M. D. Hill, and D. A. Wood, A Primer on Memory Consistency and Cache Coherence, Synthesis Lectures on Computer Architecture, vol.6, issue.3, p.10, 2011.
DOI : 10.2200/S00346ED1V01Y201104CAC016

U. Drepper, What Every Programmer Should Know About Memory, p.11, 2007.

. Muller, Memory Performance and Cache Coherency Effects on an Intel Nehalem Multiprocessor System, Proceedings of the 2009 18th International Conference on Parallel Architectures and Compilation Techniques, PACT '09, pp.261-270, 2009.

R. Lachaize, B. Lepers, and V. Quéma, MemProf: A Memory Profiler for NUMA Multicore Systems, Proceedings of the 2012 USENIX Conference on Annual Technical Conference, USENIX ATC'12, pp.5-5
URL : https://hal.archives-ouvertes.fr/hal-00945731

T. David, R. Guerraoui, and V. Trigonakis, Everything you always wanted to know about synchronization but were afraid to ask, Proceedings of the Twenty-Fourth ACM Symposium on Operating Systems Principles, SOSP '13, pp.33-48, 2013.
DOI : 10.1145/2517349.2522714

Z. Majo and T. R. Gross, Memory system performance in a NUMA multicore multiprocessor, Proceedings of the 4th Annual International Conference on Systems and Storage, SYSTOR '11, pp.1-12, 2011.
DOI : 10.1145/1987816.1987832

Z. Majo and T. R. Gross, Memory Management in NUMA Multicore Systems: Trapped Between Cache Contention and Interconnect Overhead, Proceedings of the International Symposium on Memory Management, pp.11-20, 2011.

S. Blagodurov, S. Zhuravlev, M. Dashti, and A. Fedorova, A case for NUMA-aware contention management on multicore systems, Proceedings of the 19th international conference on Parallel architectures and compilation techniques, PACT '10, pp.1-1
DOI : 10.1145/1854273.1854350

B. Verghese, S. Devine, A. Gupta, and M. Rosenblum, Operating system support for improving data locality on CC-NUMA compute servers, ACM SIGOPS Operating Systems Review, vol.30, issue.5, pp.279-289, 1996.
DOI : 10.1145/248208.237205

P. Van, R. , and S. Haridi, Concepts, Techniques, and Models of Computer Programming, pp.14-15, 2004.

A. Edward, T. M. Lee, and . Parks, Dataflow process networks, Proceedings of the IEEE, vol.83, issue.5, pp.773-801, 1995.

A. Edward, D. G. Lee, and . Messerschmitt, Synchronous data flow, Proceedings of the IEEE, vol.75, issue.28, pp.1235-1245

E. Ashford, L. , and D. G. Messerschmitt, Static Scheduling of Synchronous Data Flow Programs for Digital Signal Processing, IEEE Trans. Comput, vol.36, issue.29, pp.24-35, 1987.

W. Thies and S. Amarasinghe, An empirical characterization of stream programs and its implications for language and compiler design, Proceedings of the 19th international conference on Parallel architectures and compilation techniques, PACT '10, pp.2010-2031
DOI : 10.1145/1854273.1854319

G. Insa-de-lyon, M. Bilsen, R. Engels, J. A. Lauwereins, and . Peperstraete, Cyclostatic data flow, these.pdf © [M. Selva], [2015] Acoustics, Speech, and Signal Processing, pp.3255-3258, 1995.

B. D. Theelen, M. C. Geilen, T. Basten, J. P. Voeten, S. V. Gheorghita et al., A scenario-aware data flow model for combined long-run average and worst-case performance analysis. In Formal Methods and Models for Co-Design, MEMOCODE '06. Proceedings. Fourth ACM and IEEE International Conference on, pp.185-194, 2006.

P. Fradet, A. Girault, and P. Poplavko, SPDF: A schedulable parametric data-flow MoC, 2012 Design, Automation & Test in Europe Conference & Exhibition (DATE), pp.769-774
DOI : 10.1109/DATE.2012.6176572

URL : https://hal.archives-ouvertes.fr/hal-00744376

M. Dardaillon, K. Marquet, T. Risset, J. Martin, and H. Charles, A compilation flow for parametric dataflow, Proceedings of the 2014 International Conference on Compilers, Architecture and Synthesis for Embedded Systems, CASES '14, pp.1-8
DOI : 10.1145/2656106.2656110

URL : https://hal.archives-ouvertes.fr/hal-01048649

I. Amer, C. Lucarz, G. Roquier, M. Mattavelli, M. Raulet et al., Reconfigurable video coding on multicore, IEEE Signal Processing Magazine, vol.26, issue.6, pp.113-123, 2009.
DOI : 10.1109/MSP.2009.934107

URL : https://hal.archives-ouvertes.fr/hal-00429360

H. Yviquel, A. Lorence, K. Jerbi, G. Cocherel, A. Sanchez et al., Orcc, Proceedings of the 21st ACM international conference on Multimedia, MM '13, pp.863-866
DOI : 10.1145/2502081.2502231

URL : https://hal.archives-ouvertes.fr/hal-01059858

J. Gorin, M. Raulet, Y. Cheng, H. Lin, N. Siret et al., An RVC dataflow description of the AVC Constrained Baseline Profile decoder, 2009 16th IEEE International Conference on Image Processing (ICIP), pp.753-756, 2009.
DOI : 10.1109/ICIP.2009.5414265

E. Bezati, M. Mattavelli, and M. Raulet, RVC-CAL dataflow implementations of MPEG AVC/H.264 CABAC decoding, 2010 Conference on Design and Architectures for Signal and Image Processing (DASIP), pp.207-213, 2010.
DOI : 10.1109/DASIP.2010.5706266

URL : https://hal.archives-ouvertes.fr/hal-00565297

M. Chavarrias, F. Pescador, M. Garrido, E. Juárez, and M. Raulet, A DSP-Based HEVC decoder implementation using an actor language dataflow model Consumer Electronics, IEEE Transactions on, vol.59, issue.4, pp.839-847, 2013.

H. Salunkhe, O. Moreira, and K. Van-berkel, Modecontrolled Dataflow Based Modeling & Analysis of a 4G-LTE Receiver European Design and Automation Association, Proceedings of the Conference on Design, pp.1-212, 2014.

M. Pelcat, S. Aridhi, J. Piat, and J. Nezan, Physical Layer Multi-Core Prototyping: A Dataflow-Based Approach for LTE eNodeB, pp.2012-2035
DOI : 10.1007/978-1-4471-4210-2

URL : https://hal.archives-ouvertes.fr/hal-00739957

S. Junaid-jameel-ahmad, A. R. Li, T. Sadeghi, and . Schneider, CTL: A Platform-Independent Crypto Tools Library Based on Dataflow Programming Paradigm, Proceedings of 16th International Conference Financial Cryptography and Data Security, pp.2012-2012, 2012.

A. Ghamarian, M. C. Geilen, S. Stuijk, T. Basten, A. J. Moonen et al., Throughput Analysis of Synchronous Data Flow Graphs, Application of Concurrency to System Design Sixth International Conference on, pp.25-36, 2006.

S. Stuijk, T. Basten, M. C. Geilen, and H. Corporaal, Multiprocessor Resource Allocation for Throughput-Constrained Synchronous Dataflow Graphs, Design Automation Conference, pp.777-782, 2007.

A. H. Ghamarian, M. C. Geilen, T. Basten, and S. Stuijk, Parametric Throughput Analysis of Synchronous Data Flow Graphs, Design, Automation and Test in Europe DATE '08, pp.116-121, 2008.

A. Bonfietti, L. Benini, M. Lombardi, and M. Milano, An efficient and complete approach for throughput-maximal SDF allocation and scheduling on multi-core platforms, 2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010), pp.897-902, 2010.
DOI : 10.1109/DATE.2010.5456924

S. Carpov, L. Cudennec, and R. Sirdey, Throughput Constrained Parallelism Reduction in Cyclo-static Dataflow Applications, Procedia Computer Science, vol.18, pp.30-39
DOI : 10.1016/j.procs.2013.05.166

URL : https://hal.archives-ouvertes.fr/hal-00832508

R. L. Collins and L. P. Carloni, Flexible filters, Proceedings of the seventh ACM international conference on Embedded software, EMSOFT '09, pp.205-214, 2009.
DOI : 10.1145/1629335.1629363

Y. Choi, C. Li, D. Da-silva, A. Bivens, and E. Schenfeld, Adaptive task duplication using on-line bottleneck detection for streaming applications, Proceedings of the 9th conference on Computing Frontiers, CF '12, pp.163-172
DOI : 10.1145/2212908.2212932

C. Min and Y. I. Eom, DANBI: Dynamic Scheduling of Irregular Stream Programs for Many-core Systems, Proceedings of the 22Nd International Conference on Parallel Architectures and Compilation Techniques , PACT '13, pp.189-200

H. Yviquel, E. Casseau, M. Raulet, P. Jääskeläinen, and J. Takala, Towards run-time actor mapping of dynamic dataflow programs onto multi-core platforms, 2013 8th International Symposium on Image and Signal Processing and Analysis (ISPA), pp.725-730, 2013.
DOI : 10.1109/ISPA.2013.6703834

URL : https://hal.archives-ouvertes.fr/hal-00909408

W. Thies, M. Karczmarek, and S. Amarasinghe, StreamIt: A Language for Streaming Applications, Proceedings of the 11th International Conference on Compiler Construction, pp.179-196, 2002.
DOI : 10.1007/3-540-45937-5_14

Z. Fang, C. Venkatramani, R. Wagle, and K. Schwan, Cache Topology Aware Mapping of Stream Processing Applications onto CMPs, Distributed Computing Systems (ICDCS), 2013 IEEE 33rd International Conference on, pp.52-61, 2013.

H. Amir, Y. Hormati, M. Choi, R. Kudlur, T. Rabbah et al., Flextream: Adaptive Compilation of Streaming Applications for Heterogeneous Architectures, Proceedings of the 2009 18th International Conference on Parallel Architectures and Compilation Techniques, pp.214-223, 2009.

I. Michael and . Gordon, Compiler Techniques for Scalable Performance of Stream Programs on Multicore Architectures, pp.53-78

M. Wipliez, Compilation infrastructure for dataflow programs. Theses, INSA de Rennes, p.28, 2010.
URL : https://hal.archives-ouvertes.fr/tel-00598914

M. Wipliez and M. Raulet, Classification and transformation of dynamic dataflow programs. Design and Architectures for Signal and, pp.303-310, 2010.
URL : https://hal.archives-ouvertes.fr/hal-00565290

R. M. Karp, A characterization of the minimum cycle mean in a digraph, Discrete Mathematics, p.28, 1978.

C. Tan, A hybrid static/dynamic approach to scheduling stream programs, pp.29-97, 2009.

S. Shuvra, E. A. Bhattacharyya, P. K. Lee, and . Murthy, Software Synthesis from Dataflow Graphs, p.31, 1996.

W. Thies, Language and compiler support for stream programs, p.33, 2009.

I. Michael, W. Gordon, S. Thies, and . Amarasinghe, Exploiting Coarse-grained Task, Data, and Pipeline Parallelism in Stream Programs, SIGARCH Comput. Archit. News, vol.34, issue.5, pp.151-162, 2006.

S. Casale-brunet, E. Bezati, C. Alberti, G. Roquier, M. Mattavelli et al., Design space exploration and implementation of RVC-CAL applications using the TURNUS framework, Design and Architectures for Signal and Image Processing (DASIP), 2013 Conference on, pp.341-342, 2013.

S. Sameer, A. D. Shende, and . Malony, The Tau Parallel Performance System, Int. J. High Perform. Comput. Appl, vol.20, issue.2, pp.287-311, 2006.

C. Mccurdy and J. Vetter, Memphis: Finding and fixing NUMA-related performance problems on multi-core platforms, 2010 IEEE International Symposium on Performance Analysis of Systems & Software (ISPASS), pp.47-98, 2010.
DOI : 10.1109/ISPASS.2010.5452060

X. Liu and J. Mellor-crummey, A Tool to Analyze the Performance of Multithreaded Programs on NUMA Architectures, Proceedings of the 19th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming , PPoPP '14, pp.259-272

A. Drebes, P. Antoniu, K. Heydemann, A. Cohen, and N. Drach, Aftermath: A graphical tool for performance analysis and debugging of fine-grained task-parallel programs and run-time systems, Seventh Workshop on Programmability Issues for Heterogeneous Multicores (MULTIPROG-2014), p.48, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01136508

A. Pop and A. Cohen, OpenStream: Expressiveness and Dataflow Compilation of OpenMP Streaming Programs, ACM Trans. Archit
URL : https://hal.archives-ouvertes.fr/hal-00786675

S. M. Farhad, Y. Ko, B. Burgstaller, and B. Scholz, Profile-guided deployment of stream programs on multicores, Proceedings of the 13th ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, Tools and Theory for Embedded Systems, LCTES '12, pp.79-88
DOI : 10.1145/2248418.2248430

T. Goubier, R. Sirdey, S. Louise, and V. David, Sigma-C: A Programming Model and Language for Embedded Manycores, Algorithms and Architectures for Parallel Processing, 7016 of Lecture Notes in Computer Science, pp.385-394

H. Yviquel, E. Casseau, M. Wipliez, and M. Raulet, Efficient multicore scheduling of dataflow process networks, 2011 IEEE Workshop on Signal Processing Systems (SiPS), pp.198-203, 2011.
DOI : 10.1109/SiPS.2011.6088974

URL : https://hal.archives-ouvertes.fr/hal-00687750

J. Sugerman, K. Fatahalian, S. Boulos, K. Akeley, and P. Hanrahan, GRAMPS, ACM Transactions on Graphics, vol.28, issue.1, pp.1-4, 2009.
DOI : 10.1145/1477926.1477930

S. Stuijk, M. Geilen, and T. Basten, A Predictable Multiprocessor Design Flow for Streaming Applications with Dynamic Behaviour, 2010 13th Euromicro Conference on Digital System Design: Architectures, Methods and Tools, pp.2010-53
DOI : 10.1109/DSD.2010.31

J. Dongarra, K. London, S. Moore, P. Mucci, and D. Terpstra, Using PAPI for Hardware Performance Monitoring on Linux Systems, Conference on Linux Clusters: The HPC Revolution, Linux Clusters Institute, pp.62-98, 2001.

P. M. Carpenter, A. Ramirez, and E. Ayguade, Mapping stream programs onto heterogeneous multiprocessor systems, Proceedings of the 2009 international conference on Compilers, architecture, and synthesis for embedded systems, CASES '09, pp.57-66, 2009.
DOI : 10.1145/1629395.1629406

URL : http://doi.acm.org/10.1145/1629395.1629406

M. Sardar, Y. Farhad, B. Ko, B. Burgstaller, and . Scholz, Orchestration by Approximation: Mapping Stream Programs Onto Multicore Architectures, SIGPLAN Not, vol.47, issue.4, pp.357-368, 2011.

G. J. Sullivan, J. Ohm, W. Han, and T. Wiegand, Overview of the High Efficiency Video Coding (HEVC) Standard. Circuits and Systems for Video Technology, IEEE Transactions on, vol.22, issue.12, pp.1649-1668, 2012.

G. Ofenbeck, R. Steinmann, V. Caparros, D. G. Spampinato, and M. Puschel, Applying the roofline model, 2014 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp.76-85, 2014.
DOI : 10.1109/ISPASS.2014.6844463

R. Gu, J. Janneck, M. Raulet, . Shuvras, and . Bhattacharyya, Exploiting Statically Schedulable Regions in Dataflow Programs, Journal of Signal Processing Systems, vol.55, issue.6, pp.129-142, 2011.
DOI : 10.1007/s11265-009-0445-1

URL : https://hal.archives-ouvertes.fr/hal-00398829