Using Machine Learning to Focus Iterative Optimization, International Symposium on Code Generation and Optimization (CGO'06), 2006. ,
DOI : 10.1109/CGO.2006.37
Instancewise array dependence test for recursive programs ,
URL : https://hal.archives-ouvertes.fr/hal-01257308
Finding effective compilation sequences, Proc. Languages, Compilers, and Tools for Embedded Systems (LCTES), pp.231-239, 2004. ,
The fortress language specification 0.866, 2006. ,
Integrated predicated and speculative execution in the IMPACT EPIC architecture, Proceedings of the 25th Intl. Symp. on Computer Architecture, 1998. ,
Scanning polyhedra with DO loop, ACM Symp. on Principles and Practice of Parallel Programming (PPoPP'91), pp.39-50, 1991. ,
STAPL: An Adaptive, Generic Parallel C++ Library, Languages and Compilers for Parallel Computing (LCPC'01), pp.193-208, 2001. ,
DOI : 10.1007/3-540-35767-X_13
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.207.3775
Automatic translation of FORTRAN programs to vector form, ACM Transactions on Programming Languages and Systems, vol.9, issue.4, pp.491-542, 1987. ,
DOI : 10.1145/29873.29875
Optimizing Compilers for Modern Architectures, 2002. ,
An Automata-Theoretic Modelization of Instancewise Program Analysis: Transducers as mappings from Instances to Memory Locations, 2004. ,
Synthesizing transformations for locality enhancement of imperfectly-nested loop nests, ACM Supercomputing'00, 2000. ,
Type inclusion constraints and type inference, Proceedings of the conference on Functional programming languages and computer architecture , FPCA '93, pp.31-41, 1993. ,
DOI : 10.1145/165180.165188
Optimizing matrix multiply using PHiPAC: A portable, high-performance, ANSI C coding methodology, ACM Intl. Conf. on Supercomputing (ICS'97), pp.340-347, 1997. ,
Dependence Analysis for Supercomputing, 1988. ,
Loop Transformations for Restructuring Compilers: The Foundations, 1992. ,
Array Dataflow Analysis in Presence of Non-affine Constraints, 1998. ,
Efficient code generation for automatic parallelization and optimization, ISPDC'2 IEEE International Symposium on Parallel and Distributed Computing, 2003. ,
Oceans -optimising compilers for embedded applications, Euro-Par'99, pp.1171-1775, 1999. ,
Evaluating support for global address space languages on the Cray X1, Proceedings of the 18th annual international conference on Supercomputing , ICS '04, 2004. ,
DOI : 10.1145/1006209.1006236
Maximal static expansion, Proceedings of the 25th ACM SIGPLAN-SIGACT symposium on Principles of programming languages , POPL '98, pp.98-106, 1998. ,
DOI : 10.1145/268946.268955
URL : https://hal.archives-ouvertes.fr/hal-01257319
Maximal static expansion, Proceedings of the 25th ACM SIGPLAN-SIGACT symposium on Principles of programming languages , POPL '98, pp.213-243, 2000. ,
DOI : 10.1145/268946.268955
URL : https://hal.archives-ouvertes.fr/hal-01257319
The synchronous languages 12 years later, Proceedings of the IEEE, 2003. ,
Fuzzy Array Dataflow Analysis, Journal of Parallel and Distributed Computing, vol.40, issue.2, pp.210-226, 1997. ,
DOI : 10.1006/jpdc.1996.1261
URL : https://hal.archives-ouvertes.fr/hal-00551673
Putting polyhedral loop transformations to work, Languages and Compilers for Parallel Computing (LCPC'03), pp.23-30, 2003. ,
URL : https://hal.archives-ouvertes.fr/inria-00071681
Parallel programming with Polaris, IEEE Computer, issue.12, pp.2978-82, 1996. ,
Transductions and Context-Free Languages. Teubner, 1979. ,
The Foundations of Esterel, 2000. ,
Improving Data Locality by Chunking, CC Intl. Conf. on Compiler Construction, number 2622 in LNCS, pp.320-335, 2003. ,
DOI : 10.1007/3-540-36579-6_23
URL : https://hal.archives-ouvertes.fr/inria-00001055
More Legal Transformations for Locality, Euro-Par'10, number 3149 in LNCS, pp.272-283, 2004. ,
DOI : 10.1007/978-3-540-27866-5_36
URL : https://hal.archives-ouvertes.fr/inria-00001056
Adjusting a program transformation for legality. Parallel processing letters, pp.3-17, 2005. ,
Probabilistic source-level optimisation of embedded systems software, ACM SIGPLAN/SIGBED Conference on Languages, Compilers, and Tools for Embedded Systems (LCTES'05), 2005. ,
Automatic intra-register vectorization for the intel architecture, International Journal of Parallel Programming, vol.30, issue.2, pp.65-98, 2002. ,
DOI : 10.1023/A:1014230429447
Ptolemy: A Framework for Simulating and Prototyping Heterogeneous Systems, Int. J. in Computer Simulation, vol.4, issue.2, pp.155-182, 1994. ,
DOI : 10.1016/B978-155860702-6/50048-X
Iterative compilation in a non-linear optimisation space, Proc. Workshop on Profile and Feedback Directed Compilation, 1998. ,
COSY communication IP's, Proceedings of the 37th conference on Design automation , DAC '00, pp.406-409, 2000. ,
DOI : 10.1145/337292.337515
Synchronous programming with events and relations: the SIGNAL language and its semantics, Science of Computer Programming, vol.16, issue.2, pp.103-149, 1991. ,
DOI : 10.1016/0167-6423(91)90001-E
Abstract interpretation by dynamic partitioning, J. of Functional Programming, vol.2, issue.4, pp.407-423, 1992. ,
Salto: System for assembly-language transformation and optimization, Workshop on Compilers for Parallel Computers (CPC'96), 1996. ,
URL : https://hal.archives-ouvertes.fr/inria-00073718
Precise widening operators for convex polyhedra, Int. Symp. on Static Analysis (SAS'03), 2003. ,
Embedded control: From asynchrony to synchrony and back, EMSOFT'01, 2001. ,
Abstract interpretation, Proceedings of the 4th ACM SIGACT-SIGPLAN symposium on Principles of programming languages , POPL '77, pp.238-252, 1977. ,
DOI : 10.1145/512950.512973
URL : https://hal.archives-ouvertes.fr/inria-00528590
Instancewise reaching definition analysis for recursive programs using context-free transductions, Parallel Architectures and Compilation Techniques (PACT'98), pp.332-340, 1998. ,
URL : https://hal.archives-ouvertes.fr/hal-01257320
Data-flow analysis of recursive structures, Proc. of the 6 th Workshop on Compilers for Parallel Computers (CPC'96), pp.181-192, 1996. ,
URL : https://hal.archives-ouvertes.fr/hal-01257322
Deep jam: conversion of coarse-grain parallelism to instruction-level and vector parallelism for irregular applications, 14th International Conference on Parallel Architectures and Compilation Techniques (PACT'05), pp.291-300, 2005. ,
DOI : 10.1109/PACT.2005.16
URL : https://hal.archives-ouvertes.fr/hal-01257293
Ambient Intelligence: Impact on Embedded-System Design, chapter Application Domain-Driven System Design for Pervasive Video Processing, pp.251-270, 2003. ,
Synchronization of periodic clocks, Proceedings of the 5th ACM international conference on Embedded software , EMSOFT '05, pp.339-342, 2005. ,
DOI : 10.1145/1086228.1086289
URL : https://hal.archives-ouvertes.fr/hal-01257295
N-sychronous Kahn networks, 33 th ACM Symp. on Principles of Programming Languages (PoPL'06), pp.180-193, 2006. ,
DOI : 10.1145/1111037.1111054
In search of a program generator to implement generic transformations for high-performance computing, Science of Computer Programming, vol.62, issue.1, pp.25-46, 2004. ,
DOI : 10.1016/j.scico.2005.10.013
URL : https://hal.archives-ouvertes.fr/hal-01257287
Improving software pipelining with unroll-and-jam, Proceedings of HICSS-29: 29th Hawaii International Conference on System Sciences, 1996. ,
DOI : 10.1109/HICSS.1996.495462
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.1.9319
Efficiently computing static single assignment form and the control dependence graph, ACM Trans. on Programming Languages and Systems, vol.13, issue.4, pp.451-490, 1991. ,
Towards a higher-order synchronous data-flow language, Proceedings of the fourth ACM international conference on Embedded software , EMSOFT '04, 2004. ,
DOI : 10.1145/1017753.1017792
Facilitating the search for compositions of program transformations, Proceedings of the 19th annual international conference on Supercomputing , ICS '05, pp.151-160, 2005. ,
DOI : 10.1145/1088149.1088169
URL : https://hal.archives-ouvertes.fr/hal-01257296
A Polyhedral Approach to Ease the Composition of Program Transformations, Euro-Par'04, number 3149 in LNCS, pp.292-303, 2004. ,
DOI : 10.1007/978-3-540-27866-5_38
URL : https://hal.archives-ouvertes.fr/hal-01257301
Automatic discovery of linear restraints among variables of a program, Proceedings of the 5th ACM SIGACT-SIGPLAN symposium on Principles of programming languages , POPL '78, pp.84-96, 1978. ,
DOI : 10.1145/512760.512770
Globally-Asynchronous Locally-Synchronous Systems, 1984. ,
The ParaScope parallel programming environment, Proceedings of the IEEE, pp.244-263, 1993. ,
Optimizing strategies for telescoping languages, Proceedings of the 15th international conference on Supercomputing , ICS '01, pp.92-101, 2001. ,
DOI : 10.1145/377792.377812
Introduction to Algorithms, 1989. ,
Architectural support for scalable speculative parallelization in shared-memory multiprocessors, ACM/IEEE Intl. Symp. on Computer Architecture (ISCA'00), pp.13-24, 2000. ,
Program Analysis and Transformation: from the Polytope Model to Formal Languages, 1999. ,
Automatic parallelization of while-loops using speculative execution, Intl. J. of Parallel Programming, vol.23, issue.2, pp.191-219, 1995. ,
Reasoning About Program Transformations, 2002. ,
Semantic foundations of programs analysis, 1981. ,
Program analysis: The abstract interpretation perspective, ACM Computing Surveys, issue.4es, p.28, 1996. ,
Synchronous Kahn networks, ICFP '96: Proceedings of the 1 st ACM SIGPLAN Intl. Conf. on Functional programming, pp.226-238, 1996. ,
DOI : 10.1145/232627.232651
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.15.9168
Clocks as First Class Abstract Types, EMSOFT'03, pp.134-155, 2003. ,
DOI : 10.1007/978-3-540-45212-6_10
Array Region Analyses and Applications, 1996. ,
Optimizing for reduced code space using genetic algorithms, Proc. Languages, Compilers, and Tools for Embedded Systems (LCTES), pp.1-9, 1999. ,
Implementing Multi-stage Languages Using ASTs, Gensym, and Reflection, ACM SIGPLAN/SIGSOFT Intl. Conf. Generative Programming and Component Engineering (GPCE'03), pp.57-76, 2003. ,
DOI : 10.1007/978-3-540-39815-8_4
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.10.6148
Scheduling Structured Systems, EuroPar'99, pp.409-412, 1999. ,
DOI : 10.1007/3-540-48311-X_53
Open-source compiler technology for source-to-source optimization ,
A Language for the Compact Representation of Multiple Program Versions, Languages and Compilers for Parallel Computing (LCPC'05), 2005. ,
DOI : 10.1007/978-3-540-69330-7_10
URL : https://hal.archives-ouvertes.fr/hal-00141067
Interprocedural may-alias analysis for pointers: beyond k-limiting, ACM Symp. on Programming Language Design and Implementation (PLDI'94), pp.230-241, 1994. ,
Loop shifting for loop parallelization, International Journal of Parallel Programming, vol.28, issue.5, pp.499-534, 2000. ,
DOI : 10.1023/A:1007506711786
ProfileMe: Hardware support for instruction level profiling on out-of-order processors, Proceedings of the 30 th International Symposium on Microarchitecture, NC, 1997. ,
YAPI, Proceedings of the 37th conference on Design automation , DAC '00, 2000. ,
DOI : 10.1145/337292.337511
Techniques for the translation of MATLAB programs into Fortran 90, ACM Transactions on Programming Languages and Systems, vol.21, issue.2, pp.286-323, 1999. ,
DOI : 10.1145/316686.316693
Mapping uniform loop nests onto distributed memory architectures, Parallel Computing, vol.20, issue.5, pp.679-710, 1994. ,
DOI : 10.1016/0167-8191(94)90001-9
URL : https://hal.archives-ouvertes.fr/hal-00857077
Scheduling and Automatic Parallelization, Birkhaüser, 2000. ,
DOI : 10.1007/978-1-4612-1362-8
URL : https://hal.archives-ouvertes.fr/hal-00856645
Combining Retiming and Scheduling Techniques for Loop Parallelization and Loop Tiling, Parallel Processing Letters, vol.07, issue.04, pp.379-392, 1997. ,
DOI : 10.1142/S0129626497000383
URL : https://hal.archives-ouvertes.fr/hal-00856890
Static scheduling of synchronous data flow programs for digital signal processing, IEEE Trans. Computers, vol.36, issue.1, pp.24-25, 1987. ,
Word Processing in Groups, 1992. ,
An Automata-Theoretic Approach to Interprocedural Data-Flow Analysis, FOSSACS'99, 1999. ,
DOI : 10.1007/3-540-49019-1_2
On Relations Defined by Generalized Finite Automata, IBM Journal of Research and Development, vol.9, issue.1, pp.45-68, 1965. ,
DOI : 10.1147/rd.91.0047
Efficient algorithms for pre* and post* on interprocedural parallel flow graphs, Proceedings of the 27th ACM SIGPLAN-SIGACT symposium on Principles of programming languages , POPL '00, pp.1-11, 2000. ,
DOI : 10.1145/325694.325697
Vectorization for simd architectures with alignment constraints, ACM Symp. on Programming Language Design and Implementation (PLDI '04), pp.82-93, 2004. ,
A Practical Method for Quickly Evaluating Program Optimizations, Intl. Conf. on High Performance Embedded Architectures and Compilers (HiPEAC'05), number 3793 in LNCS, pp.29-46, 2005. ,
DOI : 10.1007/11587514_4
URL : https://hal.archives-ouvertes.fr/inria-00001054
Array expansion, ACM Intl. Conf. on Supercomputing, pp.429-441, 1988. ,
Parametric integer programming, RAIRO Recherche Opérationnelle, vol.22, pp.243-268, 1988. ,
Dataflow analysis of scalar and array references, Intl. J. of Parallel Programming, vol.20, issue.1, pp.23-53, 1991. ,
Some efficient solutions to the affine scheduling problem, part II, multidimensional time See also Part I, one dimensional time, Intl. J. of Parallel Programming, vol.21, issue.215, pp.389-420315, 1992. ,
A parallelization framework for recursive tree programs, EuroPar'98, 1998. ,
Scalable and structured scheduling, Intl. J. of Parallel Programming, vol.28, 2006. ,
On index set splitting, Parallel Architectures and Compilation Techniques (PACT'99), 1999. ,
FFTW: an adaptive software architecture for the FFT, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181), pp.1381-1384, 1998. ,
DOI : 10.1109/ICASSP.1998.681704
Shape types, Proceedings of the 24th ACM SIGPLAN-SIGACT symposium on Principles of programming languages , POPL '97, pp.27-39, 1997. ,
DOI : 10.1145/263699.263706
Evaluating Iterative Compilation, 11 th Languages and Compilers for Parallel Computing, 2002. ,
DOI : 10.1007/11596110_24
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.1.4652
Generation of synchronous code for automatic parallelization of while loops, EuroPar'95, pp.315-326, 1995. ,
DOI : 10.1007/BFb0020474
Is it a tree, a DAG, or a cyclic graph? A shape analysis for heap-directed pointers in C, Proceedings of the 23rd ACM SIGPLAN-SIGACT symposium on Principles of programming languages , POPL '96, pp.1-15, 1996. ,
DOI : 10.1145/237721.237724
DiST: A simple, reliable and scalable method to significantly reduce processor architecture simulation time, Intl. Conf. on Measurement and Modeling of Computer Systems, ACM SIGMETRICS'03, 2003. ,
URL : https://hal.archives-ouvertes.fr/hal-01257307
Group-based fields See also " Design and Implementation of 81/2, a Declarative Data-Parallel Language, RR 1012, Proc. of the Parallel Symbolic Languages and Systems, 1995. ,
Interconnect and memory organization in SOCs for advanced set-top boxes and TV ? evolution, analysis, and trends, Interconnect-Centric Design for Advanced SoC and NoC, chapter 15, pp.399-423, 2004. ,
Hardware design methodology with the Alpha language, FDL'01, 2001. ,
Beyond induction variables: detecting and classifying sequences using a demand-driven SSA form, ACM Transactions on Programming Languages and Systems, vol.17, issue.1, pp.85-122, 1995. ,
DOI : 10.1145/200994.201003
Semi-automatic composition of loop transformations for deep parallelism and memory hierarchies, Intl. J. of Parallel Programming, vol.34, issue.3, 2006. ,
The interprocedural analysis and automatic parallelisation of Scheme programs, Lisp and Symbolic Computation, pp.176-396, 1989. ,
UFC: a global trade-off strategy for loop unrolling for VLIW architectures, Proc. Compilers for Parallel Computers (CPC), pp.59-70, 2003. ,
The synchronous data flow programming language LUSTRE, Proceedings of the IEEE, vol.79, issue.9, pp.1305-1320, 1991. ,
DOI : 10.1109/5.97300
Abstractions for recursive pointer data structures: improving the analysis and transformation of imperative programs, ACM Symp. on Programming Language Design and Implementation (PLDI'92), pp.249-260, 1992. ,
Composable memory transactions, ACM Symp. on Principles and Practice of Parallel Programming (PPoPP'05), 2005. ,
Semantical interprocedural parallelization: An overview of the pips project, ACM Intl. Conf. on Supercomputing (ICS'91), 1991. ,
URL : https://hal.archives-ouvertes.fr/hal-00984684
The semantics of a simple language for parallel programming, Information processing, pp.471-475, 1974. ,
Optimization within a unified transformation framework, 1996. ,
The effect of cache models on iterative compilation for combined tiling and unrolling, Parallel Architectures and Compilation Techniques (PACT'00, 2001. ,
Iterative compilation in program optimization, Proc. CPC'10 (Compilers for Parallel Computers), pp.35-44, 2000. ,
Drawing Graphs With dot, 2002. ,
Code generation for multiple mappings, Proceedings Frontiers '95. The Fifth Symposium on the Frontiers of Massively Parallel Computation, 1995. ,
DOI : 10.1109/FMPC.1995.380437
Graph types, ACM Symp. on Principles of Programming Languages (PoPL'93), pp.196-205, 1993. ,
DOI : 10.7146/dpb.v21i421.7952
Array SSA form and its use in parallelization, Proceedings of the 25th ACM SIGPLAN-SIGACT symposium on Principles of programming languages , POPL '98, pp.107-120, 1998. ,
DOI : 10.1145/268946.268956
Automatic storage management for parallel programs, Parallel Computing, vol.24, issue.3-4, pp.649-671, 1998. ,
DOI : 10.1016/S0167-8191(98)00029-5
A heuristic search algorithm based on unified transformation framework, 7th Intl. Workshop on High Performance Scientific and Engineering Computing (HPSEC-05), 2005. ,
A dynamically tuned sorting library, ACM Conf. on Code Generation and Optimization (CGO'04), 2004. ,
Communication-free parallelization via affine transformations, 24 th ACM Symp. on Principles of Programming Languages, pp.201-214, 1997. ,
DOI : 10.1007/BFb0025873
Loop induction variable canonicalization in parallelizing compilers, Proceedings of the 1996 Conference on Parallel Architectures and Compilation Techniques (PACT '96), p.228, 1996. ,
Blocking and array contraction across arbitrarily nested loops using affine partitioning, ACM Symp. on Principles and Practice of Parallel Programming (PPoPP'01), pp.102-112, 2001. ,
Adaptive java optimisation using instance-based learning, Proceedings of the 18th annual international conference on Supercomputing , ICS '04, pp.237-246, 2004. ,
DOI : 10.1145/1006209.1006243
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.58.5381
A singular loop transformation framework based on non-singular matrices, International Journal of Parallel Programming, vol.16, issue.4, pp.183-205, 1994. ,
DOI : 10.1007/BF02577874
Retiming synchronous circuitry, Algorithmica, vol.9, issue.No. 1, 1991. ,
DOI : 10.1007/BF01759032
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.368.3222
Transition Phase Classification and Prediction, 11th International Symposium on High-Performance Computer Architecture, 2005. ,
DOI : 10.1109/HPCA.2005.39
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.102.4905
POSH, Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming , PPoPP '06, pp.158-167, 2006. ,
DOI : 10.1145/1122971.1122997
POLYCHRONY for System Design, Systems and Computers, Special Issue on Application Specific Hardware Design, 2003. ,
DOI : 10.1142/S0218126603000763
URL : https://hal.archives-ouvertes.fr/hal-00730480
Parameterized polyhedra and their vertices, Intl. J. of Parallel Programming, vol.25, issue.6, 1997. ,
URL : https://hal.archives-ouvertes.fr/inria-00534851
Array dataflow analysis and its use in array privatization, 20 th ACM Symp. on Principles of Programming Languages, pp.2-15, 1993. ,
Mathematical Theory of Computation, 1974. ,
A Machine Learning Approach to Automatic Production of Compiler Heuristics, Proc. AIMSA, number 2443 in LNCS, pp.41-50, 2002. ,
DOI : 10.1007/3-540-46148-5_5
Timing analysis model for network based multiprocessor systems, Proc. of ProRISC, 15th annual Workshop of Circuits, System and Signal Processing, pp.91-99, 2004. ,
Architectural semantics for practical transactional memory, ACM/IEEE Intl. Symp. on Computer Architecture (ISCA'06), 2006. ,
Advanced Compiler Design & Implementation, 1997. ,
Autovectorization in GCC, Proceedings of the 2004 GCC Developers Summit, pp.105-118, 2004. ,
Principles of Program Analysis, 1999. ,
DOI : 10.1007/978-3-662-03811-6
MARS: a distributed memory approach to shared memory compilation, Proc. Language , Compilers and Runtime Systems for Scalable Computing, 1998. ,
In search of speculative thread-level parallelism, 1999 International Conference on Parallel Architectures and Compilation Techniques (Cat. No.PR00425), 1999. ,
DOI : 10.1109/PACT.1999.807576
Functional data structures Advanced Functional Programming, pp.131-158, 1996. ,
Feedback assisted iterative compiplation, Proc. LCR, 2000. ,
On context-free languages, J. of the ACM, vol.13, issue.4, pp.570-581, 1966. ,
Iterative Optimization in the Polyhedral Model: Part I, One-Dimensional Time, International Symposium on Code Generation and Optimization (CGO'07), 2007. ,
DOI : 10.1109/CGO.2007.21
URL : https://hal.archives-ouvertes.fr/hal-01257281
Graphite: Loop optimizations based on the polyhedral model for GCC, Proc. of the 4 th GCC Developper's Summit, 2006. ,
Induction Variable Analysis with Delayed Abstractions, Intl. Conf. on High Performance Embedded Architectures and Compilers (HiPEAC'05), number 3793 in LNCS, pp.218-232, 2005. ,
DOI : 10.1007/11587514_15
URL : https://hal.archives-ouvertes.fr/hal-01257294
The Data Parallel Programming Model, Number 1132 in LNCS, 1996. ,
DOI : 10.1007/3-540-61736-1
Types and Programming Languages, 2002. ,
Four generations of SPEC CPU benchmarks: what has changed and what has not, 2004. ,
The SSA Representation Framework: Semantics, Analyses and GCC Implementation, 2006. ,
Simplifying subtyping constraints, ACM Intl. Conf. on Functional Programming (ICFP'96), pp.122-133, 1996. ,
On the representation of finite deterministic 2-tape automata, Theoretical Computer Science, vol.225, issue.1-2, pp.1-63, 1999. ,
DOI : 10.1016/S0304-3975(98)00179-0
Spiral: A Generator for Platform-Adapted Libraries of Signal Processing Algorithms, Journal of High Performance Computing and Applications, special issue on Automatic Performance Tuning, pp.21-45, 2004. ,
DOI : 10.1177/1094342004041291
Towards a Systematic, Pragmatic and Architecture-Aware Program Optimization Process for Complex Processors, Proceedings of the ACM/IEEE SC2004 Conference, 2004. ,
DOI : 10.1109/SC.2004.61
URL : https://hal.archives-ouvertes.fr/hal-01257302
On Increasing Architecture Awareness in Program Optimizations to Bridge the Gap between Peak and Sustained Processor Performance - Matrix-Multiply Revisited, ACM/IEEE SC 2002 Conference (SC'02), 2002. ,
DOI : 10.1109/SC.2002.10054
The omega test: a fast and practical integer programming algorithm for dependence analysis, ACM/IEEE Conf. on Supercomputing, pp.4-13, 1991. ,
The Omega test: a fast and practical integer programming algorithm for dependence analysis, Proceedings of the 1991 ACM/IEEE conference on Supercomputing , Supercomputing '91, pp.4-13, 1991. ,
DOI : 10.1145/125826.125848
Uniform techniques for loop optimization, ACM Intl. Conf. on Supercomputing (ICS'91), pp.341-352, 1991. ,
A practical algorithm for exact array dependence analysis, Communications of the ACM, vol.35, issue.8, pp.27-47, 1992. ,
Optimizing memory usage in the polyhedral model, Institut de Recherche en Informatique et Systèmes Aléatoires, 1999. ,
Generation of efficient nested loops from polyhedra, International Journal of Parallel Programming, vol.28, issue.5, pp.469-498, 2000. ,
DOI : 10.1023/A:1007554627716
Precise interprocedural dataflow analysis via graph reachability, Proceedings of the 22nd ACM SIGPLAN-SIGACT symposium on Principles of programming languages , POPL '95, 1995. ,
DOI : 10.1145/199448.199462
The LRPD test: speculative run-time parallelization of loops with privatization and reduction parallelization, IEEE Transactions on Parallel and Distributed Systems, vol.10, issue.2, pp.160-180, 1999. ,
DOI : 10.1109/71.752782
Hybrid analysis, Proceedings of the 16th international conference on Supercomputing , ICS '02, pp.251-283, 2003. ,
DOI : 10.1145/514191.514229
The value evolution graph and its use in memory reference analysis, Proceedings. 13th International Conference on Parallel Architecture and Compilation Techniques, 2004. PACT 2004., 2004. ,
DOI : 10.1109/PACT.2004.1342558
Predicting Unroll Factors Using Supervised Classification, International Symposium on Code Generation and Optimization, 2005. ,
DOI : 10.1109/CGO.2005.29
Meta optimization: improving compiler heuristics with machine learning, ACM Symp. on Programming Language Design and Implementation (PLDI'03), pp.77-90, 2003. ,
High-level synthesis of nonprogrammable hardware accelerators, 2000. ,
Schedule-independant storage mapping for loops, ACM Symp. on Architectural Support for Programming Languages and Operating Systems (ASPLOS'98), 1998. ,
Overcoming the challenges to feedback-directed optimization, ACM SIGPLAN Workshop on Dynamic and Adaptive Compilation and Optimization, pp.1-11, 2000. ,
Program Flow Analysis: Theory and Applications, chapter Two Approaches to Interprocedural Data Flow Analysis, 1981. ,
Automatically characterizing large scale program behavior, 10th International Conference on Architectural Support for Programming Languages and Operating Systems, 2002. ,
DOI : 10.1145/605397.605403
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.118.6150
Parametric shape analysis via 3-valued logic, ACM Symp. on Principles of Programming Languages (PoPL'99), pp.105-118, 1999. ,
DOI : 10.1145/514188.514190
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.29.3161
Automatic parallelization of fortran programs in the presence of procedure calls, Proc. of the 1 st European Symp. on Programming (ESOP'86), number 213 in LNCS, pp.210-222, 1986. ,
DOI : 10.1007/3-540-16442-1_16
StreamIt: A Language for Streaming Applications, Intl. Conf. on Compiler Construction, 2002. ,
DOI : 10.1007/3-540-45937-5_14
Automatic array privatization, 6 th Languages and Compilers for Parallel Computing, number 768 in LNCS, pp.500-521, 1993. ,
DOI : 10.1007/3-540-45403-9_8
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.3.5746
The theory and practice of compiler writing, 1985. ,
Compiler optimization-space exploration, International Symposium on Code Generation and Optimization, 2003. CGO 2003., 2005. ,
DOI : 10.1109/CGO.2003.1191546
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.131.1622
A unified framework for schedule and storage optimization, ACM Symp. on Programming Language Design and Implementation (PLDI'01), pp.232-242, 2001. ,
URL : https://hal.archives-ouvertes.fr/hal-00808285
Optimizing program locality through CMEs and GAs, Oceans 2002 Conference and Exhibition. Conference Proceedings (Cat. No.02CH37362), pp.68-78, 2003. ,
DOI : 10.1109/PACT.2003.1238003
Polyhedral Code Generation in the Real World, Proceedings of the International Conference on Compiler Construction (ETAPS CC'06), pp.185-201, 2006. ,
DOI : 10.1007/11688839_16
URL : https://hal.archives-ouvertes.fr/inria-00001106
Multi-dimentsional incremetal loops fusion for data locality, ASAP, pp.17-27, 2003. ,
Violated dependence analysis, Proceedings of the 20th annual international conference on Supercomputing , ICS '06, 2006. ,
DOI : 10.1145/1183401.1183448
URL : https://hal.archives-ouvertes.fr/hal-01257290
Efficient Symbolic Analysis for Optimizing Compilers, Proceedings of the International Conference on Compiler Construction (ETAPS CC'01), pp.118-132, 2001. ,
DOI : 10.1007/3-540-45306-7_9
Active libraries: Rethinking the roles of compilers and libraries, SIAM Workshop on Object Oriented Methods for Inter-operable Scientific and Engineering Computing, 1998. ,
Stratego: A language for program transformation based on rewriting strategies. System description of Stratego 0.5, Rewriting Techniques and Applications (RTA'01), volume 2051 of Lecture Notes in Computer Science, pp.357-361, 2001. ,
On circuits and numbers, IEEE Trans. on Computers, vol.43, issue.8, pp.868-879, 1994. ,
Monotonic evolution, Proceedings of the 15th international conference on Supercomputing , ICS '01, 2001. ,
DOI : 10.1145/377792.377809
URL : https://hal.archives-ouvertes.fr/hal-01257312
Improving Locality and Parallelism in Nested Loops, 1992. ,
High Performance Compilers for Parallel Computing, 1996. ,
Constraint-Based Array Dependence Analysis, 1995. ,
Automated empirical optimizations of software and the atlas project, Parallel Computing, 2000. ,
Automating non-unimodular loop transformations for massive parallelism, Parallel Computing, vol.20, issue.5, pp.711-728, 1994. ,
A comparison of empirical and model-driven optimization, ACM Symp. on Programming Language Design and Implementation (PLDI'03), 2003. ,