Compiler and runtime support for efficient software transactional memory, Proceedings of the 2006 ACM SIGPLAN conference on Programming language design and implementation, PLDI '06, pp.26-37, 2006. ,
Automatic translation of FORTRAN programs to vector form, ACM Transactions on Programming Languages and Systems, vol.9, issue.4, pp.491-542, 1987. ,
DOI : 10.1145/29873.29875
PetaBricks: a language and compiler for algorithmic choice, PLDI '09, pp.38-49, 2009. ,
A framework for reducing the cost of instrumented code, Proceedings of the ACM SIGPLAN 2001 conference on Programming language design and implementation, PLDI '01, pp.168-179, 2001. ,
SPEComp: A New Benchmark Suite for Measuring Parallel Computer Performance, WOMPAT '01, pp.1-10, 2001. ,
DOI : 10.1007/3-540-44587-0_1
Analyzing parallel programs with pin, Computer, vol.43, issue.3, pp.34-41, 2010. ,
Dynamo: a transparent dynamic optimization system, Proceedings of the ACM SIGPLAN 2000 conference on Programming language design and implementation, PLDI '00, pp.1-12, 2000. ,
Loop Transformations for Restructuring Compilers: The Foundations, 1993. ,
Generating loops for scanning polyhedra, 2002. ,
Putting Polyhedral Loop Transformations to Work, LCPC'16 Intl. Workshop on Languages and Compilers for Parallel Computers, pp.209-225, 2003. ,
DOI : 10.1007/978-3-540-24644-2_14
URL : https://hal.archives-ouvertes.fr/inria-00071681
Code generation in the polyhedral model is easier than you think, Proceedings. 13th International Conference on Parallel Architecture and Compilation Techniques, 2004. PACT 2004., pp.7-16, 2004. ,
DOI : 10.1109/PACT.2004.1342537
URL : https://hal.archives-ouvertes.fr/hal-00017260
An updated set of basic linear algebra subprograms (BLAS), ACM Transactions on Mathematical Software, vol.28, issue.2, pp.135-151, 2001. ,
DOI : 10.1145/567806.567807
Polaris: The next generation in parallelizing compilers, Workshop on Languages and Compilers for Parallel Computing, pp.10-11 ,
Automatic Transformations for Communication-Minimized Parallelization and Locality Optimization in the Polyhedral Model, Proceedings of the Joint European Conferences on Theory and Practice of Software 17th international conference on Compiler construction, CC'08/ETAPS'08, pp.132-146, 2008. ,
DOI : 10.1007/978-3-540-78791-4_9
A practical automatic polyhedral parallelizer and locality optimizer, Proceedings of the 2008 ACM SIGPLAN conference on Programming language design and implementation, PLDI, 2008. ,
Effective automatic parallelization and locality optimization using the polyhedral model, p.3325799, 2008. ,
A new approach to debugging optimized code, ACM SIGPLAN Conf. on Programming Language Design and Implementation, PLDI, 1992. ,
An infrastructure for adaptive dynamic optimization, International Symposium on Code Generation and Optimization, 2003. CGO 2003., 2003. ,
DOI : 10.1109/CGO.2003.1191551
Softspec: Software-based speculative parallelism, ACM Workshop on Feedback-Directed and Dynamic Optimization, 2000. ,
Extending LLVM: Adding instructions, intrinsics, types, etc ,
An API for Runtime Code Patching, International Journal of High Performance Computing Applications, vol.14, issue.4, 2000. ,
DOI : 10.1177/109434200001400404
Software transactional memory, Communications of the ACM, vol.51, issue.11, pp.46-58, 2008. ,
DOI : 10.1145/1400214.1400228
Method-specific dynamic compilation using logistic regression, ACM SIGPLAN Notices, vol.41, issue.10, pp.229-240, 2006. ,
DOI : 10.1145/1167515.1167492
Rodinia: A benchmark suite for heterogeneous computing, 2009 IEEE International Symposium on Workload Characterization (IISWC), pp.44-54, 2009. ,
DOI : 10.1109/IISWC.2009.5306797
An efficient algorithm for the run-time parallelization of DOACROSS loops, Proceedings of Supercomputing '94, pp.518-527, 1994. ,
DOI : 10.1109/SUPERC.1994.344315
Adaptive Multi-versioning for OpenMP Parallelization via Machine Learning, 2009 15th International Conference on Parallel and Distributed Systems, 2009. ,
DOI : 10.1109/ICPADS.2009.77
Dynamic hot data stream prefetching for general-purpose programs, ACM SIGPLAN Conf. on Programming Language Design and Implementation, PLDI, 2002. ,
Toward efficient and robust software speculative parallelization on multiprocessors, Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming, PPoPP '03, pp.13-24, 2003. ,
Counting solutions to linear and nonlinear constraints through Ehrhart polynomials: applications to analyze and transform scientific programs, Proceedings of the 10th international conference on Supercomputing, ICS '96, pp.278-285, 1996. ,
URL : https://hal.archives-ouvertes.fr/hal-01100306
A Polyhedral Approach to Ease the Composition of Program Transformations, Euro-Par'04, no. 3149 in LNCS, pp.292-303, 2004. ,
DOI : 10.1007/978-3-540-27866-5_38
URL : https://hal.archives-ouvertes.fr/hal-01257301
Efficiently computing static single assignment form and the control dependence graph, ACM Transactions on Programming Languages and Systems, vol.13, issue.4, pp.451-490, 1991. ,
DOI : 10.1145/115372.115320
The R-LRPD test: speculative parallelization of partially parallel loops, Proceedings 16th International Parallel and Distributed Processing Symposium, p.318, 2002. ,
DOI : 10.1109/IPDPS.2002.1015493
Scheduling and Automatic Parallelization, 2000. ,
DOI : 10.1007/978-1-4612-1362-8
URL : https://hal.archives-ouvertes.fr/hal-00856645
PIPS: An Interprocedural, Extensible, Source-to-Source Compiler Infrastructure for Code Transformations and Instrumentations, Tutorial at PPoPP, Tutorial at CGO, 2010. ,
Qingyu Zhao, and Tin-Fook Ngai. A cost-driven compilation framework for speculative parallelization of sequential programs, Proceedings of the ACM SIGPLAN 2004 conference on Programming language design and implementation, PLDI '04, pp.71-81, 2004. ,
Blended analysis for performance understanding of framework-based applications, Proceedings of the 2007 international symposium on Software testing and analysis, ISSTA '07, 2007. ,
DOI : 10.1145/1273463.1273480
Vulcan binary transformation in a distributed environment, 2001. ,
Parametric integer programming, RAIRO - Operations Research, vol.22, issue.3, pp.243-268, 1988. ,
DOI : 10.1051/ro/1988220302431
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.30.9957
Dataflow analysis of array and scalar references, International Journal of Parallel Programming, vol.24, issue.4, pp.23-53, 1991. ,
DOI : 10.1007/BF01407931
Array expansion, In ACM Int. Conf. on Supercomputing, pp.429-441, 1988. ,
DOI : 10.1145/2591635.2667159
URL : https://hal.archives-ouvertes.fr/hal-01099746
Some efficient solutions to the affine scheduling problem. I. One-dimensional time, International Journal of Parallel Programming, vol.40, issue.6, pp.313-348, 1992. ,
DOI : 10.1007/BF01407835
Some efficient solutions to the affine scheduling problem. Part II. Multidimensional time, International Journal of Parallel Programming, vol.2, issue.4, 1992. ,
DOI : 10.1007/BF01379404
Automatic parallelization in the polytope model, Université de Versailles St-Quentin en Yvelines, 45, avenue des États-Unis, F-78035 Versailles Cedex, Laboratoire PRiSM, pp.79-103, 1996. ,
A case for automatic run-time code optimization, 1999. ,
SpiceC: scalable parallelism via implicit copying and explicit commit, Proceedings of the 16th ACM symposium on Principles and practice of parallel programming, pp.69-80, 2011. ,
A Practical Method for Quickly Evaluating Program Optimizations, Proceedings of the International Conference on High Performance Embedded Architectures and Compilers, pp.29-46, 2005. ,
DOI : 10.1007/11587514_4
URL : https://hal.archives-ouvertes.fr/inria-00001054
Practical Run-time Adaptation with Procedure Cloning to Enable Continuous Collective Compilation, GCC Developers' Summit, 2007. ,
URL : https://hal.archives-ouvertes.fr/hal-01257279
Collective optimization, ACM Transactions on Architecture and Code Optimization, vol.7, issue.4, 2010. ,
DOI : 10.1145/1880043.1880047
URL : https://hal.archives-ouvertes.fr/inria-00445326
Reducing overheads for acquiring dynamic memory traces, Workload Characterization Symposium, 2005. ,
Armin Grösslinger, and Louis-Noël Pouchet. Polly - polyhedral optimization in LLVM, First International Workshop on Polyhedral Compilation Techniques (IMPACT'11), 2011. ,
Low-overhead memory leak detection using adaptive statistical profiling, 11th Int. Conf. on Architectural support for programming languages and operating systems, ASPLOS-XI, 2004. ,
Machine virtuelle d'analyse et d'optimisation dynamique de programmes, 2010. ,
Bursty tracing: A framework for low-overhead temporal profiling, 4th ACM Workshop on Feedback-Directed and Dynamic Optimization (FDDO-4), 2001. ,
Transforming GCC into a research-friendly environment: plugins for optimization tuning and reordering, function cloning and program instrumentation, 2nd Int. Workshop on GCC Research Opportunities (GROW'10), 2010. ,
URL : https://hal.archives-ouvertes.fr/inria-00451106
FULLDOC: A Full Reporting Debugger for Optimized Code, 7th International Symposium on Static Analysis, SAS '00, 2000. ,
DOI : 10.1007/978-3-540-45099-3_13
VMAD: A virtual machine for advanced dynamic analysis of programs, IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS), pp.125-126, 2011. ,
DOI : 10.1109/ISPASS.2011.5762725
URL : https://hal.archives-ouvertes.fr/inria-00544501
Speculative thread decomposition through empirical optimization, Proceedings of the 12th ACM SIGPLAN symposium on Principles and practice of parallel programming, PPoPP '07, pp.205-214, 2007. ,
DOI : 10.1145/1229428.1229474
Prediction and trace compression of data access addresses through nested loop recognition, Proceedings of the sixth annual IEEE/ACM international symposium on Code generation and optimization, CGO '08, pp.94-103, 2008. ,
DOI : 10.1145/1356058.1356071
URL : https://hal.archives-ouvertes.fr/inria-00504597
Improving performance of optimized kernels through fast instantiations of templates, Concurrency and Computation: Practice and Experience, vol.35, issue.5, pp.59-70, 2009. ,
DOI : 10.1002/cpe.1333
Automatic speculative DOALL for clusters, Proceedings of the Tenth International Symposium on Code Generation and Optimization, CGO '12, pp.94-103, 2012. ,
DOI : 10.1145/2259016.2259029
Program phase and runtime distribution-aware online DVFS for combined Vdd/Vbb scaling, Proceedings of the Conference on Design, Automation and Test in Europe, DATE '09, European Design and Automation Association, pp.417-422, 2009. ,
SD3: A Scalable Approach to Dynamic Data-Dependence Profiling, 2010 43rd Annual IEEE/ACM International Symposium on Microarchitecture, pp.535-546, 2010. ,
DOI : 10.1109/MICRO.2010.49
Design of the Java HotSpot client compiler for Java 6, ACM Trans. Archit. Code Optim, vol.5, 2008. ,
Optimistic parallelism requires abstractions, Communications of the ACM, vol.52, issue.9, pp.89-97, 2009. ,
DOI : 10.1145/1562164.1562188
Transparent Debugging of Dynamically Optimized Code, 2009 International Symposium on Code Generation and Optimization, 2009. ,
DOI : 10.1109/CGO.2009.28
Loop-level parallelism in numeric and symbolic programs, IEEE Transactions on Parallel and Distributed Systems, vol.4, issue.7, pp.812-826, 1993. ,
DOI : 10.1109/71.238302
LLVM language reference manual ,
PEBIL: Efficient static binary instrumentation for Linux, 2010 IEEE International Symposium on Performance Analysis of Systems & Software (ISPASS), 2010. ,
DOI : 10.1109/ISPASS.2010.5452024
Exploiting thread-level speculative parallelism with software value prediction, Proceedings of the 10th Asia-Pacific conference on Advances in Computer Systems Architecture, ACSAC'05, pp.367-388, 2005. ,
POSH, Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming, PPoPP '06, pp.158-167, 2006. ,
DOI : 10.1145/1122971.1122997
Pin: building customized program analysis tools with dynamic instrumentation, Proceedings of the 2005 ACM SIGPLAN conference on Programming language design and implementation, PLDI '05, pp.190-200, 2005. ,
Finding representative sets of optimizations for adaptive multiversioning applications, International Workshop on Statistical and Machine learning approaches to ARchitectures and compilaTion, 2009. ,
URL : https://hal.archives-ouvertes.fr/inria-00436034
Dynamic performance tuning for speculative threads, ACM SIGARCH Computer Architecture News, vol.37, issue.3, pp.462-473, 2009. ,
DOI : 10.1145/1555815.1555812
Limits of parallelism using dynamic dependency graphs, Proceedings of the Seventh International Workshop on Dynamic Analysis, WODA '09, pp.42-48, 2009. ,
DOI : 10.1145/2134243.2134253
LiteRace: effective sampling for lightweight data-race detection, ACM SIGPLAN Conf. on Programming Language Design and Implementation, PLDI, 2009. ,
Scenario Based Optimization: A Framework for Statically Enabling Online Optimizations, 2009 International Symposium on Code Generation and Optimization, pp.169-179, 2009. ,
DOI : 10.1109/CGO.2009.24
Parallelizing sequential applications on commodity hardware using a low-cost software transactional memory, ACM SIGPLAN Notices, vol.44, issue.6, pp.166-176, 2009. ,
DOI : 10.1145/1543135.1542495
Double Inspection for Run-Time Loop Parallelization, Proceedings of the 24th International Workshop on Languages and Compilers for Parallel Computing, 2011. ,
DOI : 10.1007/978-3-642-36036-7_4
Valgrind: A framework for heavyweight dynamic binary instrumentation, PLDI '07: Proc. of ACM SIGPLAN Conf. on Programming Language Design and Implementation, 2007. ,
Automatic, template-based run-time specialization: implementation and experimental study, Proceedings of the 1998 International Conference on Computer Languages (Cat. No.98CB36225), pp.132-142, 1998. ,
DOI : 10.1109/ICCL.1998.674164
Languages and compilers for parallel computing, LCPC, chapter Set-Congruence Dynamic Analysis for Thread-Level Speculation (TLS), pp.156-171, 2008. ,
A lightweight in-place implementation for software thread-level speculation, Proceedings of the twenty-first annual symposium on Parallelism in algorithms and architectures, SPAA '09, pp.223-232, 2009. ,
OpenMP source code repository ,
An evaluation of different modeling techniques for iterative compilation, Proceedings of the 14th international conference on Compilers, architectures and synthesis for embedded systems, CASES '11, pp.65-74, 2011. ,
DOI : 10.1145/2038698.2038711
Implementation Issues of Loop-Level Speculative Run-Time Parallelization, Compiler Construction, pp.1-99, 1999. ,
DOI : 10.1007/978-3-540-49051-7_13
u-strasbg.fr/PolyLib. ,
Louis-Noël Pouchet. FM: the Fourier-Motzkin library ,
Loop transformations: convexity, pruning and optimization, Proceedings of the 38th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages, pp.549-562, 2011. ,
URL : https://hal.archives-ouvertes.fr/inria-00551077
Using thread-level speculation to simplify manual parallelization, Proceedings of the ninth ACM SIGPLAN symposium on Principles and practice of parallel programming, PPoPP '03, pp.1-12, 2003. ,
Static and dynamic methods of polyhedral compilation for an efficient execution in multicore environments, 2011. ,
URL : https://hal.archives-ouvertes.fr/tel-00733856
Adaptive Runtime Selection of Parallel Schedules in the Polytope Model, High Performance Computing Symposium, 2011. ,
Mitosis compiler: an infrastructure for speculative threading based on pre-computation slices, Proceedings of the 2005 ACM SIGPLAN conference on Programming language design and implementation, PLDI '05, pp.269-279, 2005. ,
Speculative parallelization using software multi-threaded transactions, Proceedings of the fifteenth edition of ASPLOS on Architectural support for programming languages and operating systems, ASPLOS '10, pp.65-76, 2010. ,
Spice, Proceedings of the sixth annual IEEE/ACM international symposium on Code generation and optimization, CGO '08, pp.175-184, 2008. ,
DOI : 10.1145/1356058.1356082
The LRPD test: speculative runtime parallelization of loops with privatization and reduction parallelization, Proceedings of the ACM SIGPLAN 1995 conference on Programming language design and implementation, PLDI '95, pp.218-232, 1995. ,
A Programming Language Interface to Describe Transformations and Code Generation, Proceedings of the 23rd international conference on Languages and compilers for parallel computing, pp.136-150, 2011. ,
DOI : 10.1145/1809028.1806606
Theory of linear and integer programming, 1986. ,
Discovering and exploiting program phases, IEEE Micro, vol.23, issue.6, pp.84-93, 2003. ,
DOI : 10.1109/MM.2003.1261391
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.58.4012
Compiling for template-based run-time code generation, Journal of Functional Programming, vol.13, issue.3, pp.677-708, 2003. ,
DOI : 10.1017/S095679680200463X
Green governors: A framework for Continuously Adaptive DVFS, 2011 International Green Computing Conference and Workshops, pp.1-8, 2011. ,
DOI : 10.1109/IGCC.2011.6008552
A scalable approach to thread-level speculation, Proceedings of the 27th annual international symposium on Computer architecture, ISCA '00, pp.1-12, 2000. ,
Speculation for parallelizing runtime checks, Proceedings of the 11th International Symposium on Stabilization, Safety, and Security of Distributed Systems, SSS '09, pp.698-710, 2009. ,
A framework for adaptive algorithm selection in STAPL, Proceedings of the tenth ACM SIGPLAN symposium on Principles and practice of parallel programming, PPoPP '05, pp.277-288, 2005. ,
DOI : 10.1145/1065944.1065981
Speculative parallelization using state separation and multiple value prediction, Proceedings of the 2010 international symposium on Memory management, ISMM '10, pp.63-72, 2010. ,
DOI : 10.1145/1806651.1806663
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.193.3128
Supporting speculative parallelization in the presence of dynamic data structures, Proceedings of the 2010 ACM SIGPLAN conference on Programming language design and implementation, PLDI '10, pp.62-73, 2010. ,
Copy or discard execution model for speculative parallelization on multicores, Proceedings of the 41st annual IEEE/ACM International Symposium on Microarchitecture, MICRO 41, pp.330-341, 2008. ,
Speculative Parallelization of Sequential Loops on Multicores, International Journal of Parallel Programming, vol.40, issue.5, pp.508-535, 2009. ,
DOI : 10.1007/s10766-009-0111-z
Enhanced speculative parallelization via incremental recovery, Proceedings of the 16th ACM symposium on Principles and practice of parallel programming, pp.189-200, 2011. ,
DOI : 10.1145/2038037.1941580
URL : http://alumni.cs.ucr.edu/~tianc/publications/ppopp11.pdf
Towards a holistic approach to auto-parallelization: integrating profile-driven parallelism detection and machine-learning based mapping, Proceedings of the 2009 ACM SIGPLAN conference on Programming language design and implementation, PLDI '09, pp.177-187, 2009. ,
Alter: exploiting breakable dependences for parallelization, Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation, PLDI '11, pp.480-491, 2011. ,
Origin tracking, Journal of Symbolic Computation, vol.15, issue.5-6, 1993. ,
DOI : 10.1016/S0747-7171(06)80004-0
The Paralax infrastructure, Proceedings of the 19th international conference on Parallel architectures and compilation techniques, PACT '10, pp.389-400, 2010. ,
DOI : 10.1145/1854273.1854322
Scalable Program Optimization Techniques in the Polyhedral Model, 2007. ,
isl: An Integer Set Library for the Polyhedral Model, In Lecture Notes in Computer Science, 2010. ,
DOI : 10.1007/978-3-642-15582-6_49
Counting Integer Points in Parametric Polytopes Using Barvinok's Rational Functions, Algorithmica, vol.48, issue.1, pp.37-66, 2007. ,
DOI : 10.1007/s00453-006-1231-0
Modeling optimistic concurrency using quantitative dependence analysis, Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of parallel programming, PPoPP '08, pp.185-196, 2008. ,
DOI : 10.1145/1345206.1345234
High-level adaptive program optimization with ADAPT, PPoPP '01, pp.93-102, 2001. ,
Subroutine profiling results for the CPU2006 benchmarks, ACM SIGARCH Computer Architecture News, vol.35, issue.1, 2007. ,
DOI : 10.1145/1241601.1241621
The SUIF compiler system: a parallelizing and optimizing research compiler, 1994. ,
A loop transformation theory and an algorithm to maximize parallelism, IEEE Transactions on Parallel and Distributed Systems, vol.2, issue.4, pp.452-471, 1991. ,
DOI : 10.1109/71.97902
Languages and compilers for parallel computing, LCPC, chapter Compiler-Driven Dependence Profiling to Guide Program Parallelization, pp.232-248 ,
Program Phase Detection Based Dynamic Control Mechanisms for Pipeline Stage Unification Adoption, Proceedings of the 6th international symposium on high-performance computing and 1st international conference on Advanced low power systems, ISHPC'05/ALPS'06, pp.494-507, 2008. ,
DOI : 10.1007/978-3-540-77704-5_46
Alchemist: A Transparent Dependence Distance Profiling Infrastructure, 2009 International Symposium on Code Generation and Optimization ,
DOI : 10.1109/CGO.2009.15
Uncovering hidden loop level parallelism in sequential applications, HPCA, pp.290-301, 2008. ,
A scheme to enforce data dependence on large multiprocessor systems, IEEE Trans. Softw. Eng, vol.13, pp.726-739, 1987. ,
Translation and Run-Time Validation of Loop Transformations, Formal Methods in System Design, vol.9, issue.3, pp.335-360, 2005. ,
DOI : 10.1007/s10703-005-3402-z