A. Aggarwal, S. Vitter, and J. , The input/output complexity of sorting and related problems, Communications of the ACM, vol.31, issue.9, pp.1116-1127, 1988.
DOI : 10.1145/48529.48535

URL : https://hal.archives-ouvertes.fr/inria-00075827

M. Agrawal, N. Kayal, and N. Saxena, PRIMES is in P, Annals of Mathematics, vol.160, issue.2, pp.781-793, 2004.
DOI : 10.4007/annals.2004.160.781

L. Arge, The Buffer Tree: A Technique for Designing Batched External Data Structures, Algorithmica, vol.37, issue.1, pp.1-24, 2003.
DOI : 10.1007/s00453-003-1021-x

L. Arge, M. A. Bender, E. D. Demaine, B. Holland-minkley, and J. I. Munro, Cacheoblivious priority queue and graph algorithm applications, Proc./ of the 34th Annual Symposium on Theory of Computing, pp.268-276, 2002.

L. Arge and M. Thorup, Ram-efficient external memory sorting, Algorithms and Computation, pp.491-501

A. Atkin and D. Bernstein, Prime sieves using binary quadratic forms, Mathematics of Computation, vol.73, issue.246, pp.1023-1030, 2004.
DOI : 10.1090/S0025-5718-03-01501-1

J. Barkley-rosser and L. Schoenfeld, Approximate formulas for some functions of prime numbers, Illinois J. Math, vol.6, pp.64-94, 1962.

C. Bays and R. H. Hudson, The segmented sieve of eratosthenes and primes in arithmetic progressions to 1012, BIT, vol.17, issue.2, pp.121-127, 1977.
DOI : 10.1007/BF01932283

S. Bengelloun, An incremental primal sieve, Acta Informatica, vol.24, issue.2, pp.119-125, 1986.
DOI : 10.1145/359657.359660

R. P. Brent, The first occurrence of large gaps between successive primes, Mathematics of Computation, vol.27, issue.124, pp.959-963, 1973.
DOI : 10.1090/S0025-5718-1973-0330021-0

D. A. Cox, Primes of the form x 2 + ny 2 : Fermat, Class Field Theory, and Complex Multiplication, 1989.

B. Dunten, J. Jones, and J. Sorenson, A space-efficient fast prime number sieve, Information Processing Letters, vol.59, issue.2, pp.79-84, 1996.
DOI : 10.1016/0020-0190(96)00099-3

M. Farach-colton and M. Tsai, On the Complexity of Computing Prime Tables, Algorithms and Computation -26th International Symposium, ISAAC'15, 2015.
DOI : 10.1007/s11227-006-7285-5

R. Gale and V. Pratt, CGOL?an algebraic notation for MACLISP users, 1977.

D. Gries and J. Misra, A linear sieve algorithm for finding prime numbers, Communications of the ACM, vol.21, issue.12, pp.999-1003, 1978.
DOI : 10.1145/359657.359660

G. H. Hardy and E. M. Wright, An introduction to the theory of numbers, Bulletin of the American Mathematical Society, vol.35, issue.6, 1979.
DOI : 10.1090/S0002-9904-1929-04793-1

S. Horsley, . Ko?kinon-epato??enoy?, and . Or, The Sieve of Eratosthenes Being an Account of His Method of Finding All the Prime Numbers, by the Rev, FRS. Philosophical Transactions, pp.327-347

H. W. Lenstra-jr and C. Pomerance, Primality Testing with Gaussian Periods, Lecture Notes in Computer Science, pp.1-1, 2002.
DOI : 10.1007/3-540-36206-1_1

H. G. Mairson, Some new upper bounds on the generation of prime numbers, Communications of the ACM, vol.20, issue.9, pp.664-669, 1977.
DOI : 10.1145/359810.359838

F. News, World's largest prime number discovered ? all 17 million digits. https: //web.archive.org/web, 2013.

M. Patrascu and M. Thorup, Dynamic Integer Sets with Optimal Rank, Select, and Predecessor Search, 2014 IEEE 55th Annual Symposium on Foundations of Computer Science, pp.166-175, 2014.
DOI : 10.1109/FOCS.2014.26

C. Pomerance, J. L. Selfridge, and S. S. Wagstaff, The pseudoprimes to 25·10 9, Mathematics of Computation, vol.35, issue.151, pp.1003-1026, 1980.

P. Pritchard, A sublinear additive sieve for finding prime number, Communications of the ACM, vol.24, issue.1, pp.18-23, 1981.
DOI : 10.1145/358527.358540

P. Pritchard, Linear prime-number sieves: A family tree, Science of Computer Programming, vol.9, issue.1, pp.17-35, 1987.
DOI : 10.1016/0167-6423(87)90024-4

P. Pritchard, Linear prime-number sieves: A family tree, Science of Computer Programming, vol.9, issue.1, pp.17-35, 1987.
DOI : 10.1016/0167-6423(87)90024-4

R. C. Singleton, Algorithm 357: an efficient prime number generator [A1], Communications of the ACM, pp.563-564, 1969.
DOI : 10.1145/363235.363247

J. Sorenson, An introduction to prime number sieves, 1990.

J. P. Sorenson, The Pseudosquares Prime Sieve, Algorithmic number theory, pp.193-207, 2006.
DOI : 10.1007/11792086_15

J. S. Vitter, External memory algorithms and data structures: dealing with massive data, ACM Computing Surveys, vol.33, issue.2, pp.209-271, 2001.
DOI : 10.1145/384192.384193

H. C. Williams, Edouard lucas and primality testing, Canadian Mathematics Society Series of Monographs and Advanced Texts, issue.22, 1998.

, N ) space, but it can be segmented among level curves as in Appendix B.1 to use O N 1/2+o(N ) Note that in what follows objects are passed to functions as references. The code for some functions has been omitted

W. Computewheelmodulus,

U. Computeunitsmod,

, create three empty lists of pairs L1, L2 and L3

L. Constructprincipaldomain,

L. Constructprincipaldomain,

L. Constructprincipaldomain,

, create an empty min priority queue Q that only stores values

W. Insertvaluesfromdomain-(-q,

W. Insertvaluesfromdomain-(-q,

, Q.Insert(?

, create an empty queue S

S. Eliminateeven,

, Print all the primes dividing W ; EliminateSquaresAndPrint(S)

, Stony Brook, NY 11794-2424 USA. Email: {bender, rob, smccauley,shiksingh}@cs.stonybrook.edu. ? MS 1326

?. Po and . Box, Email: tmkroeg@sandia.gov. § LIP, ENS de Lyon, 46 allee d'Italie

, Intel Corporation, 2200 Mission College Blvd

, As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only. Permission to make digital or hard copies for personal or classroom use is granted. Copies must bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. To copy otherwise, distribute, republish, or post, requires prior specific permission and/or a fee, ACM

, PODS'16, p.0, 2016.

. Doi, 2902276 HI cache-oblivious B-trees. Searches take O(log B N ) I/Os; inserts and deletes take O( log 2 N B + log B N ) amortized I/Os with high probability; and range queries returning k elements take O(log B N + k/B) I/Os. Our HI external-memory skip list achieves optimal bounds with high probability, analogous to in-memory skip lists: O(log B N )

I. Os and . Point, log B N ) I/Os for in- serts/deletes Range queries returning k elements run in O(log B N + k/B) I/Os In contrast, the best possible highprobability bounds for inserting into the folklore B-skip list, which promotes elements with probability 1/B, is just ?(log N ) I/Os. This is no better than the bounds one gets from running an inmemory skip list in external memory

8. Abraham, J. Aspnes, and J. Yuan, Skip B-Trees, Proc. of the 9th Annual International Conference on Principles of Distributed Systems (OPODIS), p.366, 2006.
DOI : 10.1109/TNET.2002.808407

U. A. Acar, G. E. Blelloch, R. Harper, J. L. Vittes, and S. L. Woo, Dynamizing static algorithms, with applications to dynamic trees and history independence, Proc. of the 15th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp.531-540, 2004.

A. Aggarwal and J. S. Vitter, The input/output complexity of sorting and related problems, Communications of the ACM, vol.31, issue.9, pp.1116-1127, 1988.
DOI : 10.1145/48529.48535

URL : https://hal.archives-ouvertes.fr/inria-00075827

O. Amble and D. E. Knuth, Ordered hash tables, The Computer Journal, vol.17, issue.2, pp.135-142, 1974.
DOI : 10.1093/comjnl/17.2.135

A. Anagnostopoulos, M. Goodrich, and R. Tamassia, Persistent authenticated dictionaries and their applications. Information Security, pp.379-393, 2001.

A. Andersson and T. Ottmann, Faster uniquely represented dictionaries, [1991] Proceedings 32nd Annual Symposium of Foundations of Computer Science, pp.642-649, 1991.
DOI : 10.1109/SFCS.1991.185430

C. R. Aragon and R. G. Seidel, Randomized search trees, Proc. of the 30th Annual IEEE Symposium on Foundations of Computer Science (FOCS), pp.540-545, 1989.

L. Arge, D. Eppstein, and M. T. Goodrich, Skip-webs, Proceedings of the twenty-fourth annual ACM SIGACT-SIGOPS symposium on Principles of distributed computing , PODC '05, pp.69-76, 2005.
DOI : 10.1145/1073814.1073827

J. Aspnes and G. Shah, Skip graphs, ACM Transactions on Algorithms, vol.3, issue.4, p.37, 2007.
DOI : 10.1145/1290672.1290674

S. Bajaj, A. Chakraborti, and R. Sion, The foundations of history independence. arXiv preprint, 2015.
DOI : 10.1109/tifs.2015.2491309

URL : http://arxiv.org/pdf/1501.06508

S. Bajaj and R. Sion, Ficklebase: Looking into the future to erase the past, 2013 IEEE 29th International Conference on Data Engineering (ICDE), pp.86-97, 2013.
DOI : 10.1109/ICDE.2013.6544816

S. Bajaj and R. Sion, HIFS, Proceedings of the 2013 ACM SIGSAC conference on Computer & communications security, CCS '13, pp.1285-1296, 2013.
DOI : 10.1145/2508859.2516724

M. A. Bender, R. Cole, and R. Raman, Exponential Structures for Efficient Cache-Oblivious Algorithms, Proc. of the 29th Annual International Colloquium on Automata, Languages, and Programming (ICALP), pp.195-207, 2002.
DOI : 10.1007/3-540-45465-9_18

URL : http://www.daimi.au.dk/~large/ioS05/BCR.pdf

M. A. Bender, E. D. Demaine, and M. Farach-colton, Cache-Oblivious B-Trees, SIAM Journal on Computing, vol.35, issue.2, pp.341-358, 2005.
DOI : 10.1137/S0097539701389956

M. A. Bender, Z. Duan, J. Iacono, and J. Wu, A locality-preserving cache-oblivious dynamic dictionary, Journal of Algorithms, vol.53, issue.2, pp.115-136, 2004.
DOI : 10.1016/j.jalgor.2004.04.014

URL : http://www.cs.technion.ac.il/~itai/Courses/Cache/locality-full.ps

M. A. Bender, M. Farach-colton, and B. C. , Cache-oblivious string B-trees, Proceedings of the twenty-fifth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems , PODS '06, pp.233-242, 2006.
DOI : 10.1145/1142351.1142385

URL : http://www.cs.sunysb.edu/~bender/pub/PODS06-BFK.pdf

M. A. Bender, J. T. Fineman, S. Gilbert, and B. C. , Concurrent cache-oblivious b-trees, Proceedings of the 17th annual ACM symposium on Parallelism in algorithms and architectures , SPAA'05, pp.228-237, 2005.
DOI : 10.1145/1073970.1074009

URL : http://erikdemaine.org/papers/CacheObliviousBTrees_SICOMP/paper.pdf

M. A. Bender and H. Hu, An adaptive packed-memory array, ACM Transactions on Database Systems, vol.32, issue.4, p.26, 2007.
DOI : 10.1145/1292609.1292616

URL : http://www.cs.sunysb.edu/~bender/pub/BenderHu-TODS07.pdf

J. Bethencourt, D. Boneh, and B. Waters, Cryptographic methods for storing ballots on a voting machine, Proc. of the 14th Network and Distributed System Security Symposium (NDSS), 2007.

G. E. Blelloch and D. Golovin, Strongly History-Independent Hashing with Applications, 48th Annual IEEE Symposium on Foundations of Computer Science (FOCS'07), pp.272-282, 2007.
DOI : 10.1109/FOCS.2007.36

URL : http://www.cs.cmu.edu/~dgolovin/papers/focs07.pdf

G. E. Blelloch, D. Golovin, and V. Vassilevska, Uniquely Represented Data Structures for Computational Geometry, Proc. of the 11th Scandinavian Workshop on Algorithm Theory (SWAT), pp.17-28, 2008.
DOI : 10.1007/978-3-540-69903-3_4

URL : http://www-2.cs.cmu.edu/~virgi/urtechreport.pdf

G. S. Brodal, R. Fagerberg, and R. Jacob, Cache Oblivious Search Trees via Binary Trees of Small Height, BRICS Report Series, vol.8, issue.36
DOI : 10.7146/brics.v8i36.21696

URL : http://www14.in.tum.de/personen/jacob/Publications/soda02.pdf

, Proc. of the 13th Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pp.39-48, 2002.

N. Buchbinder and E. Petrank, Lower and upper bounds on obtaining history independence, Advances in Cryptology, pp.445-462, 2003.

J. Bulánek, M. Kouck-`-kouck-`-y, and M. Saks, Tight lower bounds for the online labeling problem, Proc. of the 44th Annual ACM Symposium on Theory of Computing (STOC), pp.1185-1198, 2012.

P. Callahan, M. T. Goodrich, and K. Ramaiyer, Topology B-trees and their applications, Proc. of the 4th International Workshop on Algorithms and Data Structures (WADS), pp.381-392, 1995.
DOI : 10.1007/3-540-60220-8_78

V. Ciriani, P. Ferragina, F. Luccio, and S. Muthukrishnan, Static optimality theorem for external memory string access, The 43rd Annual IEEE Symposium on Foundations of Computer Science, 2002. Proceedings., pp.219-227, 2002.
DOI : 10.1109/SFCS.2002.1181945

L. Devroye, A limit theory for random skip lists. The Annals of Applied Probability, pp.597-609, 1992.

M. Fomitchev and E. Ruppert, Lock-free linked lists and skip lists, Proceedings of the twenty-third annual ACM symposium on Principles of distributed computing , PODC '04, pp.50-59, 2004.
DOI : 10.1145/1011767.1011776

M. Frigo, C. E. Leiserson, H. Prokop, and S. Ramachandran, Cache-oblivious algorithms, Proc. of the 40th Annual IEEE Symposium on the Foundations of Computer Science (FOCS), pp.285-298, 1999.

M. Frigo, C. E. Leiserson, H. Prokop, and S. Ramachandran, Cache-oblivious algorithms, ACM Transactions on Algorithms, vol.8, issue.1, p.4, 2012.

D. Golovin, Uniquely Represented Data Structures with Applications to Privacy, 2008.

D. Golovin, B-Treaps: A Uniquely Represented Alternative to B-Trees, Proc. of the 36th Annual International Colloquium on Automata, Languages, and Programming (ICALP), pp.487-499, 2009.
DOI : 10.1561/0400000014

URL : http://www.cs.cmu.edu/~dgolovin/papers/btreap.pdf

D. Golovin, The B-skip-list: A simpler uniquely represented alternative to B-trees. arXiv preprint, 2010.
DOI : 10.1007/978-3-642-02927-1_41

URL : http://www.cs.cmu.edu/~dgolovin/papers/btreap.pdf

M. T. Goodrich and R. Tamassia, Efficient authenticated dictionaries with skip lists and commutative hashing, p.15, 2000.
DOI : 10.1109/discex.2001.932160

URL : http://www.cs.brown.edu/cgc/stms/papers/discex2001.pdf

J. A. Halderman, S. D. Schoen, N. Heninger, W. Clarkson, W. Paul et al., Lest we remember, Communications of the ACM, vol.52, issue.5, pp.91-98, 2009.
DOI : 10.1145/1506409.1506429

J. D. Hartline, E. S. Hong, A. E. Mohr, W. R. Pentney, and E. C. Rocke, Characterizing History Independent Data Structures, Algorithmica, vol.42, issue.1, pp.57-74, 2005.
DOI : 10.1007/s00453-004-1140-z

M. Herlihy, Y. Lev, V. Luchangco, and N. Shavit, A Simple Optimistic Skiplist Algorithm, Proc. of the 14th Annual Colloquium on Structural Information and Communication Complexity (SIROCCO), p.124, 2007.
DOI : 10.1007/978-3-540-72951-8_11

A. Itai, A. Konheim, and M. Rodeh, A sparse table implementation of priority queues, Proc. of the 8th Annual International Colloquium on Automata, Languages, and Programming (ICALP), pp.417-431, 1981.
DOI : 10.1007/3-540-10843-2_34

URL : https://link.springer.com/content/pdf/10.1007%2F3-540-10843-2_34.pdf

R. Jacob, A. Richa, C. Scheideler, S. Schmid, and H. Täubig, A distributed polylogarithmic time algorithm for self-stabilizing skip graphs, Proceedings of the 28th ACM symposium on Principles of distributed computing, PODC '09, pp.131-140, 2009.
DOI : 10.1145/1582716.1582741

URL : http://www.public.asu.edu/~aricha/mypapers/podc09.pdf

Z. Kasheff, Cache-oblivious dynamic search trees, 2004.

I. , Implicit data structures based on local reorganizations, 2002.

P. Kirschenhofer and H. Prodinger, The path length of random skip lists, Acta Informatica, vol.43, issue.42, pp.31775-792, 1994.
DOI : 10.1007/BF01178735

D. Micciancio, Oblivious data structures, Proceedings of the twenty-ninth annual ACM symposium on Theory of computing , STOC '97, pp.456-464, 1997.
DOI : 10.1145/258533.258638

D. Molnar, T. Kohno, N. Sastry, and D. Wagner, Tamper-evident, history-independent, subliminal-free data structures on PROM storage -or- how to store ballots on a voting machine, 2006 IEEE Symposium on Security and Privacy (S&P'06), 2006.
DOI : 10.1109/SP.2006.39

T. Moran, M. Naor, and G. Segev, Deterministic History-Independent Strategies for Storing Information on Write-Once Memories, Proc. of the 34th International Colloquium on Automata, Languages and Programming (ICALP), 2007.
DOI : 10.1007/978-3-540-73420-8_28

M. Naor, G. Segev, and U. Wieder, History-Independent Cuckoo Hashing, Proc. of the 35th International Colloquium on Automata, Languages and Programming (ICALP), pp.631-642, 2008.
DOI : 10.1007/978-3-540-70583-3_51

M. Naor and V. Teague, Anti-persistence: history independent data structures, Proc. of the 33rd Annual ACM Symposium on Theory of Computing (STOC), pp.492-501, 2001.

J. Nievergelt and E. M. Reingold, Binary Search Trees of Bounded Balance, SIAM Journal on Computing, vol.2, issue.1, pp.33-43, 1973.
DOI : 10.1137/0202005

R. Oshman and N. Shavit, The SkipTrie, Proceedings of the 2013 ACM symposium on Principles of distributed computing, PODC '13, pp.23-32, 2013.
DOI : 10.1145/2484239.2484270

T. Papadakis, J. I. Munro, and P. V. Poblete, Analysis of the expected search cost in skip lists, Proc. of the 2nd Scandinavian Workshop on Algorithm Theory (SWAT), pp.160-172, 1990.
DOI : 10.1007/3-540-52846-6_86

H. Prokop, Cache oblivious algorithms Master's thesis, Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology [52] W. Pugh. Incremental computation and the incremental evaluation of functional programs, 1988.

W. Pugh, Skip lists: a probabilistic alternative to balanced trees, Communications of the ACM, vol.33, issue.6, pp.668-676, 1990.
DOI : 10.1145/78973.78977

W. Pugh and T. Teitelbaum, Incremental computation via function caching, Proceedings of the 16th ACM SIGPLAN-SIGACT symposium on Principles of programming languages , POPL '89, pp.315-328, 1989.
DOI : 10.1145/75277.75305

N. Rahman, R. Cole, and R. Raman, Optimised Predecessor Data Structures for Internal Memory, Proc. of the 5th International Workshop on Algorithm Engineering (WAE), pp.67-78, 2001.
DOI : 10.1007/3-540-44688-5_6

D. S. Roche, A. J. Aviv, and S. G. Choi, Oblivious secure deletion with bounded history independence, 2015.

N. Shavit and I. Lotan, Skiplist-based concurrent priority queues, Proceedings 14th International Parallel and Distributed Processing Symposium. IPDPS 2000, pp.263-268, 2000.
DOI : 10.1109/IPDPS.2000.845994

J. Shun and G. E. Blelloch, Phase-concurrent hash tables for determinism, Proceedings of the 26th ACM symposium on Parallelism in algorithms and architectures, SPAA '14, pp.96-107, 2014.
DOI : 10.1145/2612669.2612687

L. Snyder, On uniquely represented data strauctures, 18th Annual Symposium on Foundations of Computer Science (sfcs 1977), pp.142-146, 1977.
DOI : 10.1109/SFCS.1977.22

R. Sundar and R. E. Tarjan, Unique binary search tree representations and equality-testing of sets and sequences, Proc. of the 22nd Annual ACM Symposium on Theory of Computing (STOC), pp.18-25, 1990.

T. Tzouramanis, History-independence, Proceedings of the 27th Annual ACM Symposium on Applied Computing, SAC '12, pp.7-12, 2012.
DOI : 10.1145/2245276.2245279

J. S. Vitter, Random sampling with a reservoir, ACM Transactions on Mathematical Software, vol.11, issue.1, pp.37-57, 1985.
DOI : 10.1145/3147.3165

D. E. Willard, Inserting and deleting records in blocked sequential files, 1981.

D. E. Willard, Maintaining dense sequential files in a dynamic environment, Proc. of the 14th Annual ACM Symposium on Theory of Computing (STOC), pp.114-121, 1982.

D. E. Willard, Good worst-case algorithms for inserting and deleting records in dense sequential files, ACM SIGMOD Record, vol.15, issue.2, pp.251-260, 1986.
DOI : 10.1145/16856.16879

D. E. Willard, A density control algorithm for doing insertions and deletions in a sequentially ordered file in a good worst-case time, Information and Computation, vol.97, issue.2, pp.150-204, 1992.
DOI : 10.1016/0890-5401(92)90034-D

A. B. , History-independent sparse tables and dictionaries

]. A. Aggarwal and J. S. Vitter, The input/output complexity of sorting and related problems, Communications of the ACM, vol.31, issue.9, pp.1116-1127, 1988.
DOI : 10.1145/48529.48535

URL : https://hal.archives-ouvertes.fr/inria-00075827

M. Agrawal, N. Kayal, and N. Saxena, PRIMES is in P, Annals of Mathematics, vol.160, issue.2, pp.781-793, 2004.
DOI : 10.4007/annals.2004.160.781

E. Agullo, On the Out-Of-Core Factorization of Large Sparse Matrices, 2008.
URL : https://hal.archives-ouvertes.fr/tel-00563463

E. Agullo, P. R. Amestoy, A. Buttari, A. Guermouche, J. L. Excellent et al., Robust Memory-Aware Mappings for Parallel Multifrontal Factorizations, SIAM Journal on Scientific Computing, vol.38, issue.3, 2016.
DOI : 10.1137/130938505

URL : https://hal.archives-ouvertes.fr/hal-00726644

E. Agullo, O. Beaumont, L. Eyraud-dubois, and S. Kumar, Are Static Schedules so Bad? A Case Study on Cholesky Factorization, 2016 IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp.1021-1030, 2016.
DOI : 10.1109/IPDPS.2016.90

URL : https://hal.archives-ouvertes.fr/hal-01223573

E. Agullo, A. Buttari, A. Guermouche, and F. Lopez, Implementing Multifrontal Sparse Solvers for Multicore Architectures with Sequential Task Flow Runtime Systems, ACM Transactions on Mathematical Software, vol.43, issue.2, p.13, 2016.
DOI : 10.1109/71.993206

URL : https://hal.archives-ouvertes.fr/hal-01333645

M. Amaris, G. Lucarelli, C. Mommessin, and D. Trystram, Generic algorithms for scheduling applications on heterogeneous multi-core platforms, 2017.

M. Amaris, G. Lucarelli, C. Mommessin, and D. Trystram, Generic Algorithms for Scheduling Applications on Hybrid Multi-core Machines, In: Euro-Par, vol.40, issue.5, pp.220-231, 2017.
DOI : 10.1109/HCW.1999.765092

URL : https://hal.archives-ouvertes.fr/hal-01420798

P. R. Amestoy, I. S. Duff, J. Koster, and J. Excellent, A Fully Asynchronous Multifrontal Solver Using Distributed Dynamic Scheduling, SIAM Journal on Matrix Analysis and Applications, vol.23, issue.1, pp.15-41, 2001.
DOI : 10.1137/S0895479899358194

URL : https://hal.archives-ouvertes.fr/hal-00808293

P. R. Amestoy, A. Guermouche, J. Excellent, and S. Pralet, Hybrid scheduling for the parallel solution of linear systems, Parallel Computing, vol.32, issue.2, pp.136-156, 2006.
DOI : 10.1016/j.parco.2005.07.004

URL : https://hal.archives-ouvertes.fr/inria-00070599

P. R. Amestoy, A. Buttari, I. S. Duff, A. Guermouche, J. L. Excellent et al., In: Encyclopedia of Parallel Computing, pp.1232-1238, 2011.

L. Arge, The buffer tree: A technique for designing batched external data structures, In: Algorithmica, vol.371, pp.1-24, 2003.

L. Arge and M. Thorup, RAM-Efficient External Memory Sorting, In: Algorithms and Computation, vol.8283, pp.491-501, 2013.
DOI : 10.1007/978-3-642-45030-3_46

R. G. Ashcraft, J. G. Grimes, B. W. Lewis, H. D. Peyton, and . Simon, Progress in Sparse Matrix Methods for Large Linear Systems On Vector Supercomputers, The International Journal of Supercomputing Applications, vol.1, issue.4, pp.10-30, 1987.
DOI : 10.1137/0602010

A. Atkin and D. Bernstein, Prime sieves using binary quadratic forms, Mathematics of Computation, vol.73, issue.246, pp.1023-1030, 2004.
DOI : 10.1090/S0025-5718-03-01501-1

C. Augonnet, J. Clet-ortega, S. Thibault, and R. Namyst, Data-Aware Task Scheduling on Multi-accelerator Based Platforms, 2010 IEEE 16th International Conference on Parallel and Distributed Systems, pp.291-298, 2010.
DOI : 10.1109/ICPADS.2010.129

URL : https://hal.archives-ouvertes.fr/inria-00523937

C. Augonnet, S. Thibault, R. Namyst, and P. Wacrenier, StarPU: a unified platform for task scheduling on heterogeneous multicore architectures, Concurrency and Computation: Practice and Experience 23, pp.187-198, 2011.
URL : https://hal.archives-ouvertes.fr/inria-00384363

W. G. Aulbur, Parallel implementations of quasiparticle calculations of semiconductors and insulators, 1996.

G. Aupy, C. Brasseur, and L. Marchal, Dynamic Memory-Aware Task-Tree Scheduling, 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp.758-767, 2017.
DOI : 10.1109/IPDPS.2017.58

URL : https://hal.archives-ouvertes.fr/hal-01472062

O. Beaumont, N. Bonichon, L. Eyraud-dubois, and L. Marchal, Minimizing Weighted Mean Completion Time for Malleable Tasks Scheduling, 2012 IEEE 26th International Parallel and Distributed Processing Symposium, pp.273-284, 2012.
DOI : 10.1109/IPDPS.2012.34

URL : https://hal.archives-ouvertes.fr/inria-00564056

O. Beaumont, L. Eyraud-dubois, and S. Kumar, Approximation Proofs of a Fast and Efficient List Scheduling Algorithm for Task-Based Runtime Systems on Multicores and GPUs, 2017 IEEE International Parallel and Distributed Processing Symposium (IPDPS), pp.768-777
DOI : 10.1109/IPDPS.2017.71

URL : https://hal.archives-ouvertes.fr/hal-01386174

O. Beaumont, T. Cojean, L. Eyraud-dubois, A. Guermouche, and S. Kumar, Scheduling of Linear Algebra Kernels on Multiple Heterogeneous Resources, 2016 IEEE 23rd International Conference on High Performance Computing (HiPC), 2016.
DOI : 10.1109/HiPC.2016.045

URL : https://hal.archives-ouvertes.fr/hal-01361992

O. Beaumont and A. Guermouche, Task Scheduling for Parallel Multifrontal Methods, Parallel Processing International Conference, pp.758-766, 2007.
DOI : 10.1007/978-3-540-74466-5_80

URL : https://hal.archives-ouvertes.fr/hal-00358626

L. A. Belady, A study of replacement algorithms for a virtual-storage computer, IBM Systems Journal, vol.5, issue.2, pp.78-101, 1966.
DOI : 10.1147/sj.52.0078

M. Benazouz, O. Marchetti, A. Munier-kordon, and T. Michel, A new method for minimizing buffer sizes for Cyclo-Static Dataflow graphs, 2010 8th IEEE Workshop on Embedded Systems for Real-Time Multimedia, pp.11-20, 2010.
DOI : 10.1109/ESTMED.2010.5666980

URL : https://hal.archives-ouvertes.fr/hal-00461647

M. Benazouz, O. Marchetti, A. Munier-kordon, and P. Urard, A new approach for minimizing buffer capacities with throughput constraint for embedded system design, ACS/IEEE International Conference on Computer Systems and Applications, AICCSA 2010, pp.1-8, 2010.
DOI : 10.1109/AICCSA.2010.5586972

URL : https://hal.archives-ouvertes.fr/hal-00368648

M. A. Bender, E. D. Demaine, and M. Farach-colton, Cache-Oblivious B-Trees, SIAM Journal on Computing, vol.35, issue.2, pp.341-358, 2005.
DOI : 10.1137/S0097539701389956

M. A. Bender, Z. Duan, J. Iacono, and J. Wu, A locality-preserving cache-oblivious dynamic dictionary, Journal of Algorithms, vol.53, issue.2, pp.115-136, 2004.
DOI : 10.1016/j.jalgor.2004.04.014

URL : http://www.cs.technion.ac.il/~itai/Courses/Cache/locality-full.ps

M. A. Bender and H. Hu, An adaptive packed-memory array, In: ACM Transactions on Database Systems, vol.324, p.26, 2007.
DOI : 10.1145/1292609.1292616

URL : http://www.cs.sunysb.edu/~bender/pub/BenderHu-TODS07.pdf

S. Bharathi, A. Chervenak, E. Deelman, G. Mehta, M. Su et al., Characterization of scientific workflows, 2008 Third Workshop on Workflows in Support of Large-Scale Science, pp.1-10, 2008.
DOI : 10.1109/WORKS.2008.4723958

S. Bharathi and A. Chervenak, Scheduling data-intensive workflows on storage constrained resources, Proceedings of the 4th Workshop on Workflows in Support of Large-Scale Science, WORKS '09, 2009.
DOI : 10.1145/1645164.1645167

URL : http://www.isi.edu/~annc/papers/works2009.pdf

R. Bleuse, T. Gautier, J. V. Lima, G. Mounié, and D. Trystram, Scheduling Data Flow Program in XKaapi: A New Affinity Based Algorithm for Heterogeneous Architectures, In: Euro-Par, vol.2014, pp.560-571, 2014.
DOI : 10.1007/978-3-319-09873-9_47

URL : https://hal.archives-ouvertes.fr/hal-01081629

R. Bleuse, S. Hunold, S. Kedad-sidhoum, F. Monna, G. Mounié et al., Scheduling Independent Moldable Tasks on Multi-Cores with GPUs, IEEE Transactions on Parallel and Distributed Systems, vol.28, issue.9, pp.2689-2702, 2017.
DOI : 10.1109/TPDS.2017.2675891

URL : https://hal.archives-ouvertes.fr/hal-01263100

R. Bleuse, S. Kedad-sidhoum, F. Monna, G. Mounié, and D. Trystram, Scheduling independent tasks on multi-cores with GPU accelerators, Concurrency and Computation: Practice and Experience 27, pp.1625-1638, 2015.
DOI : 10.1007/s00607-003-0011-9

URL : https://hal.archives-ouvertes.fr/hal-01081625

R. D. Blumofe, C. F. Joerg, B. C. Kuszmaul, C. E. Leiserson, K. H. Randall et al., Cilk: An Efficient Multithreaded Runtime System, In: SIGPLAN Not, vol.308, pp.207-216, 1995.
DOI : 10.1006/jpdc.1996.0107

URL : http://www.lcs.mit.edu/publications/pubs/pdf/MIT-LCS-TM-548.pdf

G. Bosilca, A. Bouteiller, A. Danalis, M. Faverge, T. Herault et al., PaRSEC: Exploiting Heterogeneity to Enhance Scalability, Computing in Science & Engineering, vol.15, issue.6, pp.36-45, 2013.
DOI : 10.1109/MCSE.2013.98

Z. Budimli´cbudimli´c, M. Burke, V. Cavé, K. Knobe, G. Lowney et al., Concurrent collections, In: Scientific Programming, vol.18, pp.3-4, 2010.

J. Bulánek, M. Koucky, and M. Saks, Tight lower bounds for the online labeling problem, Proc. of the 44th Annual ACM Symposium on Theory of Computing (STOC). 2012, pp.1185-1198

A. Buttari, Fine Granularity Sparse QR Factorization for Multicore Based Systems, International Conference on Applied Parallel and Scientific Computing. 2012, pp.226-236
DOI : 10.1137/0910005

L. Canon, L. Marchal, and F. Vivien, Low-Cost Approximation Algorithms for Scheduling Independent Tasks on Hybrid Platforms, In: Euro-Par, vol.63, issue.11, pp.232-244, 2017.
DOI : 10.1016/S0743-7315(03)00108-4

URL : https://hal.archives-ouvertes.fr/hal-01559898

, Chameleon, a dense linear algebra software for heterogeneous architectures

C. Chen and C. Chu, A 3.42-approximation algorithm for scheduling malleable tasks under precedence constraints, IEEE Transactions on Parallel and Distributed Systems, vol.248, pp.1479-1488, 2013.
URL : https://hal.archives-ouvertes.fr/hal-01253311

L. Chen, D. Ye, and G. Zhang, ONLINE SCHEDULING OF MIXED CPU-GPU JOBS, International Journal of Foundations of Computer Science, vol.25, issue.06, pp.6-745, 2014.
DOI : 10.1016/j.ipl.2011.05.022

F. A. Chudak and D. B. Shmoys, Approximation Algorithms for Precedence-Constrained Scheduling Problems on Parallel Machines that Run at Different Speeds, Journal of Algorithms, vol.30, issue.2, pp.323-343, 1999.
DOI : 10.1006/jagm.1998.0987

E. Ciurea and L. Ciupalâ, Sequential and parallel algorithms for minimum flows, Journal of Applied Mathematics and Computing, vol.3, issue.1-2, pp.53-75, 2004.
DOI : 10.1007/BF02935746

G. Cordasco, R. D. Chiara, and A. L. Rosenberg, Assessing the Computational Benefits of AREA-Oriented DAG-Scheduling, Euro-Par 2011 Parallel Processing -17th International Conference Proceedings, Part I. 2011, pp.180-192, 2011.
DOI : 10.1016/j.parco.2009.12.005

T. H. Cormen, C. E. Leiserson, R. L. Rivest, and C. Stein, Introduction to Algorithms, Third Edition, 2009.

J. R. Correa and A. S. Schulz, Single-machine scheduling with precedence constraints, In: Mathematics of Operations Research, vol.304, pp.1005-1021, 2005.

R. F. Da-silva, W. Chen, G. Juve, K. Vahi, and E. Deelman, Community Resources for Enabling Research in Distributed Scientific Workflows, 2014 IEEE 10th International Conference on e-Science, pp.177-184, 2014.
DOI : 10.1109/eScience.2014.44

T. A. Davis, Direct Methods for Sparse Linear Systems. Fundamentals of Algorithms, 2006.

T. A. Davis, J. R. Gilbert, S. I. Larimore, and E. G. Ng, Algorithm 836, ACM Transactions on Mathematical Software, vol.30, issue.3, pp.377-380, 2004.
DOI : 10.1145/1024074.1024080

T. A. Davis and Y. Hu, The university of Florida sparse matrix collection, ACM Transactions on Mathematical Software, vol.38, issue.1, pp.1-125, 2011.
DOI : 10.1145/2049662.2049663

F. Desprez and F. Suter, A Bi-criteria Algorithm for Scheduling Parallel Task Graphs on Clusters, 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing, pp.243-252, 2010.
DOI : 10.1109/CCGRID.2010.43

URL : https://hal.archives-ouvertes.fr/hal-00533904

D. E. Dolan and J. J. Moré, Benchmarking optimization software with performance profiles, Mathematical Programming, vol.91, issue.2, pp.201-213, 2002.
DOI : 10.1007/s101070100263

M. Drozdowski and W. Kubiak, Scheduling parallel tasks with sequential heads and tails, Annals of Operations Research, vol.90, issue.0, pp.221-246, 1999.
DOI : 10.1023/A:1018964732122

M. Drozdowski, Scheduling multiprocessor tasks ??? An overview, European Journal of Operational Research, vol.94, issue.2, pp.215-230, 1996.
DOI : 10.1016/0377-2217(96)00123-3

M. Drozdowski, Scheduling Parallel Tasks ? Algorithms and Complexity In: Handbook of Scheduling, p.1584883979, 2004.

J. Du and J. Y. Leung, Complexity of Scheduling Parallel Task Systems, SIAM Journal on Discrete Mathematics, vol.2, issue.4, pp.473-487, 1989.
DOI : 10.1137/0402042

I. S. Duff and J. K. Reid, The Multifrontal Solution of Indefinite Sparse Symmetric Linear, ACM Transactions on Mathematical Software, vol.9, issue.3, pp.302-325, 1983.
DOI : 10.1145/356044.356047

B. Dunten, J. Jones, and J. Sorenson, A space-efficient fast prime number sieve, Information Processing Letters, vol.59, issue.2, pp.79-84, 1996.
DOI : 10.1016/0020-0190(96)00099-3

C. Eisenbeis, F. Gasperoni, and U. Schwiegelshohn, Allocating registers in multiple instructionissuing processors, Proceedings of the IFIP WG10. 3 working conference on Parallel architectures and compilation techniques. IFIP Working Group on Algol, pp.290-293, 1995.
URL : https://hal.archives-ouvertes.fr/inria-00074059

L. Eyraud-dubois, L. Marchal, O. Sinnen, and F. Vivien, Parallel Scheduling of Task Trees with Limited Memory, ACM Transactions on Parallel Computing, vol.2, issue.2, p.13, 2015.
DOI : 10.1145/321607.321620

URL : https://hal.archives-ouvertes.fr/hal-01160118

L. Fan, F. Zhang, G. Wang, and Z. Liu, An effective approximation algorithm for the Malleable Parallel Task Scheduling problem, Journal of Parallel and Distributed Computing, vol.72, issue.5, pp.693-704, 2012.
DOI : 10.1016/j.jpdc.2012.01.011

D. Feitelson, Workload modeling for computer systems performance evaluation, In: Book Draft, pp.1-601, 2014.
DOI : 10.1017/CBO9781139939690

M. Frigo, C. E. Leiserson, H. Prokop, and S. Ramachandran, Cache-oblivious algorithms, Proceesdings of the 40th Annual Symposium on Foundations of Computer Science, pp.285-297, 1999.

M. R. Garey and D. S. Johnson, Computers and Intractability, a Guide to the Theory of NP- Completeness, 1979.

M. Garey, D. Johnson, and L. Stockmeyer, Some simplified NP-complete graph problems, Theoretical Computer Science, vol.1, issue.3, pp.237-267, 1976.
DOI : 10.1016/0304-3975(76)90059-1

T. Gautier, X. Besseron, and L. Pigeon, KAAPI, Proceedings of the 2007 international workshop on Parallel symbolic computation, PASCO '07, pp.15-23, 2007.
DOI : 10.1145/1278177.1278182

URL : https://hal.archives-ouvertes.fr/hal-00727795

A. V. Goldberg and R. E. Tarjan, A New Approach to the Maximum Flow Problem, Proceedings of the Eighteenth Annual ACM Symposium on Theory of Computing. STOC '86, pp.136-146, 1986.

D. Golovin, The B-skip-list: A simpler uniquely represented alternative to B-trees, 2010.

A. González-escribano, A. J. Van-gemund, and V. Cardeñoso-payo, Mapping Unstructured Applications into Nested Parallelism, High Performance Computing for Computational Science -VECPAR 2002, 5th International Conference, pp.407-420, 2002.

R. L. Graham, Bounds for Certain Multiprocessing Anomalies, Bell System Technical Journal, vol.45, issue.9, pp.1563-1581, 1966.
DOI : 10.1002/j.1538-7305.1966.tb01709.x

R. L. Graham, Bounds on Multiprocessing Timing Anomalies, SIAM Journal on Applied Mathematics, vol.17, issue.2, pp.416-429, 1969.
DOI : 10.1137/0117039

R. L. Graham, E. L. Lawler, J. K. Lenstra, and A. R. Kan, Optimization and Approximation in Deterministic Sequencing and Scheduling: a Survey, In: Annals of discrete mathematics, vol.5, pp.287-326, 1979.
DOI : 10.1016/S0167-5060(08)70356-X

S. L. Graham, M. Snir, and C. A. Patterson, Getting up to speed: The future of supercomputing, 2005.

D. Gries and J. Misra, A linear sieve algorithm for finding prime numbers, Communications of the ACM, vol.21, issue.12, pp.999-1003, 1978.
DOI : 10.1145/359657.359660

E. Günther, F. König, and N. Megow, Scheduling and packing malleable and parallel tasks with precedence constraints of bounded width, Journal of Combinatorial Optimization, vol.12, issue.1, pp.164-181, 2014.
DOI : 10.1287/ijoc.12.1.57.11901

G. Hardy, J. Littlewood, and G. Pólya, Inequalities In: Cambridge Mathematical Library, 1952.

J. D. Hartline, E. S. Hong, A. E. Mohr, W. R. Pentney, and E. C. Rocke, Characterizing history independent data structures, In: Algorithmica, vol.421, pp.57-74, 2005.

P. Hénon, P. Ramet, and J. Roman, PaStiX: a high-performance parallel direct solver for sparse symmetric positive definite systems, Parallel Computing, vol.28, issue.2, pp.301-321, 2002.
DOI : 10.1016/S0167-8191(01)00141-7

M. D. Hill and A. J. Smith, Evaluating associativity in CPU caches, IEEE Transactions on Computers, vol.38, issue.12, pp.1612-1630, 1989.
DOI : 10.1109/12.40842

S. Horsley and F. , KO?KINON EPATO??ENOY?. or, The Sieve of Eratosthenes Being an Account of His Method of Finding All the Prime Numbers, by the Rev, In: Philosophical Transactions, pp.327-347, 1772.

A. Hugo, A. Guermouche, P. Wacrenier, and R. Namyst, A Runtime Approach to Dynamic Resource Allocation for Sparse Direct Solvers, 2014 43rd International Conference on Parallel Processing, pp.481-490, 2014.
DOI : 10.1109/ICPP.2014.57

URL : https://hal.archives-ouvertes.fr/hal-01101054

A. Hugo, A. Guermouche, P. Wacrenier, and R. Namyst, Composing multiple StarPU applications over heterogeneous machines: A supervised approach, In: IJHPCA, vol.283, pp.285-300, 2014.
URL : https://hal.archives-ouvertes.fr/hal-00824514

S. Hunold, One step toward bridging the gap between theory and practice in moldable task scheduling with precedence constraints, Concurrency and Computation: Practice and Experience, pp.1010-1026, 2015.
DOI : 10.1145/944618.944621

M. S. Hybertsen and S. G. Louie, Electron correlation in semiconductors and insulators: Band gaps and quasiparticle energies, Physical Review B, vol.1, issue.8, p.5390, 1986.
DOI : 10.1103/PhysRevB.1.910

C. Imreh, Scheduling Problems on Two Sets of Identical Machines, Computing, vol.70, issue.4, pp.277-294, 2003.
DOI : 10.1007/s00607-003-0011-9

M. Jacquelin, L. Marchal, Y. Robert, and B. Uçar, On Optimal Tree Traversals for Sparse Matrix Factorization, 2011 IEEE International Parallel & Distributed Processing Symposium, pp.556-567, 2011.
DOI : 10.1109/IPDPS.2011.60

URL : https://hal.archives-ouvertes.fr/ensl-00527462

A. Jain and C. Lin, Back to the future, 2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA), pp.78-89, 2016.
DOI : 10.1145/2155620.2155672

K. Jansen and H. Zhang, An Approximation Algorithm for Scheduling Malleable Tasks Under General Precedence Constraints, Z. Du. Lecture Notes in Computer Science, vol.3827, pp.236-245, 2005.
DOI : 10.1007/11602613_25

K. Jansen and H. Zhang, Scheduling malleable tasks with precedence constraints, In: Journal of Computer and System Sciences, vol.781, pp.245-259, 2012.

R. M. Karp, Reducibility among combinatorial problems In: Complexity of computer computations, pp.85-103, 1972.

S. Kedad-sidhoum, F. Monna, and D. Trystram, Scheduling Tasks with Precedence Constraints on Hybrid Multi-core Machines, 2015 IEEE International Parallel and Distributed Processing Symposium Workshop, pp.27-33
DOI : 10.1109/IPDPSW.2015.119

URL : https://hal.archives-ouvertes.fr/hal-01230286

N. Kell and J. Havill, Improved upper bounds for online malleable job scheduling, Journal of Scheduling, vol.17, issue.4, pp.393-410, 2015.
DOI : 10.1007/s10878-007-9125-x

H. Kellerer, R. Mansini, U. Pferschy, and M. G. Speranza, An efficient fully polynomial approximation scheme for the Subset-Sum Problem, Journal of Computer and System Sciences, vol.66, issue.2, pp.349-370, 2003.
DOI : 10.1016/S0022-0000(03)00006-0

C. Lam, T. Rauber, G. Baumgartner, D. Cociorva, and P. Sadayappan, Memory-optimal evaluation of expression trees involving large objects, In: Computer Languages, Systems & Structures, vol.372, pp.63-75, 2011.

M. Lampis, G. Kaouri, and V. Mitsou, On the algorithmic effectiveness of digraph decompositions and complexity measures, In: Discrete Optimization, vol.81, pp.129-138, 2011.

E. L. Lawler, Combinatorial optimization: networks and matroids, Courier Corporation, 2001.

E. A. Lee and D. G. Messerschmitt, Synchronous data flow, Proceedings of the IEEE, vol.75, issue.9, pp.1235-1245, 1987.
DOI : 10.1109/PROC.1987.13876

T. J. Lee and G. E. Scuseria, Achieving Chemical Accuracy with Coupled-Cluster Theory, pp.47-108, 1995.
DOI : 10.1007/978-94-011-0193-6_2

R. Lepère, D. Trystram, and G. J. Woeginger, Approximation algorithms for scheduling malleable tasks under precedence constraints, In: International Journal of Foundations of Computer Science, vol.13, pp.4-613, 2002.

J. Y. Leung, Handbook of scheduling: algorithms, models, and performance analysis, 2004.

X. S. Li, An overview of SuperLU, ACM Transactions on Mathematical Software, vol.31, issue.3, pp.302-325, 2005.
DOI : 10.1145/1089014.1089017

J. W. Liu, The Role of Elimination Trees in Sparse Factorization, SIAM Journal on Matrix Analysis and Applications, vol.11, issue.1, pp.134-172, 1990.
DOI : 10.1137/0611010

J. W. Liu, An Application of Generalized Tree Pebbling to Sparse Matrix Factorization, SIAM Journal on Algebraic Discrete Methods, vol.8, issue.3, pp.375-395, 1987.
DOI : 10.1137/0608031

J. W. Liu, On the storage requirement in the out-of-core multifrontal method for sparse factorization, ACM Transactions on Mathematical Software, vol.12, issue.3, 1986.
DOI : 10.1145/7921.11325

K. Makarychev and D. Panigrahi, Precedence-Constrained Scheduling of Malleable Jobs with Preemption, pp.823-834, 2014.
DOI : 10.1007/978-3-662-43948-7_68

E. Mäkinen, Generating random binary trees ??? A survey, Information Sciences, vol.115, issue.1-4, pp.1-4, 1999.
DOI : 10.1016/S0020-0255(98)10080-4

J. M. Martin, Benchmark Studies on Small Molecules, In: Encyclopedia of Computational Chemistry, vol.102, 1998.
DOI : 10.1063/1.469398

R. L. Mattson, J. Gecsei, D. R. Slutz, and I. L. Traiger, Evaluation techniques for storage hierarchies, IBM Systems Journal, vol.9, issue.2, pp.78-117, 1970.
DOI : 10.1147/sj.92.0078

R. Mcnaughton, Scheduling with Deadlines and Loss Functions, Management Science, vol.6, issue.1, pp.1-12, 1959.
DOI : 10.1287/mnsc.6.1.1

D. Micciancio, Oblivious data structures, Proceedings of the twenty-ninth annual ACM symposium on Theory of computing , STOC '97, pp.456-464, 1997.
DOI : 10.1145/258533.258638

P. Michaud, Some Mathematical Facts About Optimal Cache Replacement, ACM Transactions on Architecture and Code Optimization, vol.13, issue.4, p.50, 2016.
DOI : 10.1016/j.ipl.2007.12.001

URL : https://hal.archives-ouvertes.fr/hal-01411156

V. Nagarajan, J. Wolf, A. Balmin, and K. Hildrum, FlowFlex: Malleable Scheduling for Flows of MapReduce Jobs, pp.103-122, 2013.
DOI : 10.1007/978-3-642-45065-5_6

URL : https://hal.archives-ouvertes.fr/hal-01480794

M. Naor and V. Teague, Anti-persistence: history independent data structures, Proc. of the 33rd Annual ACM Symposium on Theory of Computing (STOC), pp.492-501, 2001.

R. Openmp-architecture and . Board, OpenMP Application Program Interface, Version 4, 2013.

F. Pellegrini and J. Roman, Sparse matrix ordering with Scotch, International Conference on High-Performance Computing and Networking, pp.370-378, 1997.
DOI : 10.1007/BFb0031609

P. Shor, stackexchange.com/users/198/peter-shor) Minimum s-t cut in weighted directed acyclic graphs with possibly negative weights

J. Planas, R. M. Badia, E. Ayguadé, and J. Labarta, Hierarchical Task-Based Programming With StarSs, The International Journal of High Performance Computing Applications, vol.17, issue.1, pp.284-299, 2009.
DOI : 10.1109/5.476078

A. Pothen and C. Sun, A Mapping Algorithm for Parallel Sparse Cholesky Factorization, SIAM Journal on Scientific Computing, vol.14, issue.5, pp.1253-1257, 1993.
DOI : 10.1137/0914074

G. N. Prasanna and B. R. Musicus, Generalized multiprocessor scheduling and applications to matrix computations, IEEE Transactions on Parallel and Distributed Systems, vol.7, issue.6, pp.650-664, 1996.
DOI : 10.1109/71.506703

G. N. Prasanna and B. R. Musicus, The Optimal Control Approach to Generalized Multiprocessor Scheduling, Algorithmica, vol.15, issue.1, pp.17-49, 1996.
DOI : 10.1007/s004539900002

W. Pugh, Skip lists: a probabilistic alternative to balanced trees, In: Communications of the ACM, vol.336, pp.668-676, 1990.

A. Radulescu and A. J. Van-gemund, A low-cost approach towards mixed task and data parallel scheduling, International Conference on Parallel Processing, 2001., pp.69-76, 2001.
DOI : 10.1109/ICPP.2001.952048

A. Ramakrishnan, G. Singh, H. Zhao, E. Deelman, R. Sakellariou et al., Scheduling Data-IntensiveWorkflows onto Storage-Constrained Distributed Resources, Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07), pp.401-409, 2007.
DOI : 10.1109/CCGRID.2007.101

F. Sainz, S. Mateo, V. Beltran, J. L. Bosque, X. Martorell et al., Leveraging OmpSs to Exploit Hardware Accelerators, 2014 IEEE 26th International Symposium on Computer Architecture and High Performance Computing, pp.112-119
DOI : 10.1109/SBAC-PAD.2014.26

E. Saule, H. M. Aktulga, C. Yang, E. G. Ng, and Ü. V. Çatalyürek, An Out-of-Core Task-based Middleware for Data-Intensive Scientific Computing, pp.647-667, 2015.
DOI : 10.1007/978-1-4939-2092-1_22

D. Sb?rlea, Z. Budimli´cbudimli´c, and V. Sarkar, Bounded memory scheduling of dynamic task graphs, Proceedings of the 23rd international conference on Parallel architectures and compilation, PACT '14, pp.343-356, 2014.
DOI : 10.1145/2628071.2628090

M. Sergent, D. Goudin, S. Thibault, and O. Aumage, Controlling the Memory Subscription of Distributed Applications with a Task-Based Runtime System, 2016 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), pp.318-327
DOI : 10.1109/IPDPSW.2016.105

URL : https://hal.archives-ouvertes.fr/hal-01284004

R. Sethi, Complete register allocation problems, In: SIAM journal on Computing, vol.43, pp.226-248, 1975.

R. Sethi and J. Ullman, The Generation of Optimal Code for Arithmetic Expressions, Journal of the ACM, vol.17, issue.4, pp.715-728, 1970.
DOI : 10.1145/321607.321620

J. P. Sorenson, The Pseudosquares Prime Sieve, pp.193-207, 2006.
DOI : 10.1007/11792086_15

F. Suter, DAGGEN: A synthetic task graph generator

O. Svensson, Hardness of precedence constrained scheduling on identical machines, In: SIAM Journal on Computing, vol.405, pp.1258-1274, 2011.

T. Tobita and H. Kasahara, A standard task graph set for fair evaluation of multiprocessor scheduling algorithms, Journal of Scheduling, vol.70, issue.5, pp.379-394, 2002.
DOI : 10.1109/TC.1973.5009153

S. Toledo, A survey of out-of-core algorithms in numerical linear algebra, Proceedings of a DIMACS Workshop, pp.161-180, 1998.
DOI : 10.1090/dimacs/050/09

S. Tomov, J. Dongarra, and M. Baboulin, Towards dense linear algebra for hybrid GPU accelerated manycore systems, Parallel Computing, vol.36, issue.5-6, pp.5-6, 2010.
DOI : 10.1016/j.parco.2009.12.005

T. Supercomputer-site, , 2017.

H. Topcuoglu, S. Hariri, and M. Y. Wu, Performance-effective and low-complexity task scheduling for heterogeneous computing, IEEE Transactions on Parallel and Distributed Systems, vol.13, issue.3, pp.260-274, 2002.
DOI : 10.1109/71.993206

S. Touati, Register Pressure in Instruction Level Parallelism, Theses. Université de Versailles- Saint Quentin en Yvelines, 2002.
URL : https://hal.archives-ouvertes.fr/tel-00007405

J. Valdes, R. E. Tarjan, and E. L. Lawler, The Recognition of Series Parallel Digraphs, SIAM Journal on Computing, vol.11, issue.2, pp.298-313, 1982.
DOI : 10.1137/0211023

V. Vizing, Minimization of the maximum delay in servicing systems with interruption, USSR Computational Mathematics and Mathematical Physics, vol.22, issue.3, pp.227-233, 1982.
DOI : 10.1016/0041-5553(82)90143-4

Q. Wang and K. Cheng, A Heuristic of Scheduling Parallel Tasks and Its Analysis, SIAM Journal on Computing, vol.21, issue.2, pp.281-294, 1992.
DOI : 10.1137/0221021

M. H. Wiggers, M. J. Bekooij, and G. J. Smit, Efficient Computation of Buffer Capacities for Cyclo-Static Dataflow Graphs, 44th ACM/IEEE Design Automation Conference, pp.658-663, 2007.

M. H. Wiggers, M. J. Bekooij, P. G. Jansen, and G. J. Smit, Efficient Computation of Buffer Capacities for Cyclo-Static Real-Time Systems with Back-Pressure, 13th IEEE Real Time and Embedded Technology and Applications Symposium (RTAS'07), pp.281-292, 2007.
DOI : 10.1109/RTAS.2007.12

D. E. Willard, A density control algorithm for doing insertions and deletions in a sequentially ordered file in a good worst-case time, Information and Computation, vol.97, issue.2, pp.150-204, 1992.
DOI : 10.1016/0890-5401(92)90034-D

A. Yarkhan, J. Kurzak, and J. Dongarra, Quark users' guide: Queueing and runtime for kernels, 2011.

Y. Zinder and S. Walker, Scheduling flexible multiprocessor tasks on parallel machines In: The 9th Workshop on Models and Algorithms for Planning and Scheduling Problems, 2009.

J. B. Simon, B. Jaumard, and T. H. Le, Deadlock Avoidance and Detection in Railway Simulation Systems, Articles in International Refereed Journals [, pp.45-52, 2014.
DOI : 10.1016/j.trb.2008.03.003

L. Marchal, B. Simon, O. Sinnen, and F. Vivien, Malleable Task-Graph Scheduling with a Practical Speed-Up Model, IEEE Transactions on Parallel and Distributed Systems, vol.29, issue.6, 2018.
DOI : 10.1109/TPDS.2018.2793886

URL : https://hal.archives-ouvertes.fr/hal-01687189

C. B. Simon, B. Jaumard, and T. H. Le, Deadlock Avoidance and Detection in Railway Simulation Systems In: Joint Rail Conference American Society of Mechanical Engineers, Articles in International Refereed Conferences [, pp.10-1115, 2014.

L. [. Guermouche, B. Marchal, F. Simon, and . Vivien, Scheduling Trees of Malleable Tasks for Sparse Linear Algebra, European Conference on Parallel Processing (Euro-Par). 2015, pp.479-490
DOI : 10.1007/978-3-662-48096-0_37

URL : https://hal.archives-ouvertes.fr/hal-01160104

M. A. Bender, J. Berry, R. Johnson, T. M. Kroeger, S. Mccauley et al., Anti-Persistence on Persistent Storage, Proceedings of the 35th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems, PODS '16, 2902251.
DOI : 10.1016/0890-5401(92)90034-D

URL : https://hal.archives-ouvertes.fr/hal-01326312

M. A. Bender, S. Mccauley, B. Simon, S. Singh, and F. Vivien, Resource Optimization for Program Committee Members: A Subreview Article In: Fun with Algorithms (FUN)

M. A. Bender, R. Chowdhury, A. Conway, M. Farach-colton, P. Ganapathi et al., The I/O Complexity of Computing Prime Tables, 12th Latin American Theoretical Informatics Symposium (LATIN). 2016. DOI: 10.1007, pp.978-981
DOI : 10.1007/978-3-662-49529-2_15

URL : https://hal.archives-ouvertes.fr/hal-01326317

[. Canon, L. Marchal, B. Simon, and F. Vivien, Online Scheduling of Sequential Task Graphs on Hybrid Platforms, European Conference on Parallel Processing, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01720064

L. Marchal, H. Nagy, B. Simon, and F. Vivien, Parallel scheduling of DAGs under memory constraints, IPDPS 2018-32st IEEE International Parallel & Distributed Processing Symposium, 2018.
URL : https://hal.archives-ouvertes.fr/hal-01828312

W. L. Marchal, S. Mccauley, B. Simon, and F. Vivien, Minimizing I/Os in out-of-core task tree scheduling, 2017 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW), 2017.
DOI : 10.1109/IPDPSW.2017.58

URL : https://hal.archives-ouvertes.fr/hal-01462213

R. Reports, [. Simon, B. Jaumard, and T. H. Le, Deadlock Avoidance and Detection In Railway Simulation Systems, pp.2013-2056

L. Marchal, B. Simon, and F. Vivien, Scheduling Malleable Task Trees
URL : https://hal.archives-ouvertes.fr/hal-01059704

L. [. Guermouche, B. Marchal, F. Simon, and . Vivien, Scheduling Trees of Malleable Tasks for Sparse Linear Algebra URL: https, INRIA Research Report, vol.8616, 2014.

L. Marchal, B. Simon, O. Sinnen, and F. Vivien, Malleable task-graph scheduling with a practical speed-up model URL: https, INRIA Research Report, vol.8856, 2016.

L. Marchal, S. Mccauley, B. Simon, and F. Vivien, Minimizing I/Os in Out-of-Core Task Tree Scheduling URL: https, INRIA Research Report, vol.9025, 2017.

L. Marchal, H. Nagy, B. Simon, and F. Vivien, Parallel scheduling of DAGs under memory constraints
URL : https://hal.archives-ouvertes.fr/hal-01828312

[. Canon, L. Marchal, B. Simon, and F. Vivien, Online Scheduling of Sequential Task Graphs on Hybrid Platforms URL: https, INRIA Research Report, vol.9150, 2018.