, Zoltan: Parallel partitioning, load balancing and data-management services

A. Thevenin, A. Piacentini, T. Morel, and F. Duchaine, Open-palm: an open source dynamic parallel coupler, IV International Conference on Computational Methods for Coupled Problems in Science and Engineering, 2011.

J. Amaya, E. Collado, B. Cuenot, and T. Poinsot, Coupling LES, radiation and structure in gas turbine simulations, Proceedings of the Summer Program, 2010.

G. Anciaux, J. Coulaud, and . Roman, High performance multiscale simulation or crack propagation, Parallel Processing Workshops, 2006. ICPP 2006 Workshops. 2006 International Conference on, 2006.

C. Aykanat, B. Barla, F. Cambazoglu, T. Findik, and . Kurc, Adaptive decomposition and remapping algorithms for object-spaceparallel direct volume rendering of unstructured grids, J. Parallel Distrib. Comput, vol.67, pp.77-99, 2007.

C. Aykanat, B. Barla, B. Cambazoglu, and . Uçar, Multi-level direct k-way hypergraph partitioning with multiple constraints and fixed vertices, J. Parallel Distrib. Comput, vol.68, pp.609-625, 2008.
URL : https://hal.archives-ouvertes.fr/hal-00803479

. Davida, H. Bader, P. Meyerhenke, C. Sanders, A. Schulz et al., Benchmarking for graph clustering and partitioning, Encyclopedia of Social Network Analysis and Mining, pp.73-82, 2014.

R. Battiti and A. Bertossi, Differential greedy for the 0-1 equicut problem, Proceedings of the DIMACS Workshop on Network Design: Connectivity and Facilities Location, pp.3-21, 1997.

M. J. Berger, H. Shahid, and . Bokhari, A partitioning strategy for nonuniform problems on multiprocessors, IEEE Trans. Computers, vol.36, issue.5, pp.570-580, 1987.

S. Berger, S. Richard, F. Duchaine, G. Staffelbach, and L. Y. Gicquel, On the sensitivity of a helicopter combustor wall temperature to convective and radiative thermal loads, Applied Thermal Engineering, vol.103, pp.1450-1459, 2016.

J. T. Betts and W. P. Huffman, Mesh refinement in direct transcription methods for optimal control, Optimal Control Applications & Methods, vol.19, pp.1-21, 1998.

J. T. Betts, Practical Methods for Optimal Control and Estimation Using Nonlinear Programming, 2009.

A. Byron, . Boville, and . Peter-r-gent, The ncar climate system model, version one*, Journal of Climate, vol.11, issue.6, pp.1115-1130, 1998.

C. Thang-nguyen-bui and . Jones, A heuristic for reducing fill-in in sparse matrix factorization, PPSC, pp.445-452, 1993.

A. Buluç and K. Madduri, Parallel breadth-first search on distributed memory systems, Proceedings of 2011 International Conference for High Performance Computing, Networking, Storage and Analysis, SC '11, vol.65, pp.1-65, 2011.

A. E. Caldwell, A. B. Kahng, A. A. Kennings, and I. L. Markov, Hypergraph partitioning for VLSI CAD: Methodology for heuristic development, experimentation and reporting, Proceedings of the 36th Annual ACM/IEEE Design Automation Conference, DAC '99, pp.349-354, 1999.
DOI : 10.1109/dac.1999.781340

URL : http://www.cs.york.ac.uk/rts/docs/SIGDA-Compendium-1994-2004/papers/1999/dac99/pdffiles/21_2.pdf

P. M. Campbell, K. D. Devine, J. E. Flaherty, L. G. Gervasio, and J. D. Teresco, Dynamic octree load balancing using spacefilling curves, 2003.

U. V. Catalyurek, E. G. Boman, K. D. Devine, D. Bozdag, R. Heaphy et al., Hypergraph-based dynamic load balancing for adaptive scientific computations, IEEE International Parallel and Distributed Processing Symposium, pp.1-11, 2007.
DOI : 10.1109/ipdps.2007.370258

URL : http://www.sandia.gov/~egboman/papers/Catalyurek_IPDPS07.pdf

U. Catalyurek and C. Aykanat, Hypergraph-partitioning-based decomposition for parallel sparse-matrix vector multiplication, IEEE Trans. Parallel Distrib. Syst, vol.10, issue.7, pp.673-693, 1999.
DOI : 10.1109/71.780863

URL : http://www.cs.bilkent.edu.tr/~aykanat/papers/99IEEETPDS.pdf

V. Umit, E. G. Catalyurek, K. D. Boman, D. Devine, R. T. Bozda?-g et al., A repartitioning hypergraph model for dynamic load balancing, J. Parallel Distrib. Comput, vol.69, issue.8, pp.711-724, 2009.

P. Ciarlet and F. Lamour, On the validity of a front-oriented approach to partitioning large sparse graphs with a connectivity constraint, Numerical Algorithms, vol.12, issue.1, pp.193-214, 1996.

M. Anthony-p-craig, R. Vertenstein, and . Jacob, A new flexible coupler for earth system modeling developed for ccsm4 and cesm1. International Journal of High Performance Computing Applications, vol.26, pp.31-42, 2012.

G. Cybenko, Dynamic load balancing for distributed memory multiprocessors, Journal of Parallel and Distributed Computing, vol.7, issue.2, pp.279-301, 1989.
DOI : 10.1016/0743-7315(89)90021-x

URL : http://www.dartmouth.edu/~gvc/Cybenko_JPDP.pdf

J. M. Dennis, M. Vertenstein, P. H. Worley, A. A. Mirin, A. P. Craig et al., Computational performance of ultra-high-resolution capability in the community earth system model, IJHPCA, vol.26, issue.1, pp.5-16, 2012.

R. Diekmann, R. Preis, F. Schlimbach, and C. Walshaw, Shape-optimized mesh partitioning and load balancing for parallel adaptive FEM, Parallel Computing, vol.26, issue.12, pp.1555-1581, 2000.

F. Duchaine, . Jaure, . Poitou, G. Quemerais, . Staffelbach et al., High performance conjugate heat transfer with the openpalm coupler, V International Conference on Coupled Problems in Science and Engineering, 2013.

A. E. Dunlop and B. W. Kernighan, A procedure for placement of standard-cell VLSI circuits, IEEE Trans. on CAD of Integrated Circuits and Systems, vol.4, issue.1, pp.92-98, 1985.

C. Farhat, A simple and efficient automatic fem domain decomposer, Computers & Structures, vol.28, issue.5, pp.579-602, 1988.

, Load Balancing for Parallel Coupled Simulations

C. M. Fiduccia and R. M. Mattheyses, A linear-time heuristic for improving network partitions, 19th Design Automation Conference, pp.175-181, 1982.

R. Michael, D. S. Garey, and . Johnson, Computers and Intractibility: A Guide to the Theory of NP-Completeness, 1979.

B. Hendrickson and K. Devine, Dynamic load balancing in computational mechanics, In Computer Methods in Applied Mechanics and Engineering, vol.184, pp.485-500, 2000.

B. Hendrickson and T. G. Kolda, Graph partitioning models for parallel computing, Parallel Comput, vol.26, issue.12, pp.1519-1534, 2000.

B. Hendrickson and R. Leland, An improved spectral graph partitioning algorithm for mapping parallel computations, SIAM J. Sci. Comput, vol.16, issue.2, 1995.

B. Cuenot, J. Amaya, E. Collado, and T. Poinsot, Coupling les, radiation and structure in gas turbine simulation, Proceedings of the Summer Program Center for Turbulence Research, 2019.

S. Jain, C. Swamy, and K. Balaji, Greedy algorithms for kway graph partitioning, the 6th international conference on advanced computing, 1998.

S. Jauré, F. Duchaine, and L. Gicquel, Comparisons of coupling strategies for massively parallel conjugate heat transfer with large eddy simulation, IV International Conference on Computational Methods for Coupled Problems in Science and Engineering, 2011.

W. Joppich and M. Kürschner, Mpcci-a tool for the simulation of coupled applications, Concurrency and Computation: Practice and Experience, vol.18, issue.2, pp.183-192, 2006.

G. Karypis and V. Kumar, Multilevel algorithms for multi-constraint graph partitioning, Proceedings of the 1998 ACM/IEEE Conference on Supercomputing, SC '98, pp.1-13, 1998.

G. Karypis and V. Kumar, Multilevel k-way partitioning scheme for irregular graphs, Journal of Parallel and Distributed Computing, vol.48, pp.96-129, 1998.

G. Karypis, H. Metis, and P. ,

G. Karypis, Multi-constraint mesh partitioning for contact/impact computations, Proceedings of the 2003 ACM/IEEE Conference on Supercomputing, SC '03, p.56, 2003.

G. Karypis and V. Kumar, A fast and high quality multilevel scheme for partitioning irregular graphs, SIAM J. Sci. Comput, vol.20, issue.1, pp.359-392, 1998.

B. W. Kernighan and S. Lin, An efficient heuristic procedure for partitioning graphs, Bell System Technical Journal, vol.49, pp.291-307, 1970.

S. Kirk, K. George, and K. Vipin, Wavefront diffusion and LMSR: Algorithms for dynamic repartitioning of adaptive meshes, IEEE Trans. Parallel Distrib. Syst, vol.12, issue.5, pp.451-466, 2001.

H. W. Kuhn, The hungarian method for the assignment problem, Naval Research Logistics Quarterly, vol.2, issue.1-2, pp.83-97, 1955.

J. Larson, R. Jacob, and E. Ong, The model coupling toolkit: A new fortran90 toolkit for building multiphysics parallel coupled models, Int. J. High Perform. Comput. Appl, vol.19, issue.3, pp.277-292, 2005.

J. Walter-larson, Ten organising principles for coupling in multiphysics and multiscale models, ANZIAM Journal, vol.48, pp.1090-1111, 2009.

R. Leland and B. Hendrickson, A multilevel algorithm for partitioning graphs, ACM/IEEE conference on Supercomputing, 1995.

M. Leordeanu and M. Hebert, A spectral technique for correspondence problems using pairwise constraints, Proceedings of the Tenth IEEE International Conference on Computer Vision, vol.2, pp.1482-1489, 2005.

B. Henning-meyerhenke, S. Monien, and . Schamberger, Accelerating shape optimizing load balancing for parallel fem simulations by algebraic multigrid, Proceedings of the 20th International Conference Load Balancing for Parallel Coupled Simulations on Parallel and Distributed Processing, IPDPS'06, pp.57-57, 2006.

T. Minyard and Y. Kallinderis, Parallel load balancing for dynamic execution environments, Computer Methods in Applied Mechanics and Engineering, vol.189, pp.1295-1309, 2000.

L. Oliker and R. Biswas, Plum: parallel load balancing for adaptive unstructured meshes, J. Parallel Distrib. Comput, vol.52, pp.150-177, 1998.

F. Pellegrini,

J. R. Pilkington and S. B. Baden, Dynamic partitioning of non-uniform structured workloads with spacefilling curves. Parallel and Distributed Systems, IEEE Transactions on, vol.7, issue.3, pp.288-300, 1996.

S. Plimpton, B. Hendrickson, S. Attaway, J. Swegle, C. Vaughan et al., Transient dynamics simulations: Parallel algorithms for contact detection and smoothed particle hydrodynamics, Proceedings of the 1996 ACM/IEEE Conference on Supercomputing, Supercomputing '96, 1996.

S. Simon-portegies-zwart, B. Mcmillan, D. Breanndán´o-nualláin, J. Heggie, P. Lombardi et al., Joris van Bever, Ofer Yaron, and Marcel Zemp. A Multiphysics and Multiscale Software Environment for Modeling Astrophysical Systems, pp.207-216, 2008.

A. Pothen, H. D. Simon, and K. Liou, Partitioning sparse matrices with eigenvectors of graphs, SIAM J. Matrix Anal. Appl, vol.11, issue.3, pp.430-452, 1990.

L. A. Sanchis, Multiple-way network partitioning, vol.38, pp.62-81, 1989.

P. Sanders and C. Schulz, Think Locally, Act Globally: Highly Balanced Graph Partitioning, Proceedings of the 12th International Symposium on Experimental Algorithms (SEA'13), vol.7933, pp.164-175, 2013.

K. Schloegel, G. Karypis, and V. Kumar, Multilevel diffusion schemes for repartitioning of adaptive meshes, Journal of Parallel and Distributed Computing, vol.47, issue.2, pp.109-124, 1997.

S. Gualdi, A. Bellucci, A. Sanna, P. G. Fogli, E. Manzini et al., Effects of tropical cyclones on ocean heat transport in a high resolution coupled general circulation model, Journal of Climate, vol.24, pp.4368-4384, 2011.

H. D. Simon, Partitioning of unstructured problems for parallel processing, Comp. Sys. Engng, vol.2, pp.135-148, 1991.

D. Horst, S. Simon, and . Teng, How good is recursive bisection?, SIAM J. Sci. Comput, vol.18, pp.1436-1445, 1995.

J. D. Teresco, K. D. Devine, and J. E. Flaherty, Partitioning and dynamic load balancing for the numerical solution of partial differential equations, Numerical Solution of Partial Differential Equations on Parallel Computers, vol.51, pp.55-88, 2006.

L. Tessé, F. Dupoirieux, and J. Taine, Monte carlo modeling of radiative transfer in a turbulent sooty flame, International Journal of Heat and Mass Transfer, vol.47, issue.3, pp.555-572, 2004.

S. Valcke, The oasis3 coupler: a european climate modelling community software, Journal of Geosci. Model Dev, pp.373-388, 2013.

R. Van-driessche and D. Roose, Dynamic load balancing with a spectral bisection algorithm for the constrained graph partitioning problem, High-Performance Computing and Networking, vol.919, pp.392-397, 1995.

B. Vastenhouw and R. H. Bisseling, A two-dimensional data distribution method for parallel sparse matrix-vector multiplication, SIAM Rev, vol.47, issue.1, pp.67-95, 2005.

C. Vuchener and A. Esnard, Dynamic Load-Balancing with Variable Number of Processors based on Graph Repartitioning, Proceedings of High Performance Computing, 2012.
URL : https://hal.archives-ouvertes.fr/hal-00687073

C. Vuchener and A. Esnard, Graph Repartitioning with both Dynamic Load and Dynamic Processor Allocation, International Conference on Parallel Computing-ParCo2013, Advances of Parallel Computing, pp.243-252, 2013.
URL : https://hal.archives-ouvertes.fr/hal-00857881

, Load Balancing for Parallel Coupled Simulations

C. Walshaw and M. Cross, Multilevel mesh partitioning for heterogeneous communication networks, Future Generation Comput. Syst, vol.17, pp.601-623, 2001.

C. Walshaw, M. Cross, and K. Mcmanus, Dynamic load balancing of mesh-based applications on parallel, Applied Mathematical Modelling, vol.25, issue.2, pp.123-140, 2000.

M. W. Washington, W. J. Weatherly, A. G. Meehl, J. A. Semtner, W. T. Bettge et al., Parallel climate model (pcm) control and transient simulations, Climate Dynamics, vol.16, issue.10, pp.755-774, 2000.

Y. Wu, . Haworth, B. Mf-modest, and . Cuenot, Direct numerical simulation of turbulence/radiation interaction in premixed combustion systems, Proceedings of the Combustion Institute, vol.30, pp.639-646, 2005.

Y. Zhang, R. Vicquelin, O. Gicquel, and J. Taine, Physical study of radiation effects on the boundary layer structure in a turbulent channel flow, International Journal of Heat and Mass Transfer, vol.61, pp.654-666, 2013.
URL : https://hal.archives-ouvertes.fr/hal-01344924

S. Zimny, B. Chopard, O. Malaspinas, E. Lorenz, K. Jain et al., A multiscale approach for the coupled simulation of blood flow and thrombus formation in intracranial aneurysms, Procedia Computer Science, vol.18, pp.1006-1015, 2013.

¨. Umit, V. Catalyürek, and C. Aykanat, PaToH: A Multilevel Hypergraph Partitioning Tool, 1999.

M. Predari and A. Esnard, Coupling-Aware Graph Partitioning Algorithms: Preliminary Study, IEEE International Conference on High Performance Computing, 2014.
URL : https://hal.archives-ouvertes.fr/hal-01069578

M. Predari and A. Esnard, Graph Operators for Couplingaware Graph Partitioning Algorithms, CIMI Workshop on Innovative clustering methods for large graphs and block methods, 2015.
URL : https://hal.archives-ouvertes.fr/hal-01203006

M. Predari and A. Esnard, New graph partitioning techniques for load balancing of coupled simulation. womENcourage 2015, septembre 2015. Poster
URL : https://hal.archives-ouvertes.fr/hal-01258036

M. Predari and A. Esnard, A k-way Greedy Graph Partitioning with Initial Fixed Vertices for Parallel Applications, 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, Parallel, Distributed, and Network-Based Processing, 2016.
URL : https://hal.archives-ouvertes.fr/hal-01277392

M. Predari and A. Esnard, Graph partitioning techniques for load balancing of coupled simulations. SIAM Workshop on Combinatorial Scientific Computing (CSC16) , octobre 2016. Poster
URL : https://hal.archives-ouvertes.fr/hal-01399392

M. Predari, A. Esnard, and J. Roman, Comparison of Methods for Graph Partitioning with Initial Fixed Vertices, In Parallel Computing Journal (under submission), 2016.
URL : https://hal.archives-ouvertes.fr/hal-01538600