Skip to Main content Skip to Navigation

Benchmark-driven approaches to performance modeling of multi-core architectures

Bertrand Putigny 1, 2 
2 RUNTIME - Efficient runtime systems for parallel architectures
Inria Bordeaux - Sud-Ouest, UB - Université de Bordeaux, CNRS - Centre National de la Recherche Scientifique : UMR5800
Abstract : In the race for better performance, computer architectures are becoming more and more complex. Therefore the need for hardware models is crucial to i) tune software to the underling architecture, ii) build tools to better exploit hardware or iii) choose an architecture according to the needs of a given application. In this dissertation, we aim at describing how to build a hardware model that targets all critical parts of modern computer architecture. That is the processing unit itself, memory and even power consumption. We believe that a large part of hardware modeling can be done automatically. This would relieve people from the tiresome task of doing it by hand. Our first contribution is a set of performance models for the on-core part of several different CPUs. This part of an architecture model is called the computational model. The computational model targeting the Intel SCC chip also includes a power model allowing for power aware performance optimization. Our other main contribution is an auto-tuned memory hierarchy model for general purpose CPUs able to i) predict performance of memory bound computations, ii) provide programmer with programming guidelines to improve software memory behavior.
Complete list of metadata

Cited literature [85 references]  Display  Hide  Download
Contributor : ABES STAR :  Contact
Submitted on : Tuesday, November 24, 2015 - 12:00:07 PM
Last modification on : Saturday, June 25, 2022 - 7:44:22 PM
Long-term archiving on: : Friday, April 28, 2017 - 2:35:10 PM


Version validated by the jury (STAR)


  • HAL Id : tel-00984791, version 4



Bertrand Putigny. Benchmark-driven approaches to performance modeling of multi-core architectures. Distributed, Parallel, and Cluster Computing [cs.DC]. Université de Bordeaux, 2014. English. ⟨NNT : 2014BORD0155⟩. ⟨tel-00984791v4⟩



Record views


Files downloads