Améliorer la performance séquentielle à l’ère des processeurs massivement multicœurs [Improving sequential performance in the era of massively multicore processors]

Nathanaël Prémillieu 1
1 ALF - Amdahl's Law is Forever
Inria Rennes – Bretagne Atlantique, IRISA-D3 - ARCHITECTURE
Abstract : Computers are everywhere, and the need for ever more computing power has pushed processor architects to find new ways to increase performance. Today's trend is to replicate execution cores on the same die to parallelize execution. If this trend continues, processors will become manycores featuring hundreds to a thousand cores. However, Amdahl's law reminds us that increasing sequential performance will always be vital to increasing overall performance. A prime way to increase sequential performance is to improve how branches are executed, because they limit instruction-level parallelism. Branch prediction is the most studied solution, and its benefit depends greatly on its accuracy. In recent years, this accuracy has been continuously improved, up to a limit that is now hard to exceed. Another solution is to eliminate branches by replacing them with a construct based on predicated instructions. However, executing predicated instructions on out-of-order processors raises several problems, such as the multiple definition problem. This study investigates these two aspects of branch handling. The first part is about branch prediction. A way to improve it without increasing its accuracy is to reduce the cost of a branch misprediction. This is possible by exploiting control flow reconvergence and control independence: the work done on the wrong path on instructions common to the two paths is saved so that it can be reused on the correct path. The second part is about predicated instructions. We propose a solution to the multiple definition problem by selectively predicting the predicate values. A selective replay mechanism is used to reduce the cost of a predicate misprediction.
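As a rough illustration of the Amdahl's law argument in the abstract, the standard formula can be sketched in a few lines of Python. The function name `amdahl_speedup`, the 95% parallel fraction, and the core counts are assumptions chosen for illustration; they are not figures taken from the thesis.

```python
def amdahl_speedup(parallel_fraction: float, cores: int) -> float:
    """Speedup predicted by Amdahl's law for a fixed workload.

    parallel_fraction: share of the single-core execution time that can be
    spread across cores (illustrative input, between 0 and 1).
    cores: number of cores the parallel part is spread over.
    """
    if not 0.0 <= parallel_fraction <= 1.0:
        raise ValueError("parallel_fraction must be between 0 and 1")
    if cores < 1:
        raise ValueError("cores must be at least 1")
    serial_fraction = 1.0 - parallel_fraction
    # The serial part is unaffected by the core count; only the parallel part shrinks.
    return 1.0 / (serial_fraction + parallel_fraction / cores)


if __name__ == "__main__":
    # Hypothetical workload: 95% of the time is parallelizable.
    for cores in (1, 10, 100, 1000):
        print(cores, round(amdahl_speedup(0.95, cores), 2))
```

With these assumed numbers, the speedup can never reach 1 / 0.05 = 20, however many cores are added, because the sequential 5% comes to dominate. That is the sense in which sequential performance stays vital on a manycore.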

Cited literature [61 references]
Contributor : Abes Star
Submitted on : Tuesday, December 10, 2013 - 2:36:18 PM
Last modification on : Friday, November 16, 2018 - 1:40:29 AM
Long-term archiving on : Tuesday, March 11, 2014 - 9:41:11 AM


Version validated by the jury (STAR)


  • HAL Id : tel-00916589, version 1


Nathanaël Prémillieu. Améliorer la performance séquentielle à l’ère des processeurs massivement multicœurs. Other [cs.OH]. Université Rennes 1, 2013. French. ⟨NNT : 2013REN1S071⟩. ⟨tel-00916589⟩


