CPL - Chalmers Publication Library
| Utbildning | Forskning | Styrkeområden | Om Chalmers | In English In English Ej inloggad.

PATer: A Hardware Prefetching Automatic Tuner on IBM POWER8 Processor

Minghua Li ; Guancheng Chen ; Qijun Wang ; Yong Hua Lin ; Per Stenström (Institutionen för data- och informationsteknik, Datorteknik (Chalmers)) ; Peter Hofstee
IEEE Computer Architecture Letters (1556-6056). Vol. 15 (2015), 1, p. 37-40.
[Artikel, refereegranskad vetenskaplig]

Hardware prefetching on IBM’s latest POWER8 processor is able to improve performance of many applications significantly, but it can also cause performance loss for others. The IBM POWER8 processor provides one of the most sophisticated hardware prefetching designs which supports 225 different configurations. Obviously, it is a big challenge to find the optimal or near-optimal hardware prefetching configuration for a specific application. We present a dynamic prefetching tuning scheme in this paper, named Prefetch Automatic Tuner (PATer). PATer uses a prediction model based on machine learning to dynamically tune the prefetch configuration based on the values of hardware performance monitoring counters (PMCs). By developing a two-phase prefetching selection algorithm and a prediction accuracy optimization algorithm in this tool, we identify a set of selected key hardware prefetch configurations that matter mostly to performance as well as a set of PMCs that maximize the machine learning prediction accuracy. We show that PATer is able to accelerate the execution of diverse workloads up to 1.4x.

Nyckelord: Computer architecture, memory hierarchy, prefetching



Den här publikationen ingår i följande styrkeområden:

Läs mer om Chalmers styrkeområden  

Denna post skapades 2016-01-06. Senast ändrad 2016-09-22.
CPL Pubid: 229920

 

Läs direkt!


Länk till annan sajt (kan kräva inloggning)