A Top-Down method for performance analysis and counters architecture

Ahmad Yasin

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Optimizing an application's performance for a given microarchitecture has become painfully difficult. Increasing microarchitecture complexity, workload diversity, and the unmanageable volume of data produced by performance tools increase the optimization challenges. At the same time resource and time constraints get tougher with recently emerged segments. This further calls for accurate and prompt analysis methods. In this paper a Top-Down Analysis is developed - a practical method to quickly identify true bottlenecks in out-of-order processors. The developed method uses designated performance counters in a structured hierarchical approach to quickly and, more importantly, correctly identify dominant performance bottlenecks. The developed method is adopted by multiple in-production tools including VTune. Feedback from VTune average users suggests that the analysis is made easier thanks to the simplified hierarchy which avoids the high-learning curve associated with microarchitecture details. Characterization results of this method are reported for the SPEC CPU2006 benchmarks as well as key enterprise workloads. Field case studies where the method guides software optimization are included, in addition to architectural exploration study for most recent generations of Intel Core™ products. The insights from this method guide a proposal for a novel performance counters architecture that can determine the true bottlenecks of a general out-of-order processor. Unlike other approaches, our analysis method is low-cost and already featured in in-production systems - it requires just eight simple new performance events to be added to a traditional PMU. It is comprehensive - no restriction to predefined set of performance issues. It accounts for granular bottlenecks in super-scalar cores, missed by earlier approaches.

Original languageEnglish
Title of host publicationISPASS 2014 - IEEE International Symposium on Performance Analysis of Systems and Software
PublisherIEEE Computer Society
Pages35-44
Number of pages10
ISBN (Print)9781479936052
DOIs
StatePublished - 2014
Externally publishedYes
Event2014 IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2014 - Monterey, CA, United States
Duration: 23 Mar 201425 Mar 2014

Publication series

NameISPASS 2014 - IEEE International Symposium on Performance Analysis of Systems and Software

Conference

Conference2014 IEEE International Symposium on Performance Analysis of Systems and Software, ISPASS 2014
Country/TerritoryUnited States
CityMonterey, CA
Period23/03/1425/03/14

ASJC Scopus subject areas

  • Software

Fingerprint

Dive into the research topics of 'A Top-Down method for performance analysis and counters architecture'. Together they form a unique fingerprint.

Cite this