Name: A Primer on Hardware Prefetching
ISBN: 978-3-031-01743-8

Overview

Authors:

Babak Falsafi ⁰,
Thomas F. Wenisch ¹

Babak Falsafi
1. EPFL, Switzerland
View author publications

You can also search for this author in PubMed Google Scholar
Thomas F. Wenisch
1. University of Michigan, USA
View author publications

You can also search for this author in PubMed Google Scholar

Part of the book series: Synthesis Lectures on Computer Architecture (SLCA)

1848 Accesses
9 Citations

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 19.99

Price excludes VAT (USA)

Softcover Book USD 27.99

Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (4 chapters)

Front Matter

Pages i-xiv

Download chapter PDF
Introduction
- Babak Falsafi, Thomas F. Wenisch
Pages 1-5
Instruction Prefetching
- Babak Falsafi, Thomas F. Wenisch
Pages 7-14
Data Prefetching
- Babak Falsafi, Thomas F. Wenisch
Pages 15-37
Concluding Remarks
- Babak Falsafi, Thomas F. Wenisch
Pages 39-40
Back Matter

Pages 41-53

Download chapter PDF

About this book

Since the 1970’s, microprocessor-based digital platforms have been riding Moore’s law, allowing for doubling of density for the same area roughly every two years. However, whereas microprocessor fabrication has focused on increasing instruction execution rate, memory fabrication technologies have focused primarily on an increase in capacity with negligible increase in speed. This divergent trend in performance between the processors and memory has led to a phenomenon referred to as the “Memory Wall.” To overcome the memory wall, designers have resorted to a hierarchy of cache memory levels, which rely on the principal of memory access locality to reduce the observed memory access time and the performance gap between processors and memory. Unfortunately, important workload classes exhibit adverse memory access patterns that baffle the simple policies built into modern cache hierarchies to move instructions and data across cache levels. As such, processors often spend much time idling upon a demand fetch of memory blocks that miss in higher cache levels. Prefetching—predicting future memory accesses and issuing requests for the corresponding memory blocks in advance of explicit accesses—is an effective approach to hide memory access latency. There have been a myriad of proposed prefetching techniques, and nearly every modern processor includes some hardware prefetching mechanisms targeting simple and regular memory access patterns. This primer offers an overview of the various classes of hardware prefetchers for instructions and data proposed in the research literature, and presents examples of techniques incorporated into modern microprocessors.

Authors and Affiliations

EPFL, Switzerland

Babak Falsafi
University of Michigan, USA

Thomas F. Wenisch

About the authors

Babak Falsafi is a Professor in the School of Computer and Communication Sciences at EPFL, and the founding director of the EcoCloud research center, targeting future energy-efficient and environmentally friendly cloud technologies. He has made numerous contributions to computer system design and evaluation including: a scalable multiprocessor architecture that laid the foundation for the Sun (now Oracle) WildFire servers; snoop filters; temporal stream prefetchers that are incorporated into IBM BlueGene/P and BlueGene/Q; and computer system simulation sampling methodologies that have been in use by AMD and HP for research and product development. His most notable contribution has been to be first to show that, contrary to conventional wisdom, multiprocessor memory programming models (known as memory consistency models) prevalent in all modern systems are neither necessary nor sufficient to achieve high performance. He is a recipient of an NSF CAREER award, IBM Faculty Partnership Awards, and an Alfred P. Sloan Research Fellowship. He is a fellow of IEEE.Thomas Wenisch is an Associate Professor of Computer Science and Engineering at the University of Michigan, specializing in computer architecture. His prior research includes memory streaming for commercial server applications, store-wait-free multiprocessor memory systems, memory disaggregation, and rigorous sampling-based performance evaluation methodologies. His ongoing work focuses on computational sprinting, memory persistency, data center architecture, energy-efficient server design, and accelerators for medical imaging. Wenisch received the NSF CAREER award in 2009 and the University of Michigan Henry Russell Award in 2013. He received his Ph.D. in Electrical and Computer Engineering from Carnegie Mellon University.

Bibliographic Information

Book Title: A Primer on Hardware Prefetching
Authors: Babak Falsafi, Thomas F. Wenisch
Series Title: Synthesis Lectures on Computer Architecture
DOI: https://doi.org/10.1007/978-3-031-01743-8
Publisher: Springer Cham
eBook Packages: Synthesis Collection of Technology (R0), eBColl Synthesis Collection 5
Copyright Information: Springer Nature Switzerland AG 2014
Softcover ISBN: 978-3-031-00615-9Published: 02 June 2014
eBook ISBN: 978-3-031-01743-8Published: 01 June 2022
Series ISSN: 1935-3235
Series E-ISSN: 1935-3243
Edition Number: 1
Number of Pages: XIV, 54
Topics: Circuits and Systems, Processor Architectures

Publish with us

Policies and ethics

A Primer on Hardware Prefetching

Overview

Access this book

Other ways to access

Table of contents (4 chapters)

Front Matter

Introduction

Instruction Prefetching

Data Prefetching

Concluding Remarks

Back Matter

About this book

Authors and Affiliations

EPFL, Switzerland

University of Michigan, USA

About the authors

Bibliographic Information

Publish with us

Search

Navigation