Logo - springer
Slogan - springer

Computer Science - Communication Networks | Energy-Efficient High Performance Computing - Measurement and Tuning

Energy-Efficient High Performance Computing

Measurement and Tuning

Laros III, J.H., Pedretti, K., Kelly, S.M., Shu, W., Ferreira, K., Van Dyke, J., Vaughan, C.

2013, XIV, 67 p. 19 illus., 8 illus. in color.

Available Formats:
eBook
Information

Springer eBooks may be purchased by end-customers only and are sold without copy protection (DRM free). Instead, all eBooks include personalized watermarks. This means you can read the Springer eBooks across numerous devices such as Laptops, eReaders, and tablets.

You can pay for Springer eBooks with Visa, Mastercard, American Express or Paypal.

After the purchase you can directly download the eBook file or read it online in our Springer eBook Reader. Furthermore your eBook will be stored in your MySpringer account. So you can always re-download your eBooks.

 
$29.95

(net) price for USA

ISBN 978-1-4471-4492-2

digitally watermarked, no DRM

Included Format: PDF

download immediately after purchase


learn more about Springer eBooks

add to marked items

Softcover
Information

Softcover (also known as softback) version.

You can pay for Springer Books with Visa, Mastercard, American Express or Paypal.

Standard shipping is free of charge for individual customers.

 
$39.95

(net) price for USA

ISBN 978-1-4471-4491-5

free shipping for individuals worldwide

usually dispatched within 3 to 5 business days


add to marked items

  • Examines the power requirements of a range of important DOE/NNSA production scientific computing applications running at large scale
  • Demonstrates how CPU and network bandwidth tuning can result in energy savings with little or no impact on run-time performance
  • Discusses how next-generation large-scale platforms could benefit from the capability to tune platform components to achieve more energy-efficient performance

Recognition of the importance of power and energy in the field of high performance computing (HPC) has never been greater. Research has been conducted in a number of areas related to power and energy, but little existing research has focused on large-scale HPC. Part of the reason is the lack of measurement capability currently available on small or large platforms. Typically, research is conducted using coarse methods of measurement such as inserting a power meter between the power source and the platform, or fine grained measurements using custom instrumented boards (with obvious limitations in scale). To analyze real scientific computing applications at large scale, an in situ measurement capability is necessary that scales to the size of the platform.

In response to this challenge, the unique power measurement capabilities of the Cray XT architecture were exploited to gain an understanding of power and energy use and the effects of tuning both CPU and network bandwidth. Modifications were made at the operating system level to deterministically halt cores when idle. Additionally, capabilities were added to alter operating P-state. At the application level, an understanding of the power requirements of a range of important DOE/NNSA production scientific computing applications running at large scale (thousands of nodes) is gained by simultaneously collecting current and voltage measurements on the hosting nodes. The effects of both CPU and network bandwidth tuning are examined and energy savings opportunities of up to 39% with little or no impact on run-time performance is demonstrated. Capturing scale effects was key. This research provides strong evidence that next generation large-scale platforms should not only approach CPU frequency scaling differently, as we will demonstrate, but could also benefit from the capability to tune other platform components, such as the network, to achieve more energy efficient performance.

Content Level » Research

Keywords » Energy Efficiency - High Performance Computing (HPC) - Lightweight Kernel - Network Bandwidth Tuning - Operating Systems - Power - Reliability Availability and Serviceability (RAS)

Related subjects » Communication Networks - Hardware - Software Engineering

Table of contents / Preface / Sample pages 

Popular Content within this publication 

 

Articles

Read this Book on Springerlink

Services for this book

New Book Alert

Get alerted on new Springer publications in the subject area of Computer Communication Networks.