Skip to main content
Book cover

Operating Systems for Supercomputers and High Performance Computing

  • Book
  • © 2019

Overview

  • Provides an in-depth look at real-world supercomputer operating systems and their history
  • Presents a comprehensive and structured approach to the topic, written by leading HPC OS researchers
  • Explains lessons-learned and best practices from history to cutting-edge supercomputer OS research

Part of the book series: High-Performance Computing Series (HPC, volume 1)

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 109.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 149.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 139.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (20 chapters)

  1. Introduction

  2. Lightweight Kernels

  3. Unix/Linux Based Systems

  4. Multi-kernels

Keywords

About this book

Few works are as timely and critical to the advancement of high performance computing than is this new up-to-date treatise on leading-edge directions of operating systems. It is a first-hand product of many of the leaders in this rapidly evolving field and possibly the most comprehensive.

This new and important book masterfully presents the major alternative concepts driving the future of operating system design for high performance computing. In particular, it describes the major advances of monolithic operating systems such as Linux and Unix that dominate the TOP500 list. It also presents the state of the art in lightweight kernels that exhibit high efficiency and scalability at the loss of generality. Finally, this work looks forward to possibly the most promising strategy of a hybrid structure combining full service functionality with lightweight kernel operation. With this, it is likely that this new work will find its way on the shelves of almost everyone who is in anyway engaged in the multi-discipline of high performance computing.

(From the foreword by Thomas Sterling)


Editors and Affiliations

  • RIKEN Center for Computational Science, Kobe, Japan

    Balazs Gerofi, Yutaka Ishikawa

  • Intel Corp., Oregon, USA

    Rolf Riesen

  • Intel Corp., New York, USA

    Robert W. Wisniewski

About the editors

Dr. Balazs Gerofi is a research scientist at the RIKEN Center for Computational Science, where he is involved with system software research and development for high performance computing. He actively participates in the design and development of the Post K supercomputer, Japan’s next-generation flagship supercomputer after the K Computer. Balazs earned his M.Sc. degree and Ph.D. degree in computer science from the Vrije Universiteit Amsterdam and The University of Tokyo, respectively. His research interest covers operating systems, high performance computing, cloud computing, and fault-tolerant computing. Balazs is a member of the IEEE Computer Society and the Association for Computing Machinery (ACM).

Dr. Yutaka Ishikawa is the leader of the Post-K computer development project that aims at deploying the next Japanese flagship supercomputer around 2021, at the RIKEN Center for Computational Science, Japan.  Ishikawa received his Ph.D. degree in electrical engineering from Keio University. From 1987 to 2001, he was a member of AIST (the former Electrotechnical Laboratory).  From 1993 to 2001, he was the chief of the Parallel and Distributed System Software Laboratory at the Real World Computing Partnership.  He led the development of the cluster system software called SCore, which was used in several large PC cluster systems around 2004. From 2002 to 2006 and from 2006 to 2014, he was an associate professor and a professor at The University Tokyo, respectively.  From 2006 to 2008, he was a project co-leader to design a commodity-based supercomputer called T2K open supercomputer.  As a result, three universities, Tsukuba, Tokyo, and Kyoto, obtained their respective supercomputers based on those specifications.  From 2010 to 2014, he was also the director of the Information Technology Center at The University of Tokyo. He led the design and implementation of HPCI, High Performance Computing Infrastructure in Japan, from 2010 to2012.

Dr. Rolf Riesen is the lead software architect for the multi-operating system (mOS) project at the Intel Corp. The mOS team is creating an OS for use in supercomputers and other high-end HPC systems. Rolf has 25 years of experience in researching, developing, and deploying software for massively parallel processors. His career began as a key member of the Sandia National Laboratory and University of New Mexico team that created the lightweight kernel and the Portals message passing interface that broke the teraflops barrier in 1997 with the Intel-powered ASCI Red supercomputer. Over the years, Rolf's code and research ideas have directly contributed to specific systems on the TOP500 list, stretching over a period of almost 20 years. It began with SUNMOS on an nCUBE 2 to the Catamount OS on the Cray/Sandia Red Storm system. After teaching for 2 years at the University of New Mexico, he joined IBM research in Dublin, Ireland, where he focused on simulation and fault tolerance for extreme scale systems Now, at Intel, he is using his expertise to guide a team that combines a lightweight OS kernel with Linux. Rolf has over 50 peer-reviewed publications and is an active member of various program committees.  He is also a subject area editor for the journal Parallel Computing.

Dr. Robert W. Wisniewski is an ACM Distinguished Scientist and the chief software architect for Extreme Scale Computing and a senior principal engineer at the Intel Corporation.  He is the lead architect for Intel's cohesive and comprehensive software stack that leverages OpenHPC and is responsible for the software for Aurora, the world's largest announced supercomputer.  He has published over 74 papers in the area of high performance computing, computer systems, and system performance, filed over 56 patents, and given over 53 external invited presentations. Before coming to Intel, he was the chief software architect for Blue Gene Research and manager of the Blue Gene and Exascale Research Software Team at the IBM T.J. Watson Research Facility. There, he was an IBM master inventor and led the software effort on Blue Gene/Q, the fastest machine in the world on the June 2012 TOP500 list, and occupied 4 of the top 10 positions.

Bibliographic Information

  • Book Title: Operating Systems for Supercomputers and High Performance Computing

  • Editors: Balazs Gerofi, Yutaka Ishikawa, Rolf Riesen, Robert W. Wisniewski

  • Series Title: High-Performance Computing Series

  • DOI: https://doi.org/10.1007/978-981-13-6624-6

  • Publisher: Springer Singapore

  • eBook Packages: Computer Science, Computer Science (R0)

  • Copyright Information: Springer Nature Singapore Pte Ltd. 2019

  • Hardcover ISBN: 978-981-13-6623-9Published: 28 October 2019

  • Softcover ISBN: 978-981-13-6626-0Published: 27 November 2020

  • eBook ISBN: 978-981-13-6624-6Published: 15 October 2019

  • Series ISSN: 2662-3420

  • Series E-ISSN: 2662-3439

  • Edition Number: 1

  • Number of Pages: XXIX, 400

  • Number of Illustrations: 43 b/w illustrations, 93 illustrations in colour

  • Topics: Operating Systems

Publish with us