Skip to main content
  • Book
  • © 2016

Software Design for Resilient Computer Systems

  • Outlines potential critical faults in the modern computer systems and what is required to change them
  • Explains how to design and re-design system software for the next generation of computers for wider application domains and greater efficiency and reliability
  • Presents how implemented system software support makes maintenance of computer systems much easier, while reliability and performance increases
  • Includes supplementary material: sn.pub/extras

Buy it now

Buying options

eBook USD 99.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (16 chapters)

  1. Front Matter

    Pages i-xiv
  2. Introduction

    • Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 1-6
  3. Hardware Faults

    • Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 7-10
  4. Fault Tolerance: Theory and Concepts

    • Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 11-22
  5. Generalized Algorithm of Fault Tolerance (GAFT)

    • Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 23-43
  6. GAFT Generalization: A Principle and Model of Active System Safety

    • Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 45-55
  7. System Software Support for Hardware Deficiency: Function and Features

    • Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 57-66
  8. Testing, Checking, and Hardware Syndrome

    • Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 67-104
  9. Recovery Preparation

    • Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 105-134
  10. Recovery: Searching and Monitoring of Correct Software States

    • Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 135-145
  11. Recovery Algorithms: An Analysis

    • Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 147-157
  12. Programming Language for Safety Critical Systems

    • Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 159-182
  13. Proposed Run-Time System Structure

    • Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 183-184
  14. Proposed Run-Time System Versus Existing Approaches

    • Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 185-187
  15. Hardware: The ERRIC Architecture

    • Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 189-197
  16. Architecture Comparison and Evaluation

    • Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 199-205
  17. ERRIC Reliability

    • Igor Schagaev, Thomas Kaegi-Trachsel
    Pages 207-211
  18. Back Matter

    Pages 213-214

About this book

This book addresses the question of how system software should be designed to account for faults, and which fault tolerance features it should provide for highest reliability. The authors first show how the system software interacts with the hardware to tolerate faults. They analyze and further develop the theory of fault tolerance to understand the different ways to increase the reliability of a system, with special attention on the role of system software in this process. They further develop the general algorithm of fault tolerance (GAFT) with its three main processes: hardware checking, preparation for recovery, and the recovery procedure. For each of the three processes, they analyze the requirements and properties theoretically and give possible implementation scenarios and system software support required. Based on the theoretical results, the authors derive an Oberon-based programming language with direct support of the three processes of GAFT.  In the last part of this book, they introduce a simulator, using it as a proof of concept implementation of a novel fault tolerant processor architecture (ERRIC) and its newly developed runtime system feature-wise and performance-wise. The content applies to industries such as military, aviation, intensive health care, industrial control, space exploration, etc. 

Authors and Affiliations

  • IT-ACS Ltd, Stevenage, United Kingdom

    Igor Schagaev, Thomas Kaegi-Trachsel

About the authors

Dr. Igor Schagaev is a Professor and Head of Laboratory of Applied Computer Science and Technology (LACOST) Faculty of Life Science and Computing and London Metropolitan University. He is a Fellow of the Institute of Analyst and Programmers (UK) and has previously work as an Electromechanical Engineer at the  Smolensk aviation factory, USSR, a Senior Programmer and Design Engineer at the Institute of Advanced Computations, Central Bureau, Smolensk Branch, and a Senior Design Engineer and System Programmer for Avionics, and was Director of ATLAB Ltd Bristol. He has published 5 books, over 60 papers, and has been honored with several industry awards, achievements, and grants.

Dr. Thomas Kaegi-Trachsel received his PhD in 2012 in ETH Zurich in the area of system software for embedded systems (under supervision of Prof  Schagaev). He is currently a Senior Researcher at IT-ACS Ltd.

Bibliographic Information

Buy it now

Buying options

eBook USD 99.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 129.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access