Skip to main content
  • Book
  • © 2012

Resilience Assessment and Evaluation of Computing Systems

  • Defines and positions resilience in relation to dependability, performance engineering, and system testing
  • Provides a comprehensive overview of current and upcoming trends in computer system resilience assessment, as well as evaluation methods and techniques
  • Illustrates the applicability of the presented techniques using real-world examples and case studies
  • Includes supplementary material: sn.pub/extras

Buy it now

Buying options

eBook USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (20 chapters)

  1. Front Matter

    Pages i-xviii
  2. Introduction and Motivating Examples

    1. Front Matter

      Pages 1-1
    2. Resilience in Mobile Networks: A Need and a Challenge

      • Alberto Avritzer, Luca Berardinelli, Vittorio Cortellessa, Leïla Kloul, Carlo Rosa, Katinka Wolter
      Pages 25-40
    3. Assessing Dependability and Resilience in Critical Infrastructures: Challenges and Opportunities

      • Alberto Avritzer, Felicita Di Giandomenico, Anne Remke, Martin Riedl
      Pages 41-63
    4. Providing Dependability and Resilience in the Cloud: Challenges and Opportunities

      • Samuel Kounev, Philipp Reinecke, Fabian Brosig, Jeremy T. Bradley, Kaustubh Joshi, Vlastimil Babka et al.
      Pages 65-81
  3. Modelling Techniques

    1. Front Matter

      Pages 83-83
    2. Phase-Type Distributions

      • Philipp Reinecke, Levente Bodrog, Alexandra Danilkina
      Pages 85-113
    3. Scalable Stochastic Modelling for Resilience

      • Jeremy T. Bradley, Lucia Cloth, Richard A. Hayden, Leïla Kloul, Philipp Reinecke, Markus Siegle et al.
      Pages 115-149
  4. Model-Driven Prediction

    1. Front Matter

      Pages 151-151
    2. Modelling and Model-Based Assessment

      • Andrea Bondavalli, Paolo Lollini, István Majzik, Leonardo Montecchi
      Pages 153-165
    3. Software Aging and Rejuvenation for Increased Resilience: Modeling, Analysis and Applications

      • Alberto Avritzer, Ricardo M. Czekster, Salvatore Distefano, Kishor S. Trivedi
      Pages 167-183
    4. Online Prediction: Four Case Studies

      • Katja Gilly, Fabian Brosig, Ramon Nou, Samuel Kounev, Carlos Juiz
      Pages 185-202
  5. Measurement and Metrics

    1. Front Matter

      Pages 203-203
    2. Foundations of Metrology in the Observation of Critical Systems

      • Andrea Bondavalli, Andrea Ceccarelli, Lorenzo Falai, Michele Vadursi
      Pages 205-211
    3. Field Studies on Resilience: Measurements and Repositories

      • Joao Duraes, José Fonseca, Henrique Madeira, Marco Vieira
      Pages 213-237
    4. Failure Diagnosis of Complex Systems

      • Soila P. Kavulya, Kaustubh Joshi, Felicita Di Giandomenico, Priya Narasimhan
      Pages 239-261
    5. Fault Injection

      • Raul Barbosa, Johan Karlsson, Henrique Madeira, Marco Vieira
      Pages 263-281
    6. Resilience Benchmarking

      • Marco Vieira, Henrique Madeira, Kai Sachs, Samuel Kounev
      Pages 283-301
  6. Testing Techniques

    1. Front Matter

      Pages 303-303

About this book

The resilience of computing systems includes their dependability as well as their fault tolerance and security. It defines the ability of a computing system to perform properly in the presence of various kinds of disturbances and to recover from any service degradation. These properties are immensely important in a world where many aspects of our daily life depend on the correct, reliable and secure operation of often large-scale distributed computing systems.

Wolter and her co-editors grouped the 20 chapters from leading researchers into seven parts: an introduction and motivating examples, modeling techniques, model-driven prediction, measurement and metrics, testing techniques, case studies, and conclusions. The core is formed by 12 technical papers, which are framed by motivating real-world examples and case studies, thus illustrating the necessity and the application of the presented methods. While the technical chapters are independent of each other and can be read in any order, the reader will benefit more from the case studies if he or she reads them together with the related techniques.

The papers combine topics like modeling, benchmarking, testing, performance evaluation, and dependability, and aim at academic and industrial researchers in these areas as well as graduate students and lecturers in related fields. In this volume, they will find a comprehensive overview of the state of the art in a field of continuously growing practical importance.

Editors and Affiliations

  • Inst. Informatik, Humboldt-Universität Berlin, Berlin, Germany

    Katinka Wolter

  • Siemens Corporate Research, Princeton, USA

    Alberto Avritzer

  • Fac. Ciencias e Tecnologia, Depto. Engenharia Informatica, Universidade de Coimbra, Coimbra, Portugal

    Marco Vieira

  • Newcastle University, Newcastle upon Tyne, United Kingdom

    Aad van Moorsel

About the editors

Katinka Wolter is a senior researcher at the Free University Berlin, where she leads a research group working on Quality of Service in wireless networks and dependability of service-oriented architectures. Her interests are in performance and reliability modeling and the evaluation of computer and communication systems, networks, and applications.

Alberto Avritzer is currently a senior member of the technical staff at the Software Engineering Department at Siemens Corporate Research in Princeton, NJ. Before moving to Siemens Corporate Research, he spent 13 years at AT&T Bell Laboratories, where he developed tools and techniques for performance testing and analysis. His research interests are in software engineering, particularly software testing, monitoring and rejuvenation of smoothly degrading systems, and metrics to assess software architecture. He is a member of ACM SIGSOFT, and IEEE.

Aad van Moorsel is a Professor at the School of Computing Science at the University of Newcastle. Prior to that. he worked in industry, first as a researcher at Bell Labs/Lucent Technologies in Murray Hill and then as a research manager at Hewlett-Packard Labs in Palo Alto. His research agenda aims at establishing intelligent enterprises, with a specific focus on trust, privacy and security.

Marco Vieira is an Assistant Professor at the University of Coimbra, Portugal. He is an expert on dependability and security benchmarking and is coauthor of the first dependability benchmark proposal known – the DBenchOLTP. His research interests include experimental dependability evaluation, fault injection, robustness testing, software development processes, and software quality assurance.

Bibliographic Information

Buy it now

Buying options

eBook USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access