Skip to main content
Birkhäuser
Book cover

Handbook of Floating-Point Arithmetic

  • Book
  • © 2018

Overview

  • Provides a complete overview of a topic that is widely used to implement real-number arithmetic on modern computers, yet is far from being fully exploited to its full potential
  • Techniques are illustrated, whenever possible, by a corresponding program, allowing the reader to put them directly into practice
  • Develops smart and nontrivial algorithms for implementation of floating-point arithmetic in software
  • For a broad audience of programmers of numerical applications, compiler designers, programmers of floating-point algorithms, designers of arithmetic operators; as well as students and researchers in numerical analysis

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (14 chapters)

  1. Introduction, Basic Definitions, and Standards

  2. Cleverly Using Floating-Point Arithmetic

  3. Implementing Floating-Point Operators

  4. Extensions

Keywords

About this book

This handbook is a definitive guide to the effective use of modern floating-point arithmetic, which has considerably evolved, from the frequently inconsistent floating-point number systems of early computing to the recent IEEE 754-2008 standard. Most of computational mathematics depends on floating-point numbers, and understanding their various implementations will allow readers to develop programs specifically tailored for the standard’s technical features. Algorithms for floating-point arithmetic are presented throughout the book and illustrated where possible by example programs which show how these techniques appear in actual coding and design.


The volume itself breaks its core topic into four parts: the basic concepts and history of floating-point arithmetic; methods of analyzing floating-point algorithms and optimizing them; implementations of IEEE 754-2008 in hardware and software; and useful extensions to the standard floating-point system, such as interval arithmetic, double- and triple-word arithmetic, operations on complex numbers, and formal verification of floating-point algorithms. This new edition updates chapters to reflect recent changes to programming languages and compilers and the new prevalence of GPUs in recent years. The revisions also add material on fused multiply-add instruction, and methods of extending the floating-point precision. 


As supercomputing becomes more common, more numerical engineers will need to use number representation to account for trade-offs between various parameters, such as speed, accuracy, and energy consumption. The Handbook of Floating-Point Arithmetic is designed for students and researchers in numerical analysis, programmers of numerical algorithms, compiler designers, and designers of arithmetic operators. 





Reviews

“The new edition of this book updates chapters to reflect recent changes to programming languages and compilers and the new prevalence of Graphic Processing Units in recent years. … In the Appendix, the reader will find an introduction to relevant number theory tools … . This book is designed for programmers of numerical applications … and more generally students and researchers in numerical analysis who wish to more accurately understand a tool that they manipulate on an everyday basis.” (T. C. Mohan, zbMATH 1394.65001, 2018)

Authors and Affiliations

  • CNRS - LIP, Lyon, France

    Jean-Michel Muller

  • Kalray, Grenoble, France

    Nicolas Brunie

  • INSA-Lyon - CITI, Villeurbanne, France

    Florent de Dinechin

  • Inria - LIP, Lyon, France

    Claude-Pierre Jeannerod, Vincent Lefèvre, Nathalie Revol

  • CNRS - LAAS, Toulouse, France

    Mioara Joldes

  • Inria - LRI, Orsay, France

    Guillaume Melquiond

  • ENS-Lyon - LIP, Lyon, France

    Serge Torres

About the authors

Jean-Michel Muller (coordinator), CNRS, Laboratoire LIP, AriC teamNicolas Brunie, Kalray
Florent de Dinechin, INSA Lyon, Laboratoire CITI, Socrate team
Claude-Pierre Jeannerod, Inria, Laboratoire LIP, AriC team
Mioara Joldes, CNRS, LAAS, MAC team
Vincent Lefèvre, Inria, Laboratoire LIP, AriC team
Guillaume Melquiond, Inria, Laboratoire LRI, Toccata team
Nathalie Revol, Inria, Laboratoire LIP, AriC team
Serge Torres, ENS de Lyon, Laboratoire LIP, AriC team

Bibliographic Information

Publish with us