Skip to main content
  • Book
  • © 2006

Web Archiving

Authors:

  • Combines the librarian’s application knowledge with the computer scientist’s implementation knowledge
  • Introduces all aspects from website monitoring to deep Web preservation
  • Presents an unbiased view on current standardization and preservation projects
  • Includes supplementary material: sn.pub/extras

Buy it now

Buying options

eBook USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (10 chapters)

  1. Front Matter

    Pages I-VII
  2. Web Archiving: Issues and Methods

    • Julien Masanés
    Pages 1-53
  3. Web Use and Web Studies

    • Steve Jones, Camille Johnson
    Pages 55-69
  4. Selection for Web Archives

    • Julien Masanés
    Pages 71-91
  5. Copying Websites

    • Xavier Roche
    Pages 93-114
  6. Archiving the Hidden Web

    • Julien Masanés
    Pages 115-129
  7. Access and Finding Aids

    • Thorsteinn Hallgrimsson
    Pages 131-151
  8. Mining Web Collections

    • Andreas Aschenbrenner, Andreas Rauber
    Pages 153-176
  9. The Long-Term Preservation of Web Content

    • Michael Day
    Pages 177-199
  10. Small Scale Academic Web Archiving: DACHS

    • Hanno E. Lecher
    Pages 213-225
  11. Back Matter

    Pages 227-234

About this book

The public information available on the Web today is larger than information distributed on any other media. The raw nature of Web content, the unpredictable remote changes that can affect it, the wide variety of formats concerned, and the growth in data-driven websites make the preservation of this material a challenging task, requiring specific monitoring, collecting and preserving strategies, procedures and tools.

Julien Masanès, Director of the European Archive, has assembled contributions from computer scientists and librarians that altogether encompass the complete range of tools, tasks and processes needed to successfully preserve the cultural heritage of the Web. His book serves as a standard introduction for everyone involved in keeping alive the immense amount of online information, and it covers issues related to building, using and preserving Web archives both from the computer scientist and librarian viewpoints.

Practitioners will find in this book a state-of-the-art overview of methods, tools and standards they need for their activities. Researchers as well as advanced students in computer science will use it as an introduction to this new field with a hopefully stimulating review of open issues where future work is needed.

Reviews

From the reviews:

"The book addresses the problem of preserving the vast amount of information available on the web. … presents a useful index, as it helps bring together related issues presented by the various contributing authors. There are also a number of relevant figures. A comprehensive list of references after each chapter is also given. … a valuable source of information offered by a range of experts in the field, this book is recommended reading for archivists, record managers, librarians, computer scientists and information managers … ." (Tony Rodrigues, Online Information Review, Vol. 31 (6), 2007)

Authors and Affiliations

  • European Web Archive, Paris, France

    Julien Masanés

About the author

Julien Masanès is a cofounder and the Director of the European Archive, a non-profit foundation for Web preservation and digital cultural access. Before this he had directed the Web Archiving Project at the Bibliothèque nationale de France (BnF) since 2000. He also participated very actively in the creation of the International Internet Preservation Consortium (IIPC), which he coordinated during its first two years.

He is a curator and received a degree in librarianship at ENSSIB (Lyon) in 1999. He was a digital preservation adviser at BnF and has participated in various national and international initiatives in this domain, such as the European Project NEDLIB, the Global Digital Format Registry and the OCLC/RLG Open Archive Information System Certification Group. He has numerous publications in this field, and he launched and presently chairs the International Web Archiving Workshop (IWAW) series, the main international rendezvous in this domain.

Bibliographic Information

Buy it now

Buying options

eBook USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access