Skip to main content
  • Book
  • © 2010

Data Mining Techniques for the Life Sciences

  • An easily accessible reference book for computational data mining, ranging from databases, to computational details, and to modern applications
  • Covers a wide range of biological systems and in silico approaches
  • Presents the exciting interface between computational and experimental approaches of molecular biology
  • Serves as a comprehensive guide to designing and running systematic and large scale computational analyses of biological data
  • Outlines the process of turning a series of observations into a network of relationships
  • Includes supplementary material: sn.pub/extras

Part of the book series: Methods in Molecular Biology (MIMB, volume 609)

Buy it now

Buying options

eBook USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 159.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (22 protocols)

  1. Front Matter

    Pages i-xii
  2. Databases

    1. Front Matter

      Pages 1-1
    2. Nucleic Acid Sequence and Structure Databases

      • Stefan Washietl, Ivo L. Hofacker
      Pages 3-15
    3. Protein Sequence Databases

      • Michael Rebhan
      Pages 45-57
    4. Protein Structure Databases

      • Roman A. Laskowski
      Pages 59-82
    5. Protein Domain Architectures

      • Nicola J. Mulder
      Pages 83-95
    6. Thermodynamic Database for Proteins: Features and Applications

      • M. Michael Gromiha, Akinori Sarai
      Pages 97-112
    7. Enzyme Databases

      • Dietmar Schomburg, Ida Schomburg
      Pages 113-128
    8. Biomolecular Pathway Databases

      • Hong Sain Ooi, Georg Schneider, Teng-Ting Lim, Ying-Leong Chan, Birgit Eisenhaber, Frank Eisenhaber
      Pages 129-144
    9. Databases of Protein–Protein Interactions and Complexes

      • Hong Sain Ooi, Georg Schneider, Ying-Leong Chan, Teng-Ting Lim, Birgit Eisenhaber, Frank Eisenhaber
      Pages 145-159
  3. Data Mining Techniques

    1. Front Matter

      Pages 161-161
    2. Proximity Measures for Cluster Analysis

      • Oliviero Carugo
      Pages 163-174
    3. Clustering Criteria and Algorithms

      • Oliviero Carugo
      Pages 175-196
    4. Neural Networks

      • Zheng Rong Yang
      Pages 197-222
    5. A User’s Guide to Support Vector Machines

      • Asa Ben-Hur, Jason Weston
      Pages 223-239
    6. Hidden Markov Models in Biology

      • Claus Vogl, Andreas Futschik
      Pages 241-253
  4. Database Annotations and Predictions

    1. Front Matter

      Pages 255-255
    2. Integrated Tools for Biomolecular Sequence-Based Function Prediction as Exemplified by the ANNOTATOR Software Environment

      • Georg Schneider, Michael Wildpaner, Fernanda L. Sirota, Sebastian Maurer-Stroh, Birgit Eisenhaber, Frank Eisenhaber
      Pages 257-267
    3. Computational Methods for Ab Initio and Comparative Gene Finding

      • Ernesto Picardi, Graziano Pesole
      Pages 269-284

About this book

Most life science researchers will agree that biology is not a truly theoretical branch of science. The hype around computational biology and bioinformatics beginning in the nineties of the 20th century was to be short lived (1, 2). When almost no value of practical importance such as the optimal dose of a drug or the three-dimensional structure of an orphan protein can be computed from fundamental principles, it is still more straightforward to determine them experimentally. Thus, experiments and observationsdogeneratetheoverwhelmingpartofinsightsintobiologyandmedicine. The extrapolation depth and the prediction power of the theoretical argument in life sciences still have a long way to go. Yet, two trends have qualitatively changed the way how biological research is done today. The number of researchers has dramatically grown and they, armed with the same protocols, have produced lots of similarly structured data. Finally, high-throu- put technologies such as DNA sequencing or array-based expression profiling have been around for just a decade. Nevertheless, with their high level of uniform data generation, they reach the threshold of totally describing a living organism at the biomolecular level for the first time in human history. Whereas getting exact data about living systems and the sophistication of experimental procedures have primarily absorbed the minds of researchers previously, the weight increasingly shifts to the problem of interpreting accumulated data in terms of biological function and bio- lecular mechanisms.

Reviews

From the reviews:

“The book consists of three parts with 22 chapters prepared by well-known experts from many countries. … book will be useful for students and researchers, such as biochemists, molecular biologists, and biotechnologists, who wish to get a condensed introduction to the world of biological databases and their applications related to various aspects of life science.” (G. Ya. Wiederschain, Biochemistry, Vol. 76 (4), 2011)

“Provides a comprehensive overview and reference for molecular biologists and bioinformaticians as to the goals and scope of each database in each category. … The chapters are well written and provide a good introduction to the addressed topics … . Each chapter is an interesting and informative read in itself … . Overall, this edited volume provides a good reference to the current state of bioinformatics-related databases and as an introduction to the more common machine-learning techniques in bioinformatics.” (Iddo Friedberg, The Quarterly Review of Biology, Vol. 86, December, 2011)

Editors and Affiliations

  • Max F. Perutz Laboratories GmbH, Universität Wien, Wien, Austria

    Oliviero Carugo

  • Research (A*STAR), Agency for Science & Technology, Singapore, Singapore

    Frank Eisenhaber

Bibliographic Information

  • Book Title: Data Mining Techniques for the Life Sciences

  • Editors: Oliviero Carugo, Frank Eisenhaber

  • Series Title: Methods in Molecular Biology

  • DOI: https://doi.org/10.1007/978-1-60327-241-4

  • Publisher: Humana Totowa, NJ

  • eBook Packages: Springer Protocols

  • Copyright Information: Humana Press 2010

  • Hardcover ISBN: 978-1-60327-240-7Published: 14 December 2009

  • Softcover ISBN: 978-1-4939-5688-3Published: 23 August 2016

  • eBook ISBN: 978-1-60327-241-4Published: 11 March 2010

  • Series ISSN: 1064-3745

  • Series E-ISSN: 1940-6029

  • Edition Number: 1

  • Number of Pages: XII, 407

  • Topics: Bioinformatics, Systems Biology, Computer Appl. in Life Sciences

Buy it now

Buying options

eBook USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 159.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access