Editors:

Xindong Wu,
Lakhmi Jain,
Jason T.L. Wang²,
Mohammed J. Zaki³,
Hannu T.T. Toivonen⁴,
…
Dennis Shasha⁵

Xindong Wu

View editor publications

You can also search for this editor in PubMed Google Scholar
Lakhmi Jain

View editor publications

You can also search for this editor in PubMed Google Scholar
Jason T.L. Wang
1. New Jersey Institute of Technology, USA
View editor publications

You can also search for this editor in PubMed Google Scholar
Mohammed J. Zaki
1. Computer Science Department, Rensselaer Polytechnic Institute, USA
View editor publications

You can also search for this editor in PubMed Google Scholar
Hannu T.T. Toivonen
1. University of Helsinki and Nokia Research Center, Helsinki
View editor publications

You can also search for this editor in PubMed Google Scholar
Dennis Shasha
1. New York University, USA
View editor publications

You can also search for this editor in PubMed Google Scholar

No known book on this area
First book containing the work of key researchers in biological data mining
Presents new techniques on (a) gene expression data mining, (b) gene mapping for disease detection, and (c) phylogenetic knowledge discovery, which are of increasing importance but are absent in all previously published books in the area of computational biology
Organized around the major themes of modern biology: sequence studies, proteomics and developmental biology - these are core areas of present and future research
Includes supplementary material: sn.pub/extras

Part of the book series: Advanced Information and Knowledge Processing (AI&KP)

20k Accesses
58 Citations

Buy it now

eBook USD 129.00

Price excludes VAT (USA)

Softcover Book USD 169.99

Price excludes VAT (USA)

Hardcover Book USD 169.99

Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Learn about institutional subscriptions

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (12 chapters)

Front Matter

Pages i-xi

PDF
Overview
1. Introduction to Data Mining in Bioinformatics
  
  Jason T. L. Wang, Mohammed J. Zaki, Hannu T. T. Toivonen, Dennis Shasha
  
  Pages 3-8
2. Survey of Biodata Analysis from a Data Mining Perspective
  
  Peter Bajcsy, Jiawei Han, Lei Liu, Jiong Yang
  
  Pages 9-39
Sequence and Structure Alignment
1. AntiClustAl: Multiple Sequence Alignment by Antipole Clustering
  
  Cinzia Di Pietro, Alfredo Ferro, Giuseppe Pigola, Alfredo Pulvirenti, Michele Purrello, Marco Ragusa et al.
  
  Pages 43-57
2. RNA Structure Comparison and Alignment
  
  Kaizhong Zhang
  
  Pages 59-81
Biological Data Mining
1. Piecewise Constant Modeling of Sequential Data Using Reversible Jump Markov Chain Monte Carlo
  
  Marko Salmenkivi, Heikki Mannila
  
  Pages 85-103
2. Gene Mapping by Pattern Discovery
  
  Petteri Sevon, Hannu T. T. Toivonen, Päivi Onkamo
  
  Pages 105-126
3. Predicting Protein Folding Pathways
  
  Mohammed J. Zaki, Vinay Nadimpally, Deb Bardhan, Chris Bystroff
  
  Pages 127-141
4. Data Mining Methods for a Systematics of Protein Subcellular Location
  
  Kai Huang, Robert F. Murphy
  
  Pages 143-187
5. Mining Chemical Compounds
  
  Mukund Deshpande, Michihiro Kuramochi, George Karypis
  
  Pages 189-215
Biological Data Management
1. Phyloinformatics: Toward a Phylogenetic Database
  
  Roderic D. M. Page
  
  Pages 219-241
2. Declarative and Efficient Querying on Protein Secondary Structures
  
  Jignesh M. Patel, Donald P. Huddler, Laurie Hammel
  
  Pages 243-273
3. Scalable Index Structures for Biological Data
  
  Ambuj K. Singh
  
  Pages 275-296
Back Matter

Pages 297-340

PDF

About this book

8. 1. 1 Protein Subcellular Location The life sciences have entered the post-genome era where the focus of biologicalresearchhasshiftedfromgenomesequencestoproteinfunctionality. Withwhole-genomedraftsofmouseandhumaninhand,scientistsareputting more and more e?ort into obtaining information about the entire proteome in a given cell type. The properties of a protein include its amino acid sequences, its expression levels under various developmental stages and in di?erenttissues,its3Dstructureandactivesites,itsfunctionalandstructural binding partners, and its subcellular location. Protein subcellular location is important for understanding protein function inside the cell. For example, the observation that the product of a gene is localized in mitochondria will support the hypothesis that this protein or gene is involved in energy metabolism. Proteins localized in the cytoskeleton are probably involved in intracellular tra?cking and support. The context of protein functionality is well represented by protein subcellular location. Proteins have various subcellular location patterns [250]. One major category of proteins is synthesized on free ribosomes in the cytoplasm. Soluble proteins remain in the cytoplasm after their synthesis and function as small factories catalyzing cellular metabolites. Other proteins that have a target signal in their sequences are directed to their target organelle (such as mitochondria) via posttranslational transport through the organelle membrane. Nuclear proteins are transferred through pores on the nuclear envelope to the nucleus and mostly function as regulators. The second major category of proteins is synthesized on endoplasmic reticulum(ER)-associated ribosomes and passes through the reticuloendothelial system, consisting of the ERand the Golgi apparatus.

Keywords

Editors and Affiliations

New Jersey Institute of Technology, USA

Jason T.L. Wang
Computer Science Department, Rensselaer Polytechnic Institute, USA

Mohammed J. Zaki
University of Helsinki and Nokia Research Center, Helsinki

Hannu T.T. Toivonen
New York University, USA

Dennis Shasha

Bibliographic Information

Book Title: Data Mining in Bioinformatics
Editors: Xindong Wu, Lakhmi Jain, Jason T.L. Wang, Mohammed J. Zaki, Hannu T.T. Toivonen, Dennis Shasha
Series Title: Advanced Information and Knowledge Processing
DOI: https://doi.org/10.1007/b138131
Publisher: Springer London
eBook Packages: Computer Science, Computer Science (R0)
Hardcover ISBN: 978-1-85233-671-4Published: 18 October 2004
Softcover ISBN: 978-1-84996-894-2Published: 22 October 2010
eBook ISBN: 978-1-84628-059-7Published: 02 September 2005
Series ISSN: 1610-3947
Series E-ISSN: 2197-8441
Edition Number: 1
Number of Pages: XII, 340
Number of Illustrations: 110 b/w illustrations
Topics: Database Management, Programming Techniques, Information Systems Applications (incl. Internet), Data Structures, Data Storage Representation, Bioinformatics

Publish with us

Policies and ethics

Editors:

Sections

Buy it now

Buying options

Other ways to access

Table of contents (12 chapters)

Front Matter

Overview

Sequence and Structure Alignment

Biological Data Mining

Biological Data Management

Back Matter

About this book

Keywords

Editors and Affiliations

New Jersey Institute of Technology, USA

Computer Science Department, Rensselaer Polytechnic Institute, USA

University of Helsinki and Nokia Research Center, Helsinki

New York University, USA

Bibliographic Information

Publish with us

Buy it now

Buying options

Other ways to access

Search

Navigation