Skip to main content
  • Book
  • © 2004

Clustering and Information Retrieval

Part of the book series: Network Theory and Applications (NETA, volume 11)

Buy it now

Buying options

eBook USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (10 chapters)

  1. Front Matter

    Pages i-viii
  2. Clustering in Metric Spaces with Applications to Information Retrieval

    • Ricardo Baeza-Yates, Benjamín Bustos, Edgar Chávez, Norma Herrera, Gonzalo Navarro
    Pages 1-33
  3. Techniques for Clustering Massive Data Sets

    • Sudipto Guha, Rajeev Rastogi, Kyuseok Shim
    Pages 35-82
  4. Finding Topics in Collections of Documents: A Shared Nearest Neighbor Approach

    • Levent Ertöz, Michael Steinbach, Vipin Kumar
    Pages 83-103
  5. On Quantitative Evaluation of Clustering Systems

    • Ji He, Ah-Hwee Tan, Chew-Lim Tan, Sam-Yuan Sung
    Pages 105-133
  6. Document Clustering, Visualization, and Retrieval via Link Mining

    • Steven Noel, Vijay Raghavan, C.-H. Henry Chu
    Pages 161-193
  7. Query Clustering in the Web Context

    • Ji-Rong Wen, Hong-Jiang Zhang
    Pages 195-225
  8. Clustering Techniques for Large Database Cleansing

    • Sam Y. Sung, Zhao Li, Tok W. Ling
    Pages 227-259
  9. A Science Data System Architecture for Information Retrieval

    • Daniel J. Crichton, J. Steven Hughes, Sean Kelly
    Pages 261-298

About this book

Clustering is an important technique for discovering relatively dense sub-regions or sub-spaces of a multi-dimension data distribution. Clus­ tering has been used in information retrieval for many different purposes, such as query expansion, document grouping, document indexing, and visualization of search results. In this book, we address issues of cluster­ ing algorithms, evaluation methodologies, applications, and architectures for information retrieval. The first two chapters discuss clustering algorithms. The chapter from Baeza-Yates et al. describes a clustering method for a general metric space which is a common model of data relevant to information retrieval. The chapter by Guha, Rastogi, and Shim presents a survey as well as detailed discussion of two clustering algorithms: CURE and ROCK for numeric data and categorical data respectively. Evaluation methodologies are addressed in the next two chapters. Ertoz et al. demonstrate the use of text retrieval benchmarks, such as TRECS, to evaluate clustering algorithms. He et al. provide objective measures of clustering quality in their chapter. Applications of clustering methods to information retrieval is ad­ dressed in the next four chapters. Chu et al. and Noel et al. explore feature selection using word stems, phrases, and link associations for document clustering and indexing. Wen et al. and Sung et al. discuss applications of clustering to user queries and data cleansing. Finally, we consider the problem of designing architectures for infor­ mation retrieval. Crichton, Hughes, and Kelly elaborate on the devel­ opment of a scientific data system architecture for information retrieval.

Authors and Affiliations

  • Department of Computer Science, The University of Texas at Dallas, Richardson, USA

    Weili Wu

  • Department of Computer Science and Engineering, University of Minnesota - Twin Cities, Minneapolis, USA

    Hui Xiong, Shashi Shekhar

Bibliographic Information

Buy it now

Buying options

eBook USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access