Skip to main content
Book cover

Survey of Text Mining II

Clustering, Classification, and Retrieval

  • Book
  • © 2008

Overview

  • Overview of current methods and software for text mining

  • Experts from academia and industry share their experiences in solving large-scale retrieval and classification problems

  • Highlights open research questions in document categorization and clustering, and trend detection

  • Describes new application problems in areas such as email surveillance and anomaly detection

  • Includes supplementary material: sn.pub/extras

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 54.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (12 chapters)

  1. Clustering

  2. Document Retrieval and Representation

  3. Email Surveillance and Filtering

  4. Anomaly Detection

Keywords

About this book

As we enter the third decade of the World Wide Web (WWW), the textual revolution has seen a tremendous change in the availability of online information. Finding inf- mation for just about any need has never been more automatic—just a keystroke or mouseclick away. While the digitalization and creation of textual materials continues at light speed, the ability to navigate, mine, or casually browse through documents too numerous to read (or print) lags far behind. What approaches to text mining are available to ef?ciently organize, classify, label, and extract relevant information for today’s information-centric users? What algorithms and software should be used to detect emerging trends from both text streamsandarchives?Thesearejustafewoftheimportantquestionsaddressedatthe Text Mining Workshop held on April 28, 2007, in Minneapolis, MN. This workshop, the ?fth in a series of annual workshops on text mining, was held on the ?nal day of the Seventh SIAM International Conference on Data Mining (April 26–28, 2007). With close to 60 applied mathematicians and computer scientists representing universities, industrial corporations, and government laboratories, the workshop f- tured both invited and contributed talks on important topics such as the application of techniques of machine learning in conjunction with natural language processing, - formation extraction and algebraic/mathematical approaches to computational inf- mation retrieval. The workshop’s program also included an Anomaly Detection/Text Mining competition. NASA Ames Research Center of Moffett Field, CA, and SAS Institute Inc. of Cary, NC, sponsored the workshop.

Editors and Affiliations

  • Department of Computer Science, University of Tennessee, USA

    Michael W. Berry

  • Hewlett-Packard Laboratories, Palo Alto, USA

    Malu Castellanos

Bibliographic Information

Publish with us