  • Book
  • © 2005

Text Mining

Predictive Methods for Analyzing Unstructured Information

  • Provides an authoritative, comprehensive survey of the concepts, principles, and methods of text mining (the search and retrieval of nonnumeric data), which is becoming increasingly critical at companies and organizations as they attempt to fully utilize their document/textual databases
  • Includes supplementary material: sn.pub/extras

Table of contents (8 chapters)

  1. Front Matter
    Pages i-xii
  2. Overview of Text Mining
    • Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred J. Damerau
    Pages 1-13
  3. From Textual Information to Numerical Vectors
    • Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred J. Damerau
    Pages 15-46
  4. Using Text for Prediction
    • Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred J. Damerau
    Pages 47-84
  5. Information Retrieval and Text Mining
    • Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred J. Damerau
    Pages 85-102
  6. Finding Structure in a Document Collection
    • Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred J. Damerau
    Pages 103-128
  7. Looking for Information in Documents
    • Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred J. Damerau
    Pages 129-156
  8. Case Studies
    • Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred J. Damerau
    Pages 157-195
  9. Emerging Directions
    • Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred J. Damerau
    Pages 197-211
  10. Back Matter
    Pages 213-237

About this book

Data mining is a mature technology. The prediction problem, looking for predictive patterns in data, has been widely studied. Strong methods are available to the practitioner. These methods process structured numerical information, where uniform measurements are taken over a sample of data. Text is often described as unstructured information. So, it would seem, text and numerical data are different, requiring different methods. Or are they? In our view, a prediction problem can be solved by the same methods, whether the data are structured numerical measurements or unstructured text. Text and documents can be transformed into measured values, such as the presence or absence of words, and the same methods that have proven successful for predictive data mining can be applied to text. Yet, there are key differences. Evaluation techniques must be adapted to the chronological order of publication and to alternative measures of error. Because the data are documents, more specialized analytical methods may be preferred for text. Moreover, the methods must be modified to accommodate very high dimensions: tens of thousands of words and documents. Still, the central themes are similar.
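
The transformation of documents into measured values, such as the presence or absence of words, can be illustrated with a short Python sketch. It is not taken from the book: the function names (tokenize, build_vocabulary, to_binary_vector) and the two sample documents are hypothetical, and the tokenizer is a deliberately simple assumption (lowercase runs of letters and digits).

```python
import re

def tokenize(text):
    """Lowercase a document and split it into word tokens (simplified assumption)."""
    return re.findall(r"[a-z0-9]+", text.lower())

def build_vocabulary(documents):
    """Map each distinct word in the collection to a column index, in sorted order."""
    words = sorted({word for doc in documents for word in tokenize(doc)})
    return {word: index for index, word in enumerate(words)}

def to_binary_vector(text, vocabulary):
    """Return a 0/1 list: 1 where the vocabulary word occurs in the text, else 0."""
    present = set(tokenize(text))
    return [1 if word in present else 0 for word in vocabulary]

# Hypothetical two-document collection, invented for illustration only.
documents = [
    "Stock prices rise on strong earnings",
    "Earnings fall as prices drop",
]

vocabulary = build_vocabulary(documents)
vectors = [to_binary_vector(doc, vocabulary) for doc in documents]

for doc, vector in zip(documents, vectors):
    print(vector, doc)
```

Each document becomes one row of a numerical table with one column per word, which is the form that standard prediction methods expect. With a realistic collection the vocabulary grows to tens of thousands of columns, so a sparse representation would normally replace the plain lists used here.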

Authors and Affiliations

  • TJ Watson Labs, IBM Research, Yorktown Heights, USA

    Sholom M. Weiss, Tong Zhang, Fred J. Damerau

  • School of Computer Science and Engineering, University of New South Wales, Sydney, Australia

    Nitin Indurkhya
