More than 1,900 Springer Protocols eBooks at just $9.99 each! Get yours today>>

The Information Retrieval Series

Language Modeling for Information Retrieval

Editors: Croft, Bruce, Lafferty, John (Eds.)

Buy this book

eBook $119.00
price for USA (gross)
  • ISBN 978-94-017-0171-6
  • Digitally watermarked, DRM-free
  • Included format: PDF
  • ebooks can be used on all reading devices
  • Immediate eBook download after purchase
Hardcover $159.00
price for USA
  • ISBN 978-1-4020-1216-7
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
Softcover $159.00
price for USA
  • ISBN 978-90-481-6263-5
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
About this book

A statisticallanguage model, or more simply a language model, is a prob­ abilistic mechanism for generating text. Such adefinition is general enough to include an endless variety of schemes. However, a distinction should be made between generative models, which can in principle be used to synthesize artificial text, and discriminative techniques to classify text into predefined cat­ egories. The first statisticallanguage modeler was Claude Shannon. In exploring the application of his newly founded theory of information to human language, Shannon considered language as a statistical source, and measured how weH simple n-gram models predicted or, equivalently, compressed natural text. To do this, he estimated the entropy of English through experiments with human subjects, and also estimated the cross-entropy of the n-gram models on natural 1 text. The ability of language models to be quantitatively evaluated in tbis way is one of their important virtues. Of course, estimating the true entropy of language is an elusive goal, aiming at many moving targets, since language is so varied and evolves so quickly. Yet fifty years after Shannon's study, language models remain, by all measures, far from the Shannon entropy liInit in terms of their predictive power. However, tbis has not kept them from being useful for a variety of text processing tasks, and moreover can be viewed as encouragement that there is still great room for improvement in statisticallanguage modeling.

Table of contents (10 chapters)

  • Probabilistic Relevance Models Based on Document and Query Generation

    Lafferty, John (et al.)

    Pages 1-10

  • Relevance Models in Information Retrieval

    Lavrenko, Victor (et al.)

    Pages 11-56

  • Language Modeling and Relevance

    Jones, Karen Sparck (et al.)

    Pages 57-71

  • Contributions of Language Modeling to the Theory and Practice of Information Retrieval

    Greiff, Warren R. (et al.)

    Pages 73-93

  • Language Models for Topic Tracking

    Kraaij, Wessel (et al.)

    Pages 95-123

Buy this book

eBook $119.00
price for USA (gross)
  • ISBN 978-94-017-0171-6
  • Digitally watermarked, DRM-free
  • Included format: PDF
  • ebooks can be used on all reading devices
  • Immediate eBook download after purchase
Hardcover $159.00
price for USA
  • ISBN 978-1-4020-1216-7
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
Softcover $159.00
price for USA
  • ISBN 978-90-481-6263-5
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
Loading...

Recommended for you

Loading...

Bibliographic Information

Bibliographic Information
Book Title
Language Modeling for Information Retrieval
Editors
  • Bruce Croft
  • John Lafferty
Series Title
The Information Retrieval Series
Series Volume
13
Copyright
2003
Publisher
Springer Netherlands
Copyright Holder
Springer Science+Business Media Dordrecht
eBook ISBN
978-94-017-0171-6
DOI
10.1007/978-94-017-0171-6
Hardcover ISBN
978-1-4020-1216-7
Softcover ISBN
978-90-481-6263-5
Series ISSN
1387-5264
Edition Number
1
Number of Pages
XIV, 246
Topics