Name: Language Modeling for Information Retrieval
ISBN: 978-94-017-0171-6

Overview

Editors:

W. Bruce Croft (Distinguished Professor)⁰,
John Lafferty (Associate Professor)¹

W. Bruce Croft
1. Department of Computer Science, University of Massachusetts, Amherst, USA
View editor publications

You can also search for this editor in PubMed Google Scholar
John Lafferty
1. Computer Science Department, Carniege Mellon University, Pittsburgh, USA
View editor publications

You can also search for this editor in PubMed Google Scholar

Part of the book series: The Information Retrieval Series (INRE, volume 13)

2697 Accesses
289 Citations

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 84.99

Price excludes VAT (USA)

Softcover Book USD 109.99

Price excludes VAT (USA)

Hardcover Book USD 109.99

Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (10 chapters)

Front Matter

Pages i-xiii

Download chapter PDF
Probabilistic Relevance Models Based on Document and Query Generation
- John Lafferty, ChengXiang Zhai
Pages 1-10
Relevance Models in Information Retrieval
- Victor Lavrenko, W. Bruce Croft
Pages 11-56
Language Modeling and Relevance
- Karen Sparck Jones, Stephen Robertson, Djoerd Hiemstra, Hugo Zaragoza
Pages 57-71
Contributions of Language Modeling to the Theory and Practice of Information Retrieval
- Warren R. Greiff, William T. Morgan
Pages 73-93
Language Models for Topic Tracking
- Wessel Kraaij, Martijn Spitters
Pages 95-123
A Probabilistic Approach to Term Translation for Cross-Lingual Retrieval
- Jinxi Xu, Ralph Weischedel
Pages 125-140
Using Compression-Based Language Models for Text Categorization
- William J. Teahan, David J. Harper
Pages 141-165
Applications of Score Distributions in Information Retrieval
- R. Manmatha
Pages 167-188
An Unbiased Generative Model for Setting Dissemination Thresholds
- Yi Zhang, Jamie Callan
Pages 189-217
Language Modeling Experiments in Non-Extractive Summarization
- Vibhu O. Mittal, Michael J. Witbrock
Pages 219-244
Back Matter

Pages 245-245

Download chapter PDF

Keywords

About this book

A statisticallanguage model, or more simply a language model, is a prob abilistic mechanism for generating text. Such adefinition is general enough to include an endless variety of schemes. However, a distinction should be made between generative models, which can in principle be used to synthesize artificial text, and discriminative techniques to classify text into predefined cat egories. The first statisticallanguage modeler was Claude Shannon. In exploring the application of his newly founded theory of information to human language, Shannon considered language as a statistical source, and measured how weH simple n-gram models predicted or, equivalently, compressed natural text. To do this, he estimated the entropy of English through experiments with human subjects, and also estimated the cross-entropy of the n-gram models on natural 1 text. The ability of language models to be quantitatively evaluated in tbis way is one of their important virtues. Of course, estimating the true entropy of language is an elusive goal, aiming at many moving targets, since language is so varied and evolves so quickly. Yet fifty years after Shannon's study, language models remain, by all measures, far from the Shannon entropy liInit in terms of their predictive power. However, tbis has not kept them from being useful for a variety of text processing tasks, and moreover can be viewed as encouragement that there is still great room for improvement in statisticallanguage modeling.

Editors and Affiliations

Department of Computer Science, University of Massachusetts, Amherst, USA

W. Bruce Croft
Computer Science Department, Carniege Mellon University, Pittsburgh, USA

John Lafferty

Bibliographic Information

Book Title: Language Modeling for Information Retrieval
Editors: W. Bruce Croft, John Lafferty
Series Title: The Information Retrieval Series
DOI: https://doi.org/10.1007/978-94-017-0171-6
Publisher: Springer Dordrecht
eBook Packages: Springer Book Archive
Copyright Information: Springer Science+Business Media Dordrecht 2003
Hardcover ISBN: 978-1-4020-1216-7Published: 31 May 2003
Softcover ISBN: 978-90-481-6263-5Published: 06 December 2010
eBook ISBN: 978-94-017-0171-6Published: 17 April 2013
Series ISSN: 1871-7500
Series E-ISSN: 2730-6836
Edition Number: 1
Number of Pages: XIV, 246
Topics: Data Structures and Information Theory, Information Storage and Retrieval, Computer Science, general, Artificial Intelligence

Publish with us

Policies and ethics

Overview

Access this book

Other ways to access

Table of contents (10 chapters)

Front Matter

Back Matter

Keywords

About this book

Editors and Affiliations

Department of Computer Science, University of Massachusetts, Amherst, USA

Computer Science Department, Carniege Mellon University, Pittsburgh, USA

Bibliographic Information

Publish with us

Search

Navigation