Overview

Authors:

Udo Kruschwitz ⁰

Udo Kruschwitz
1. University of Essex, Colchester, UK
View author publications

You can also search for this author in PubMed Google Scholar

Using markup structure alone to extract knowledge from documents is new
Domain knowledge is extracted from documents in a fully automated process
The techniques outlined avoid the bottleneck of manual customization
Searching a document collection can be seen as navigating the user through the automatically extracted domain knowledge
Combines the theoretical framework and detailed evaluation steps

Part of the book series: The Information Retrieval Series (INRE, volume 17)

3328 Accesses
3 Citations
2 Altmetric

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 16.99 ~~USD 84.99~~

Discount applied Price excludes VAT (USA)

Softcover Book USD 109.99

Price excludes VAT (USA)

Hardcover Book USD 109.99

Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

About this book

Collections of digital documents can nowadays be found everywhere in institutions, universities or companies. Examples are Web sites or intranets. But searching them for information can still be painful. Searches often return either large numbers of matches or no suitable matches at all.

Such document collections can vary a lot in size and how much structure they carry. What they have in common is that they typically do have some structure and that they cover a limited range of topics. The second point is significantly different from the Web in general.

The type of search system that we propose in this book can suggest ways of refining or relaxing the query to assist a user in the search process. In order to suggest sensible query modifications we would need to know what the documents are about. Explicit knowledge about the document collection encoded in some electronic form is what we need. However, typically such knowledge is not available. So we construct it automatically.

Indexes for Document Retrieval with Relevance

Document retrieval on repetitive string collections

Article Open access 01 April 2017

Semistructured Data Search

Keywords

Table of contents (9 chapters)

Front Matter

Pages I-XVI

Download chapter PDF
Introduction

Pages 1-19
The Model
1. Related Work
  
  Pages 23-44
2. Data Analysis and Domain Model Construction
  
  Pages 45-61
3. Incorporating Additional Knowledge
  
  Pages 63-68
4. A Dialogue System for Partially Structured Data
  
  Pages 69-90
Practical Applications
1. UKSearch - Intelligent Web Search
  
  Pages 93-120
2. UKSearch - Evaluation and Discussion
  
  Pages 121-156
3. YPA - Searching Classified Directories
  
  Pages 157-171
4. Future Directions and Conclusions
  
  Pages 173-179
Back Matter

Pages 181-197

Download chapter PDF

Reviews

From the reviews:

"The main idea of this book, based on the author’s PhD thesis, is to use markup information as a series of cues to the significance of words and concepts in a text, thus enhancing the indexing of that text. The technique is developed for collections of texts with a specific focus, such as a Web site or a collection of documents … . The presented approach is attractive, because it can be adapted to different contexts in a straightforward manner … ." (D. T. Barnard, Computing Reviews, July, 2006)

Authors and Affiliations

University of Essex, Colchester, UK

Udo Kruschwitz

Bibliographic Information

Book Title: Intelligent Document Retrieval
Book Subtitle: Exploiting Markup Structure
Authors: Udo Kruschwitz
Series Title: The Information Retrieval Series
DOI: https://doi.org/10.1007/1-4020-3768-6
Publisher: Springer Dordrecht
eBook Packages: Computer Science, Computer Science (R0)
Copyright Information: Springer Science+Business Media B.V. 2005
Hardcover ISBN: 978-1-4020-3767-2Published: 24 October 2005
Softcover ISBN: 978-90-481-6957-3Published: 28 October 2010
eBook ISBN: 978-1-4020-3768-9Published: 09 January 2006
Series ISSN: 1871-7500
Series E-ISSN: 2730-6836
Edition Number: 1
Number of Pages: XVI, 198
Topics: Theory of Computation, Computer Science, general, Information Storage and Retrieval, System Performance and Evaluation, Information Systems Applications (incl. Internet), Natural Language Processing (NLP)

Publish with us

Policies and ethics

Intelligent Document Retrieval

Overview

Access this book

Other ways to access

About this book

Similar content being viewed by others

Indexes for Document Retrieval with Relevance

Document retrieval on repetitive string collections

Semistructured Data Search

Keywords

Table of contents (9 chapters)

Front Matter

Introduction

The Model

Related Work

Data Analysis and Domain Model Construction

Incorporating Additional Knowledge

A Dialogue System for Partially Structured Data

Practical Applications

UKSearch - Intelligent Web Search

UKSearch - Evaluation and Discussion

YPA - Searching Classified Directories

Future Directions and Conclusions

Back Matter

Reviews

Authors and Affiliations

University of Essex, Colchester, UK

Bibliographic Information

Publish with us

Navigation