Read While You Wait - Get immediate ebook access, if available*, when you order a print book

Theoretical Computer Science and General Issues

Text Analysis Pipelines

Towards Ad-hoc Large-Scale Text Mining

Authors: Wachsmuth, Henning

Free Preview

Buy this book

eBook $59.99
price for USA in USD
  • ISBN 978-3-319-25741-9
  • Digitally watermarked, DRM-free
  • Included format: EPUB, PDF
  • ebooks can be used on all reading devices
  • Immediate eBook download after purchase
Softcover $79.99
price for USA in USD
  • ISBN 978-3-319-25740-2
  • Free shipping for individuals worldwide
  • Immediate ebook access, if available*, with your print order
  • Usually dispatched within 3 to 5 business days.
About this book

This monograph proposes a comprehensive and fully automatic approach to designing text analysis pipelines for arbitrary information needs that are optimal in terms of run-time efficiency and that robustly mine relevant information from text of any kind. Based on state-of-the-art techniques from machine learning and other areas of artificial intelligence, novel pipeline construction and execution algorithms are developed and implemented in prototypical software. Formal analyses of the algorithms and extensive empirical experiments underline that the proposed approach represents an essential step towards the ad-hoc use of text mining in web search and big data analytics.
Both web search and big data analytics aim to fulfill peoples’ needs for information in an adhoc manner. The information sought for is often hidden in large amounts of natural language text. Instead of simply returning links to potentially relevant texts, leading search and analytics engines have started to directly mine relevant information from the texts. To this end, they execute text analysis pipelines that may consist of several complex information-extraction and text-classification stages. Due to practical requirements of efficiency and robustness, however, the use of text mining has so far been limited to anticipated information needs that can be fulfilled with rather simple, manually constructed pipelines.


Table of contents (6 chapters)

Table of contents (6 chapters)

Buy this book

eBook $59.99
price for USA in USD
  • ISBN 978-3-319-25741-9
  • Digitally watermarked, DRM-free
  • Included format: EPUB, PDF
  • ebooks can be used on all reading devices
  • Immediate eBook download after purchase
Softcover $79.99
price for USA in USD
  • ISBN 978-3-319-25740-2
  • Free shipping for individuals worldwide
  • Immediate ebook access, if available*, with your print order
  • Usually dispatched within 3 to 5 business days.
Loading...

Recommended for you

Loading...

Bibliographic Information

Bibliographic Information
Book Title
Text Analysis Pipelines
Book Subtitle
Towards Ad-hoc Large-Scale Text Mining
Authors
Series Title
Theoretical Computer Science and General Issues
Series Volume
9383
Copyright
2015
Publisher
Springer International Publishing
Copyright Holder
Springer International Publishing Switzerland
eBook ISBN
978-3-319-25741-9
DOI
10.1007/978-3-319-25741-9
Softcover ISBN
978-3-319-25740-2
Edition Number
1
Number of Pages
XX, 302
Number of Illustrations
74 illustrations in colour
Topics

*immediately available upon purchase as print book shipments may be delayed due to the COVID-19 crisis. ebook access is temporary and does not include ownership of the ebook. Only valid for books with an ebook version. Springer Reference Works and instructor copies are not included.