Skip to main content
  • Book
  • © 2012

Structure Discovery in Natural Language

Authors:

  • The book sets an ambitious goal: to shift development of language processing systems to a much more automated setting than previous works
  • A new approach is defined
  • All software described is open source and freely available ?
  • Includes supplementary material: sn.pub/extras

Buy it now

Buying options

eBook USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (8 chapters)

  1. Front Matter

    Pages i-xx
  2. Introduction

    • Chris Biemann
    Pages 1-17
  3. Graph Models

    • Chris Biemann
    Pages 19-37
  4. SmallWorlds of Natural Language

    • Chris Biemann
    Pages 39-71
  5. Graph Clustering

    • Chris Biemann
    Pages 73-100
  6. Unsupervised Language Separation

    • Chris Biemann
    Pages 101-111
  7. Unsupervised Part-of-Speech Tagging

    • Chris Biemann
    Pages 113-144
  8. Word Sense Induction and Disambiguation

    • Chris Biemann
    Pages 145-155
  9. Conclusion

    • Chris Biemann
    Pages 157-160
  10. Back Matter

    Pages 161-178

About this book

Current language technology is dominated by approaches that either enumerate a large set of rules, or are focused on a large amount of manually labelled data. The creation of both is time-consuming and expensive, which is commonly thought to be the reason why automated natural language understanding has still not made its way into “real-life” applications yet.

This book sets an ambitious goal: to shift the development of language processing systems to a much more automated setting than previous works. A new approach is defined: what if computers analysed large samples of language data on their own, identifying structural regularities that perform the necessary abstractions and generalisations in order to better understand language in the process?
After defining the framework of Structure Discovery and shedding light on the nature and the graphic structure of natural language data, several procedures are described that do exactly this: let the computer discover structures without supervision in order to boost the performance of language technology applications. Here, multilingual documents are sorted by language, word classes are identified, and semantic ambiguities are discovered and resolved without using a dictionary or other explicit human input. The book concludes with an outlook on the possibilities implied by this paradigm and sets the methods in perspective to human computer interaction.

The target audience are academics on all levels (undergraduate and graduate students, lecturers and professors) working in the fields of natural language processing and computational linguistics, as well as natural language engineers who are seeking to improve their systems.

Authors and Affiliations

  • Department of Computer Science, Technical University Darmstadt, Darmstadt, Germany

    Chris Biemann

Bibliographic Information

Buy it now

Buying options

eBook USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access