Text, Speech and Language Technology

Treebanks

Building and Using Parsed Corpora

Editors: Abeillé, A. (Ed.)

Buy this book

eBook $89.00
price for USA (gross)
  • ISBN 978-94-010-0201-1
  • Digitally watermarked, DRM-free
  • Included format: PDF
  • ebooks can be used on all reading devices
  • Immediate eBook download after purchase
Hardcover $269.00
price for USA
  • ISBN 978-1-4020-1334-8
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
Softcover $119.00
price for USA
  • ISBN 978-1-4020-1335-5
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
About this book

Linguists and engineers in Natural Language Processing increasingly use electronic corpora. Most research has long been limited to raw (unannotated) texts or to tagged texts (annotated with parts of speech only), but these approaches suffer from a word-by-word perspective. A new line of research involves corpora with richer annotations, such as clauses and major constituents, grammatical functions, and dependency links. The first parsed corpora were the English Lancaster treebank and the Penn Treebank. New ones have recently been developed for other languages.
This book:

  • presents the state of the art in work on parsed corpora;

  • gathers 21 papers on building and using parsed corpora, raising many relevant questions;

  • deals with a variety of languages and a variety of corpora;

  • is for those working in linguistics, computational linguistics, natural language, syntax, and grammar.

Reviews

From the reviews:

"Anne Abeillé draws together a collection of fifteen short pieces focused primarily on the issues that come up in creating treebanks, demonstrated across an impressive variety of languages, along with six chapters on how treebanks are used. … For computational linguists working on automatic parsing, a pass through this book should be required … . The reader … will be rewarded with a clear sense of the challenge and the promise of systematically applying theoretically motivated linguistic representations to ‘language in the large’." (Philip Resnik, Language, Vol. 83 (4), 2007)


Table of contents (21 chapters)

  • The Penn Treebank: An Overview

    Taylor, Ann (et al.)

    Pages 5-22

  • Thoughts on Two Decades of Drawing Trees

    Sampson, Geoffrey

    Pages 23-41

  • Bank of English and Beyond

    Järvinen, Timo

    Pages 43-59

  • Completing Parsed Corpora

    Wallis, Sean

    Pages 61-71

  • Syntactic Annotation of a German Newspaper Corpus

    Brants, Thorsten (et al.)

    Pages 73-87

Bibliographic Information
Book Title
Treebanks
Book Subtitle
Building and Using Parsed Corpora
Editors
  • A. Abeillé
Series Title
Text, Speech and Language Technology
Series Volume
20
Copyright
2003
Publisher
Springer Netherlands
Copyright Holder
Springer Science+Business Media B.V.
eBook ISBN
978-94-010-0201-1
DOI
10.1007/978-94-010-0201-1
Hardcover ISBN
978-1-4020-1334-8
Softcover ISBN
978-1-4020-1335-5
Series ISSN
1386-291X
Edition Number
1
Number of Pages
XXVI, 407
Topics