Skip to main content
Book cover

Treebanks

Building and Using Parsed Corpora

  • Book
  • © 2003

Overview

Part of the book series: Text, Speech and Language Technology (TLTB, volume 20)

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (21 chapters)

  1. Building Treebanks

    1. English treebanks

    2. German treebanks

    3. Slavic treebanks

    4. Treebanks for romance languages

    5. Treebanks for other languages

  2. Using Treebanks

    1. Evaluation with treebanks

Keywords

About this book

Linguists and engineers in Natural Language Processing tend to use electronic corpora more and more. Most research has long been limited to raw (unannotated) texts or to tagged texts (annotated with parts of speech only), but these approaches suffer from a word by word perspective. A new line of research involves corpora with richer annotations such as clauses and major constituents, grammatical functions and dependency links. The first parsed corpora were the English Lancaster treebank and Penn Treebank. New ones have recently been developed for other languages.
This book:

provides a state of the art on work being done with parsed corpora;

gathers 21 papers on building and using parsed corpora raising many relevant questions;

deals with a variety of languages and a variety of corpora;

is for those working in linguistics, computational linguistics, natural language, syntax, and grammar.

Reviews

From the reviews:

"Anne Abeillé draws together a collection of fifteen short pieces focused primarily on the issues that come up in creating treebanks, demonstrated across an impressive variety of languages, along with six chapters on how treebanks are used. … For computational linguists working on automatic parsing, a pass through this book should be required … . The reader … will be rewarded with a clear sense of the challenge and the promise of systematically applying theoretically motivated linguistic representations to ‘language in the large’." (Philip Resnik, Language, Vol. 83 (4), 2007)

Editors and Affiliations

  • Universite Paris 7, Paris, France

    Anne Abeillé

Bibliographic Information

Publish with us