- A reference source for researchers and students coming to the field of comparable corpora
- Identifies the state of the art in the field as well as future trends
- Written by experts in the field
- Includes supplementary material: sn.pub/extras
Table of contents (17 chapters)
- Front Matter
- Compiling and Measuring Comparable Corpora
- Using Comparable Corpora
About this book
The 1990s saw a paradigm change in the use of corpus-driven methods in NLP. In the field of multilingual NLP (such as machine translation and terminology mining) this implied the use of parallel corpora. However, parallel resources are relatively scarce: many more texts are produced daily by native speakers of any given language than are translated. This situation resulted in a natural drive towards the use of comparable corpora, i.e. non-parallel texts in the same domain or genre. Nevertheless, this research direction has not yet produced a single authoritative source suitable for researchers and students coming to the field.
The proposed volume provides a reference source, identifying the state of the art in the field as well as future trends. The book is intended for specialists and students in natural language processing, machine translation and computer-assisted translation.
Editors and Affiliations
- Serge Sharoff: Centre for Translation Studies, University of Leeds, Leeds, United Kingdom
- Reinhard Rapp: University of Mainz, Mainz, Germany
- Pierre Zweigenbaum: Université de Paris-Sud LIMSI-CNRS, Orsay, France
- Pascale Fung: Electronic & Computer Engineering, The Hong Kong University of Science and Technology, Hong Kong, People's Republic of China
Bibliographic Information
Book Title: Building and Using Comparable Corpora
Editors: Serge Sharoff, Reinhard Rapp, Pierre Zweigenbaum, Pascale Fung
DOI: https://doi.org/10.1007/978-3-642-20128-8
Publisher: Springer Berlin, Heidelberg
eBook Packages: Computer Science, Computer Science (R0)
Copyright Information: Springer-Verlag Berlin Heidelberg 2013
Hardcover ISBN: 978-3-642-20127-1 (published 07 January 2014)
Softcover ISBN: 978-3-662-52006-2 (published 23 August 2016)
eBook ISBN: 978-3-642-20128-8 (published 13 December 2013)
Edition Number: 1
Number of Pages: XII, 335
Number of Illustrations: 56 b/w illustrations, 14 illustrations in colour
Topics: Natural Language Processing (NLP), Computational Linguistics, Information Systems Applications (incl. Internet)