Skip to main content
  • Book
  • © 2015

Multiword Expressions Acquisition

A Generic and Open Framework

Authors:

  • A unique, complete and up-to-date overview of research in multiword expressions
  • Concrete experimental results and examples are included
  • A new generic framework for automatic acquisition of multiword expressions from texts is introduced

Buy it now

Buying options

eBook USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (8 chapters)

  1. Front Matter

    Pages i-xiv
  2. Introduction

    • Carlos Ramisch
    Pages 1-19
  3. Multiword Expressions: A Tough Nut to Crack

    1. Front Matter

      Pages 21-21
    2. Definitions and Characteristics

      • Carlos Ramisch
      Pages 23-51
    3. State of the Art in MWE Processing

      • Carlos Ramisch
      Pages 53-102
  4. MWE Acquisition

    1. Front Matter

      Pages 103-103
    2. Evaluation of MWE Acquisition

      • Carlos Ramisch
      Pages 105-125
    3. A New Framework for MWE Acquisition

      • Carlos Ramisch
      Pages 127-155
  5. Applications

    1. Front Matter

      Pages 157-157
    2. Application 1: Lexicography

      • Carlos Ramisch
      Pages 159-179
    3. Application 2: Machine Translation

      • Carlos Ramisch
      Pages 181-199
    4. Conclusions

      • Carlos Ramisch
      Pages 201-205
  6. Back Matter

    Pages 207-230

About this book

​This book is an excellent introduction to multiword expressions. It provides a unique, comprehensive and up-to-date overview of this exciting topic in computational linguistics. The first part describes the diversity and richness of multiword expressions, including many examples in several languages. These constructions are not only complex and arbitrary, but also much more frequent than one would guess, making them a real nightmare for natural language processing applications. 

The second part introduces a new generic framework for automatic acquisition of multiword expressions from texts. Furthermore, it describes the accompanying free software tool, the mwetoolkit, which comes in handy when looking for expressions in texts (regardless of the language). Evaluation is greatly emphasized, underlining the fact that results depend on parameters like corpus size, language, MWE type, etc. The last part contains solid experimental results and evaluates the mwetoolkit, demonstrating its usefulness for computer-assisted lexicography and machine translation.

This is the first book to cover the whole pipeline of multiword expression acquisition in a single volume. It is addresses the needs of students and researchers in computational and theoretical linguistics, cognitive sciences, artificial intelligence and computer science. Its good balance between computational and linguistic views make it the perfect starting point for anyone interested in multiword expressions, language and text processing in general.

Reviews

“The motivating idea behind this work is to explore and compare approaches to MWE, involving various tools as well as human resources. … Much information is given to enable other researchers to investigate MWEs. The book contains a vast amount of information. … An extensive bibliography follows each chapter. There are helpful appendices, including a list of standard part of speech tags.” (Alice Davison, Computing Reviews, September, 2015)

Authors and Affiliations

  • Aix Marseille University, Marseille, France

    Carlos Ramisch

About the author

Carlos Ramisch is a researcher and lecturer at the Aix-Marseille University (France). He holds a double PhD in computer science from Grenoble University (France) and UFRGS (Brazil). His research interests are multiword expressions, semantics and multilingualism. Carlos coordinated many events, including the MWE workshops (2010, 2011, 2013) and the ACM TSLP special issue. He is the creator and developer of the mwetoolkit.

Bibliographic Information

Buy it now

Buying options

eBook USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access