Skip to main content

Finite-State Text Processing

  • Book
  • © 2021

Overview

Part of the book series: Synthesis Lectures on Human Language Technologies (SLHLT)

  • 1640 Accesses

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 49.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 64.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (8 chapters)

About this book

Weighted finite-state transducers (WFSTs) are commonly used by engineers and computational linguists for processing and generating speech and text. This book first provides a detailed introduction to this formalism. It then introduces Pynini, a Python library for compiling finite-state grammars and for combining, optimizing, applying, and searching finite-state transducers. This book illustrates this library's conventions and use with a series of case studies. These include the compilation and application of context-dependent rewrite rules, the construction of morphological analyzers and generators, and text generation and processing applications.

Authors and Affiliations

  • Graduate Center, City University of New York, USA

    Kyle Gorman

  • Google LLC, USA

    Richard Sproat

About the authors

Kyle Gorman is an assistant professor of linguistics at the Graduate Center, City University of New York, where he directs the master’s program in computational linguistics; he also works as a software engineer at Google. He was previously an assistant professor at the Oregon Health & Science University in Portland. He holds a Ph.D. in linguistics from the University of Pennsylvania. His research interests include phonology, morphology, and speech and text processing. He is a maintainer of the OpenFst and OpenGrm libraries and the creator of Pynini. He lives in Brooklyn.Richard Sproat received his Ph.D. in linguistics from the Massachusetts Institute of Technology in 1985. Since then, he has worked in a number of areas of linguistics and computational linguistics, but he is perhaps best known for his work on text normalization for speech applications such as text-to-speech synthesis. His recent interests include neural text processing, finite-state methods, and computational modelsof writing systems. He is currently a research scientist at Google in Tokyo.

Bibliographic Information

Publish with us