  • Book
  • © 2020

Speech-to-Speech Translation

  • Introduces speech-to-speech translation in its entirety, from its start to its final destination
  • Speech-to-speech translation is a remarkable success of machine learning technology based on big data
  • Explains system evaluation methods, which are of high importance because they guide development properly and help prospective users understand the competence of the system

Part of the book series: SpringerBriefs in Computer Science (BRIEFSCOMPUTER)

Buying options

Softcover Book USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Table of contents (7 chapters)

  1. Front Matter

    Pages i-xiv
  2. Multilingualization of Speech Processing

    • Hiroaki Kato, Shoji Harada, Tasuku Kitade, Yoshinori Shiga
    Pages 1-20
  3. Automatic Speech Recognition

    • Xugang Lu, Sheng Li, Masakiyo Fujimoto
    Pages 21-38
  4. Text-to-Speech Synthesis

    • Yoshinori Shiga, Jinfu Ni, Kentaro Tachibana, Takuma Okamoto
    Pages 39-52
  5. Language Translation

    • Kenji Imamura
    Pages 53-66
  6. Field Experiment System “VoiceTra”

    • Yutaka Ashikari, Hisashi Kawai
    Pages 67-75
  7. Measuring the Capability of a Speech Translation System

    • Fumiaki Sugaya, Keiji Yasuda
    Pages 77-85

About this book

This book provides readers with retrospective and prospective views, along with detailed explanations of the component technologies: speech recognition, language translation, and speech synthesis.


A speech-to-speech translation (S2S) system makes it possible to break language barriers, i.e., to let any pair of people on the globe communicate with each other, which is one of the ultimate dreams of humankind.


People, societies, and economies connected by S2S will, without exception, demonstrate explosive growth.


In 1986, Japan initiated basic research on S2S; the idea then spread worldwide and was explored in depth by researchers over three decades.


Now, we see S2S applications on smartphones and tablets around the world.


Computational resources such as processors, memory, and wireless communication accelerate these computation-intensive systems, and the accumulation of digital speech and language data encourages recent approaches based on machine learning.


Through field experiments following long research in laboratories, S2S systems have become well developed and are now ready to be utilized in daily life.


A unique chapter of this book covers end-to-end evaluation, which compares the system’s performance with human competence. The effectiveness of the system can be understood from the score of this evaluation.


The book ends with one of the next focuses of S2S: simultaneous interpretation technology for lectures, broadcast news, and so on.

Editors and Affiliations

  • Advanced Speech Translation Research and Development Promotion Center, National Institute of Information and Communications Technology, Kyoto, Japan

    Yutaka Kidawara

  • Advanced Translation Technology Laboratory, Advanced Speech Translation Research and Development Promotion Center, National Institute of Information and Communications Technology, Kyoto, Japan

    Eiichiro Sumita

  • Advanced Speech Technology Laboratory, Advanced Speech Translation Research and Development Promotion Center, National Institute of Information and Communications Technology, Kyoto, Japan

    Hisashi Kawai

Bibliographic Information

  • Book Title: Speech-to-Speech Translation

  • Editors: Yutaka Kidawara, Eiichiro Sumita, Hisashi Kawai

  • Series Title: SpringerBriefs in Computer Science

  • DOI: https://doi.org/10.1007/978-981-15-0595-9

  • Publisher: Springer Singapore

  • eBook Packages: Computer Science, Computer Science (R0)

  • Copyright Information: The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2020

  • Softcover ISBN: 978-981-15-0594-2 (Published: 06 December 2019)

  • eBook ISBN: 978-981-15-0595-9 (Published: 22 November 2019)

  • Series ISSN: 2191-5768

  • Series E-ISSN: 2191-5776

  • Edition Number: 1

  • Number of Pages: XIV, 91

  • Number of Illustrations: 41 b/w illustrations, 9 illustrations in colour

  • Topics: Natural Language Processing (NLP), Signal, Image and Speech Processing
