Name: Open-Set Text Recognition
ISBN: 978-981-97-0361-6

Overview

Authors:

Xu-Cheng Yin ⁰,
Chun Yang ¹,
Chang Liu ²

Xu-Cheng Yin
1. School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China
View author publications

You can also search for this author in PubMed Google Scholar
Chun Yang
1. School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China
View author publications

You can also search for this author in PubMed Google Scholar
Chang Liu
1. School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China
View author publications

You can also search for this author in PubMed Google Scholar

Helps readers to model and measure open-world challenges in applications like document digitization, etc
Introduces a framework for the OSTR, which helps readers to build solutions that strive for an evolving environment
Offers possible implementations of each module in the framework

Part of the book series: SpringerBriefs in Computer Science (BRIEFSCOMPUTER)

492 Accesses

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 39.99

Price excludes VAT (USA)

Softcover Book USD 49.99

Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (8 chapters)

Front Matter

Pages i-xiii

Download chapter PDF
Introduction
- Xu-Cheng Yin, Chun Yang, Chang Liu
Pages 1-4
Background
- Xu-Cheng Yin, Chun Yang, Chang Liu
Pages 5-25
Open-Set Text Recognition: Concept, Dataset, Protocol, and Framework
- Xu-Cheng Yin, Chun Yang, Chang Liu
Pages 27-52
Open-Set Text Recognition Implementations(I): Label-to-Representation Mapping
- Xu-Cheng Yin, Chun Yang, Chang Liu
Pages 53-65
Open-Set Text Recognition Implementations(II): Sample-to-Representation Mapping
- Xu-Cheng Yin, Chun Yang, Chang Liu
Pages 67-77
Open-Set Text Recognition Implementations(III): Open-set Predictor
- Xu-Cheng Yin, Chun Yang, Chang Liu
Pages 79-86
Open-Set Text Recognition: Case-Studies
- Xu-Cheng Yin, Chun Yang, Chang Liu
Pages 87-112
Discussions and Future Directions
- Xu-Cheng Yin, Chun Yang, Chang Liu
Pages 113-121

Keywords

About this book

In real-world applications, new data, patterns, and categories that were not covered by the training data can frequently emerge, necessitating the capability to detect and adapt to novel characters incrementally. Researchers refer to these challenges as the Open-Set Text Recognition (OSTR) task, which has, in recent years, emerged as one of the prominent issues in the field of text recognition. This book begins by providing an introduction to the background of the OSTR task, covering essential aspects such as open-set identification and recognition, conventional OCR methods, and their applications. Subsequently, the concept and definition of the OSTR task are presented encompassing its objectives, use cases, performance metrics, datasets, and protocols. A general framework for OSTR is then detailed, composed of four key components: The Aligned Represented Space, the Label-to-Representation Mapping, the Sample-to-Representation Mapping, and the Open-set Predictor. In addition,possible implementations of each module within the framework are discussed. Following this, two specific open-set text recognition methods, OSOCR and OpenCCD, are introduced. The book concludes by delving into applications and future directions of Open-set text recognition tasks.

This book presents a comprehensive overview of the open-set text recognition task, including concepts, framework, and algorithms. It is suitable for graduated students and young researchers who are majoring in pattern recognition and computer science, especially interdisciplinary research.

Authors and Affiliations

School of Computer and Communication Engineering, University of Science and Technology Beijing, Beijing, China

Xu-Cheng Yin, Chun Yang, Chang Liu

About the authors

Xu-Cheng Yin is a full professor, the director of Pattern Recognition and Artificial Intelligence Lab, Department of Computer Science and Technology, School of Computer and Communication Engineering, University of Science and Technology Beijing, China. He received the B.Sc. and M.Sc. degrees in computer science from the University of Science and Technology Beijing, China, in 1999 and 2002, respectively, and the Ph.D. degree in pattern recognition and intelligent systems from the Institute of Automation, Chinese Academy of Sciences, in 2006. He was a visiting professor in the College of Information and Computer Sciences, University of Massachusetts Amherst, USA, for three times (in 2013, 2014 and 2016). He recieved the National Science Fund for Distinguished Young Scholars in 2021. His research interests include pattern recognition, document analysis and recognition, computer vision, machine learning, and data mining.

Chun Yang received the B.Sc. and Ph.D. degrees in computer science from the

University of Science and Technology Beijing, China, in 2011 and 2018,

respectively. He is currently a lecturer with the School of Computer and

Communication Engineering, University of Science and Technology Beijing.

His current research interests include pattern

recognition, classifier ensemble, and document analysis and recognition.

Chang Liu received the B.Sc. degree in computer science from the University of

Science and Technology Beijing, China, in 2016, where he is currently pursuing

the Ph.D. degree with the Department of Computer Science and Technology.

His research interests include text detection,

few-shot learning, and text recognition.

Bibliographic Information

Book Title: Open-Set Text Recognition
Book Subtitle: Concepts, Framework, and Algorithms
Authors: Xu-Cheng Yin, Chun Yang, Chang Liu
Series Title: SpringerBriefs in Computer Science
DOI: https://doi.org/10.1007/978-981-97-0361-6
Publisher: Springer Singapore
eBook Packages: Computer Science, Computer Science (R0)
Copyright Information: The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024
Softcover ISBN: 978-981-97-0360-9Published: 02 April 2024
eBook ISBN: 978-981-97-0361-6Published: 01 April 2024
Series ISSN: 2191-5768
Series E-ISSN: 2191-5776
Edition Number: 1
Number of Pages: XIII, 121
Number of Illustrations: 2 b/w illustrations, 36 illustrations in colour
Topics: Computer Imaging, Vision, Pattern Recognition and Graphics, Machine Learning, Image Processing and Computer Vision

Publish with us

Policies and ethics

Open-Set Text Recognition