Name: Learning from Imbalanced Data Sets
ISBN: 978-3-319-98074-4

Overview

Authors:

Alberto Fernández,
Salvador García,
Mikel Galar,
Ronaldo C. Prati (ORCID: https://orcid.org/0000-0001-8597-4987),
Bartosz Krawczyk,
Francisco Herrera

Alberto Fernández
Department of Computer Science and AI, University of Granada, Granada, Spain

Salvador García
Department of Computer Science and AI, University of Granada, Granada, Spain

Mikel Galar
Institute of Smart Cities, Public University of Navarre, Pamplona, Spain

Ronaldo C. Prati
Department of Computer Science, Universidade Federal do ABC, Santo Andre, Brazil

Bartosz Krawczyk
Department of Computer Science, Virginia Commonwealth University, Richmond, USA

Francisco Herrera
Department of Computer Science and AI, University of Granada, Granada, Spain

Offers a comprehensive review of imbalanced learning, which is widely used in real-world applications such as fraud detection and disease diagnosis
Provides the user with the background and software tools needed to deal with imbalanced data
Presents the latest advances in the field of learning with imbalanced data, including Big Data applications and non-classical problems such as semi-supervised learning, multi-label and multi-instance learning, and ordinal classification and regression
Includes case studies

120k Accesses
1232 Citations
16 Altmetric

About this book

This book provides a general and comprehensible overview of imbalanced learning. It gives a formal description of the problem and focuses on its main features and the most relevant solutions proposed so far. It also considers the Data Science scenarios in which imbalanced classification poses a real challenge.

This book stresses the gap with standard classification tasks by reviewing the case studies and ad hoc performance metrics applied in this area. It also covers the approaches traditionally used to address skewed class distributions in binary problems. Specifically, it reviews cost-sensitive learning, data-level preprocessing methods and algorithm-level solutions, as well as ensemble-learning solutions that embed any of these alternatives. Furthermore, it addresses the extension to multi-class problems, where the classical methods can no longer be applied in a straightforward way.

This book also focuses on the intrinsic characteristics of the data which, added to the uneven class distribution, are the main causes that truly hinder the performance of classification algorithms in this scenario. Some notes on data reduction are then provided to explain the advantages of this type of approach.

Finally, this book introduces some novel areas of study that are attracting growing attention in relation to imbalanced data. Specifically, it considers the classification of data streams, non-classical classification problems, and scalability in Big Data settings. Examples of software libraries and modules for imbalanced classification are provided.

This book is highly suitable for technical professionals and for senior undergraduate and graduate students in data science, computer science and engineering. It will also help scientists and researchers gain insight into current developments in this area of study, as well as future research directions.

Table of contents (14 chapters)

All 14 chapters are by Alberto Fernández, Salvador García, Mikel Galar, Ronaldo C. Prati, Bartosz Krawczyk and Francisco Herrera.

Front Matter (pages i-xviii)
Introduction to KDD and Data Science (pages 1-17)
Foundations on Imbalanced Classification (pages 19-46)
Performance Measures (pages 47-61)
Cost-Sensitive Learning (pages 63-78)
Data Level Preprocessing Methods (pages 79-121)
Algorithm-Level Approaches (pages 123-146)
Ensemble Learning (pages 147-196)
Imbalanced Classification with Multiple Classes (pages 197-226)
Dimensionality Reduction for Imbalanced Learning (pages 227-251)
Data Intrinsic Characteristics (pages 253-277)
Learning from Imbalanced Data Streams (pages 279-303)
Non-classical Imbalanced Classification Problems (pages 305-325)
Imbalanced Classification for Big Data (pages 327-349)
Software and Libraries for Imbalanced Classification (pages 351-377)

Authors and Affiliations

Department of Computer Science and AI, University of Granada, Granada, Spain
Alberto Fernández, Salvador García, Francisco Herrera

Institute of Smart Cities, Public University of Navarre, Pamplona, Spain
Mikel Galar

Department of Computer Science, Universidade Federal do ABC, Santo Andre, Brazil
Ronaldo C. Prati

Department of Computer Science, Virginia Commonwealth University, Richmond, USA
Bartosz Krawczyk

Bibliographic Information

Book Title: Learning from Imbalanced Data Sets
Authors: Alberto Fernández, Salvador García, Mikel Galar, Ronaldo C. Prati, Bartosz Krawczyk, Francisco Herrera
DOI: https://doi.org/10.1007/978-3-319-98074-4
Publisher: Springer Cham
eBook Packages: Computer Science, Computer Science (R0)
Copyright Information: Springer Nature Switzerland AG 2018
Hardcover ISBN: 978-3-319-98073-7 (published 01 November 2018)
Softcover ISBN: 978-3-030-07446-3 (published 19 January 2019)
eBook ISBN: 978-3-319-98074-4 (published 22 October 2018)
Edition Number: 1
Number of Pages: XVIII, 377
Number of Illustrations: 21 b/w illustrations, 50 illustrations in colour
Topics: Artificial Intelligence, Information Systems and Communication Service, Computer Communication Networks
