Name: Statistical Methods for Imbalanced Data in Ecological and Biological Studies
ISBN: 978-4-431-55570-4

Overview

Authors:

Osamu Komori, Seikei University, Musashino, Japan
Shinto Eguchi, The Institute of Statistical Mathematics, Tachikawa, Japan

Focuses on the problem caused by imbalanced data often observed in ecology and biology
Introduces the latest statistical methods for imbalanced data
Demonstrates the application of statistical methods to several real data sets

Part of the book series: SpringerBriefs in Statistics (BRIEFSSTATIST)

Part of the book sub series: JSS Research Series in Statistics (JSSRES)

4606 Accesses
8 Citations
2 Altmetric

Table of contents (5 chapters)

Front Matter

Pages i-viii
Introduction to Imbalanced Data
- Osamu Komori, Shinto Eguchi
Pages 1-10
Weighted Logistic Regression
- Osamu Komori, Shinto Eguchi
Pages 11-25
\(\beta\)-Maxent
- Osamu Komori, Shinto Eguchi
Pages 27-33
Generalized T-Statistic
- Osamu Komori, Shinto Eguchi
Pages 35-43
Machine Learning Methods for Imbalanced Data
- Osamu Komori, Shinto Eguchi
Pages 45-55
Back Matter

Pages 57-59

About this book

This book takes a new approach: it provides a comprehensive review of recent work on the challenging problems that imbalanced data cause in prediction and classification, and it introduces several of the latest statistical methods for dealing with these problems. The book discusses the imbalance of data from two points of view. The first is quantitative imbalance, meaning that the sample size in one population greatly exceeds that in another. It includes presence-only data as an extreme case, where the presence of a species is confirmed but information on its absence is uncertain; this is especially common in ecology when predicting habitat distribution. The second is qualitative imbalance, meaning that the data distribution of one population can be well specified whereas that of the other is highly heterogeneous. A typical case is the existence of outliers commonly observed in gene expression data, and another is the heterogeneous characteristics often observed in the case group in case-control studies. Extensions of the logistic regression model, maxent, and AdaBoost for imbalanced data are discussed, providing a new framework for improving prediction, classification, and variable selection performance. The weight functions introduced in these methods play an important role in alleviating the imbalance of data. This book also furnishes a new perspective on these problems and shows some applications of the recently developed statistical methods to real data sets.

Authors and Affiliations

Seikei University, Musashino, Japan
Osamu Komori

The Institute of Statistical Mathematics, Tachikawa, Japan
Shinto Eguchi

About the authors

Osamu Komori, The Institute of Statistical Mathematics
Shinto Eguchi, The Institute of Statistical Mathematics

Bibliographic Information

Book Title: Statistical Methods for Imbalanced Data in Ecological and Biological Studies
Authors: Osamu Komori, Shinto Eguchi
Series Title: SpringerBriefs in Statistics
DOI: https://doi.org/10.1007/978-4-431-55570-4
Publisher: Springer Tokyo
eBook Packages: Mathematics and Statistics, Mathematics and Statistics (R0)
Copyright Information: The Author(s), under exclusive licence to Springer Japan KK 2019
Softcover ISBN: 978-4-431-55569-8 (published 15 July 2019)
eBook ISBN: 978-4-431-55570-4 (published 02 July 2019)
Series ISSN: 2191-544X
Series E-ISSN: 2191-5458
Edition Number: 1
Number of Pages: VIII, 59
Number of Illustrations: 15 b/w illustrations, 7 illustrations in colour
Topics: Statistics for Life Sciences, Medicine, Health Sciences; Statistical Theory and Methods; Biostatistics; Statistics for Social Sciences, Humanities, Law
