Go to:
Logótipo
Comuta visibilidade da coluna esquerda
Você está em: Start > Publications > View > A unifying view of class overlap and imbalance: Key concepts, multi-view panorama, and open avenues for research
Publication

Publications

A unifying view of class overlap and imbalance: Key concepts, multi-view panorama, and open avenues for research

Title
A unifying view of class overlap and imbalance: Key concepts, multi-view panorama, and open avenues for research
Type
Article in International Scientific Journal
Year
2023
Authors
Santos, MS
(Author)
Other
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page Without ORCID
Japkowicz, N
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Fernandez, A
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Santos, J
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. View Authenticus page Without ORCID
Journal
Title: Information FusionImported from Authenticus Search for Journal Publications
Vol. 89
Pages: 228-253
ISSN: 1566-2535
Publisher: Elsevier
Other information
Authenticus ID: P-00X-3XG
Abstract (EN): The combination of class imbalance and overlap is currently one of the most challenging issues in machine learning. While seminal work focused on establishing class overlap as a complicating factor for classification tasks in imbalanced domains, ongoing research mostly concerns the study of their synergy over real-word applications. However, given the lack of a well-formulated definition and measurement of class overlap in real-world domains, especially in the presence of class imbalance, the research community has not yet reached a consensus on the characterisation of both problems. This naturally complicates the evaluation of existing approaches to address these issues simultaneously and prevents future research from moving towards the devise of specialised solutions. In this work, we advocate for a unified view of the problem of class overlap in imbalanced domains. Acknowledging class overlap as the overarching problem - since it has proven to be more harmful for classification tasks than class imbalance - we start by discussing the key concepts associated to its definition, identification, and measurement in real-world domains, while advocating for a characterisation of the problem that attends to multiple sources of complexity. We then provide an overview of existing data complexity measures and establish the link to what specific types of class overlap problems these measures cover, proposing a novel taxonomy of class overlap complexity measures. Additionally, we characterise the relationship between measures, the insights they provide, and discuss to what extent they account for class imbalance. Finally, we systematise the current body of knowledge on the topic across several branches of Machine Learning (Data Analysis, Data Preprocessing, Algorithm Design, and Meta-learning), identifying existing limitations and discussing possible lines for future research.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 26
Documents
We could not find any documents associated to the publication.
Related Publications

Of the same journal

SWINN: Efficient nearest neighbor search in sliding windows using graphs (2024)
Article in International Scientific Journal
Mastelini, SM; Veloso, B; Halford, M; de Carvalho, ACPDF; João Gama
Preference rules for label ranking: Mining patterns in multi-target relations (2018)
Article in International Scientific Journal
Cláudio Rebelo de Sá; Paulo Azevedo; Carlos Soares; Alípio Mário Jorge; Arno Knobbe
Multimodal inverse perspective mapping (2014)
Article in International Scientific Journal
Oliveira, M; Santos, V; Sappa, AD
MARESye: A hybrid imaging system for underwater robotic applications (2020)
Article in International Scientific Journal
Pinto, AM; Aníbal Castilho Coimbra de Matos
Hyperparameter self-tuning for data streams (2021)
Article in International Scientific Journal
Veloso, B; João Gama; Malheiro, B; Vinagre, J

See all (8)

Recommend this page Top
Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2025-08-06 at 22:10:45 | Privacy Policy | Personal Data Protection Policy | Whistleblowing