Go to:
Logótipo
Você está em: Start > Publications > View > Chebyshev approaches for imbalanced data streams regression models
Map of Premises
Principal
Publication

Chebyshev approaches for imbalanced data streams regression models

Title
Chebyshev approaches for imbalanced data streams regression models
Type
Article in International Scientific Journal
Year
2021
Authors
Aminian, E
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Rita Ribeiro
(Author)
FCUP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page
João Gama
(Author)
FEP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page
Journal
Vol. 35
Pages: 2389-2466
ISSN: 1384-5810
Publisher: Springer Nature
Other information
Authenticus ID: P-00V-EJH
Abstract (EN): In recent years data stream mining and learning from imbalanced data have been active research areas. Even though solutions exist to tackle these two problems, most of them are not designed to handle challenges inherited from both problems. As far as we are aware, the few approaches in the area of learning from imbalanced data streams fall in the context of classification, and no efforts on the regression domain have been reported yet. This paper proposes a technique that uses sampling strategies to cope with imbalanced data streams in a regression setting, where the most important cases have rare and extreme target values. Specifically, we employ under-sampling and over-sampling strategies that resort to Chebyshev's inequality value as a heuristic to disclose the type of incoming cases (i.e. frequent or rare). We have evaluated our proposal by applying it in the training of models by four well-known regression algorithms over fourteen benchmark data sets. We conducted a series of experiments with different setups on both synthetic and real-world data sets. The experimental results confirm our approach's effectiveness by showing the models' superior performance trained by each of the sampling strategies compared with their baseline pairs.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 78
Documents
We could not find any documents associated to the publication.
Related Publications

Of the same authors

A Study on Imbalanced Data Streams (2020)
Article in International Scientific Journal
Aminian, E; Rita Ribeiro; João Gama
Current Trends in Learning from Data Streams (2021)
Article in International Conference Proceedings Book
João Gama; Veloso, B; Aminian, E; Rita Ribeiro

Of the same journal

Guest editors introduction: special issue of the ECMLPKDD 2015 journal track (2015)
Another Publication in an International Scientific Journal
Bielza, C; João Gama; Jorge, AM; Zliobaite, I
Guest Editorial: Special Issue on Data Mining for Geosciences (2019)
Another Publication in an International Scientific Journal
Jorge, AM; Lopes, RL; Larrazabal, G; Nikhalat Jahromi, H
Very fast decision rules for classification in data streams (2015)
Article in International Scientific Journal
Kosina, P; João Gama
Probabilistic change detection and visualization methods for the assessment of temporal stability in biomedical data quality (2015)
Article in International Scientific Journal
Carlos Saez; Pedro Pereira Rodrigues; João Gama; Montserrat Robles; Juan M Garcia Gomez
Novel features for time series analysis: a complex networks approach (2022)
Article in International Scientific Journal
Silva, VF; Maria Eduarda Silva; Pedro Ribeiro; Silva, F

See all (14)

Recommend this page Top
Copyright 1996-2025 © Faculdade de Medicina Dentária da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2025-07-19 at 02:15:27 | Privacy Policy | Personal Data Protection Policy | Whistleblowing | Electronic Yellow Book