Go to:
Logótipo
Comuta visibilidade da coluna esquerda
Você está em: Start > Publications > View > Over-sampling methods for mixed data in imbalanced problems
Publication

Publications

Over-sampling methods for mixed data in imbalanced problems

Title
Over-sampling methods for mixed data in imbalanced problems
Type
Article in International Scientific Journal
Year
2025
Authors
Alonso, H
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Da Costa, JP
(Author)
FCUP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page
Journal
Pages: 1-23
ISSN: 0361-0918
Publisher: Taylor & Francis
Indexing
Publicação em ISI Web of Knowledge ISI Web of Knowledge - 0 Citations
Publicação em Scopus Scopus - 0 Citations
Scientific classification
CORDIS: Technological sciences
FOS: Natural sciences
Other information
Authenticus ID: P-017-QHG
Abstract (EN): In practice, it is common to find imbalanced classification problems, where one or more classes have many fewer examples than the others. There are several ways to deal with imbalance in order to improve the classification results in the less represented class(es) and one of them consists in applying re-sampling methods. Furthermore, it is no less common for data sets in imbalanced classification problems to be a mix of nominal, ordinal, quantitative discrete and continuous data. However, the true nature of the data tends to be ignored, like when ordinal data are treated as nominal. In this paper, we propose several re-sampling methods for mixed data, which take into account the four scales of measurement usually found in real data. They are based on the popular synthetic minority over-sampling technique or SMOTE. We consider different measures of distance adequate for mixed data. We also introduce new ways of creating the synthetic examples, using all of the nearest neighbors. We show through a comparative study that it pays off taking into account the true nature of the data and the new ways of creating synthetic examples.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 23
Documents
We could not find any documents associated to the publication.
Related Publications

Of the same authors

The unimodal model for the classification of ordinal data (vol 21, pg 78, 2008) (2014)
Other Publications
Pinto da Costa, J; Alonso, H; Cardoso, JS

Of the same journal

The effect of temporal aggregation on the estimation accuracy of time series models (2017)
Article in International Scientific Journal
Paulo Teles; Sousa, PSA
The effect of temporal aggregation on the estimation accuracy of ARMA models (2018)
Article in International Scientific Journal
Paulo Teles; Sousa, PSA
Test for rotational symmetry of data from the Watson distribution defined on the hypersphere (2004)
Article in International Scientific Journal
Adelaide Figueiredo
Resampling methods in ANOVA for data from the von Mises-Fisher distribution (2023)
Article in International Scientific Journal
Adelaide Figueiredo
Regularization with Maximum Entropy and Quantum Electrodynamics: The Merge(E) Estimators (2016)
Article in International Scientific Journal
Pedro Macedo; Manuel Scotto; Elvira Silva

See all (14)

Recommend this page Top
Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2025-07-28 at 22:54:25 | Privacy Policy | Personal Data Protection Policy | Whistleblowing