Go to:
Logótipo
Comuta visibilidade da coluna esquerda
Você está em: Start > Publications > View > The Impact of Feature Selection on Balancing, Based on Diabetes Data
Publication

Publications

The Impact of Feature Selection on Balancing, Based on Diabetes Data

Title
The Impact of Feature Selection on Balancing, Based on Diabetes Data
Type
Article in International Conference Proceedings Book
Year
2024
Authors
Machado, D
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Costa, VS
(Author)
FCUP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page
Conference proceedings International
Pages: 125-145
16th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC)
Lisbon, PORTUGAL, FEB 16-18, 2023
Indexing
Publicação em ISI Web of Knowledge ISI Web of Knowledge - 0 Citations
Publicação em Scopus Scopus - 0 Citations
Other information
Authenticus ID: P-011-3FK
Abstract (EN): Diabetes management data is composed of diverse factors and glycaemia indicators. Glycaemia predictive models tend to focus solely on glycaemia values. A comprehensive understanding of diabetes management requires the consideration of several aspects of diabetes management, beyond glycaemia. However, the inclusion of every aspect of diabetes management can create an overly high-dimensional data set. Excessive feature spaces increase computational complexity and may introduce over-fitting. Additionally, the inclusion of inconsequential features introduces noise that hinders a model's performance. Feature importance is a process that evaluates a feature's value, and can be used to identify optimal feature sub-sets. Depending on the context, multiple methods can be used. The drop feature method, in the literature, is considered to be the best approach to evaluate individual feature importance. To reach an optimal set, the best approach is branch and bound, albeit its heavy computational cost. This overhead can be addressed through a trade-off between the feature set's optimisation level and the process' computational feasibility. The improvement of the feature space has implications on the effectiveness of data balancing approaches. Whilst, in this study, the observed impact was not substantial, it warrants the need to reconsider the balancing approach given a superior feature space.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 21
Documents
We could not find any documents associated to the publication.
Related Publications

Of the same authors

Using Balancing Methods to Improve Glycaemia-Based Data Mining (2023)
Article in International Conference Proceedings Book
Machado, D; Costa, VS; Pedro Brandão
Impact of the glycaemic sampling method in diabetes data mining (2022)
Article in International Conference Proceedings Book
Machado, D; Costa, VS; Pedro Brandão
Diabetes Management Guidance by a Logical Unit Supported by Data-Mining in a Mobile Application (2020)
Article in International Conference Proceedings Book
Machado, D; Costa, VS; Ines Dutra; Pedro Brandão
Recommend this page Top
Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2025-09-27 at 01:36:31 | Privacy Policy | Personal Data Protection Policy | Whistleblowing | Electronic Yellow Book