Go to:
Logótipo
Você está em: Start > Publications > View > Predicting Age from Human Lung Tissue Through Multi-modal Data Integration
Map of Premises
Principal
Publication

Predicting Age from Human Lung Tissue Through Multi-modal Data Integration

Title
Predicting Age from Human Lung Tissue Through Multi-modal Data Integration
Type
Article in International Conference Proceedings Book
Year
2023
Authors
Moraes, A
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Moreno, M
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Ribeiro, R
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Conference proceedings International
Pages: 644-658
26th International Conference on Discovery Science, DS 2023
Porto, 9 October 2023 through 11 October 2023
Indexing
Other information
Authenticus ID: P-00Z-820
Abstract (EN): The accurate prediction of biological age can bring important benefits in promoting therapeutic and behavioural strategies for healthy aging. We propose the development of age prediction models using multi-modal datasets, including transcriptomics, methylation and histological images from lung tissue samples of 793 human donors. From a technical point of view this is a challenging problem since not all donors are covered by the same data modalities and the datasets have a very high feature dimensionality with a relatively smaller number of samples. To fairly compare performance across different data types, we¿ve created a test set including donors represented in each modality. Given the unique characteristics of the data distribution, we developed gradient boosting tree and convolutional neural network models for each dataset. The performance of the models can be affected by several covariates, including smoking history, and, most importantly, by a skewed distribution of age. Data-centric approaches, including feature engineering, feature selection, data stratification and resampling, proved fundamental in building models that were optimally adapted for each data modality, resulting in significant improvements in model performance for imbalanced regression. The models were then applied to the test set independently, and later combined into a multi-modal ensemble through a voting strategy, predicting age with a median absolute error of 4 years. Even if prediction accuracy remains a challenge, in this work we provide insights to address the difficulties of multi-modal data integration and imbalanced data prediction. © 2023, The Author(s), under exclusive license to Springer Nature Switzerland AG.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 14
Documents
We could not find any documents associated to the publication.
Related Publications

Of the same authors

Integration of multi-modal datasets to estimate human aging (2024)
Article in International Scientific Journal
Ribeiro, R; Moraes, A; Moreno, M; Ferreira, PG
Recommend this page Top
Copyright 1996-2025 © Faculdade de Medicina Dentária da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2025-08-24 at 04:40:22 | Privacy Policy | Personal Data Protection Policy | Whistleblowing | Electronic Yellow Book