Go to:
Logótipo
Você está em: Start > Publications > View > ydata-profiling: Accelerating data-centric AI with high-quality data
Publication

ydata-profiling: Accelerating data-centric AI with high-quality data

Title
ydata-profiling: Accelerating data-centric AI with high-quality data
Type
Article in International Scientific Journal
Year
2023
Authors
Clemente, F
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Ribeiro, GM
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Quemy, A
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Santos, MS
(Author)
Other
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page Without ORCID
Pereira, RC
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. View Authenticus page Without ORCID
Barros, A
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Journal
Title: NeurocomputingImported from Authenticus Search for Journal Publications
Vol. 554
ISSN: 0925-2312
Publisher: Elsevier
Other information
Authenticus ID: P-010-7KQ
Abstract (EN): ydata-profiling is an open-source Python package for advanced exploratory data analysis that enables users to generate data profiling reports in a simple, fast, and efficient manner, fostering a standardized and visual understanding of the data. Beyond traditional descriptive properties and statistics, ydata-profiling follows a Data-Centric AI approach to exploratory analysis, as it focuses on the automatic detection and highlighting of complex data characteristics often associated with potential data quality issues, such as high ratios of missing or imbalanced data, infinite, unique, or constant values, skewness, high correlation, high cardinality, non-stationarity, seasonality, duplicate records, and other inconsistencies. The source code, documentation, and examples are available in the GitHub repository: https://github.com/ydataai/ydata-profiling.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 10
Documents
We could not find any documents associated to the publication.
Related Publications

Of the same journal

The vitality of pattern recognition and image analysis (2015)
Another Publication in an International Scientific Journal
Luisa Mico; Joao M Sanches; Jaime S Cardoso
The vitality of pattern recognition and image analysis (2015)
Article in International Scientific Journal
Micó, L; Sanches, JM; Jaime S Cardoso
QIDLEARNINGLIB: A Python library for quasi-identifier recognition and evaluation (2025)
Article in International Scientific Journal
Simoes, SA; João P. Vilela; Santos, MS; Abreu, PH
Pycol: A Python package for dataset complexity measures (2025)
Article in International Scientific Journal
Apóstolo, D; Santos, MS; Lorena, AC; Pedro Henriques Abreu
Pre-processing approaches for imbalanced distributions in regression (2019)
Article in International Scientific Journal
Branco, P; Torgo, L; Rita Ribeiro

See all (25)

Recommend this page Top
Copyright 1996-2026 © Faculdade de Farmácia da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2026-02-24 at 09:37:49 | Privacy Policy | Personal Data Protection Policy | Whistleblowing | Electronic Yellow Book