Go to:
Logótipo
Comuta visibilidade da coluna esquerda
Você está em: Start > Publications > View > Distributed generative data mining
Publication

Publications

Distributed generative data mining

Title
Distributed generative data mining
Type
Article in International Conference Proceedings Book
Year
2007
Authors
Ruy Ramos
(Author)
FEUP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications Without AUTHENTICUS Without ORCID
Rui Camacho
(Author)
FEUP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page
Conference proceedings International
Pages: 307-317
7th Industrial Conference on Data Mining (ICDM 2007)
Leipzig, Alemanha, Julho 14-18 de 2007
Indexing
Publicação em ISI Web of Science ISI Web of Science
Publicação em Scopus Scopus
INSPEC
Scientific classification
FOS: Engineering and technology > Electrical engineering, Electronic engineering, Information engineering
CORDIS: Technological sciences > Engineering > Computer engineering
Other information
Abstract (EN): A process of Knowledge Discovery in Databases (KDD) involving large amounts of data requires a considerable amount of computational power. The process may be done on a dedicated and expensive machinery or, for some tasks, one can use distributed computing techniques on a network of affordable machines. In either approach it is usual the user to specify the workflow of the sub-tasks composing the whole KDD process before execution starts. In this paper we propose a technique that we call Distributed Generative Data Mining. The generative feature of the technique is due to its capability of generating new sub-tasks of the Data Mining analysis process at execution time. The workflow of sub-tasks of the DM is, therefore, dynamic. To deploy the proposed technique we extended the Distributed Data Mining system HARVARD and adapted an Inductive Logic Programming system (IndLog) used in a Relational Data Ming task. As a proof-of-concept, the extended system was used to analyse an artificial dataset of a credit scoring problem with eighty million records.
Language: English
Type (Professor's evaluation): Scientific
Contact: Rui Camacho
No. of pages: 11
License type: Click to view license CC BY-NC
Documents
File name Description Size
icdm2007 Distributed Generative Data Mining 146.52 KB
Related Publications

Of the same authors

Utilização de XML numa plataforma de Data Mining distribuído (2007)
Article in International Conference Proceedings Book
Ruy Ramos; Carlos Adriano Gonçalves; Rui Camacho
A step up with the HARVARD system: the HARVARD-g system (2007)
Article in International Conference Proceedings Book
Ruy Ramos; Rui Camacho
Recommend this page Top
Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2025-08-14 at 09:55:41 | Privacy Policy | Personal Data Protection Policy | Whistleblowing