Learning decision trees from dynamic data streams

Title
Learning decision trees from dynamic data streams
Type
Article in International Scientific Journal
Year
2005
Authors
João Gama
(Author)
FEP
Pedro Medas
(Author)
Other
Journal
Vol. 11
Pages: 1353-1366
ISSN: 0948-695X
Scientific classification
FOS: Natural sciences
CORDIS: Technological sciences
Other information
Authenticus ID: P-000-65H
Abstract (EN): This paper presents a system for the induction of a forest of functional trees from data streams that is able to detect concept drift. The Ultra Fast Forest of Trees (UFFT) is an incremental algorithm that works online, processing each example in constant time and performing a single scan over the training examples. It uses analytical techniques to choose the splitting criteria, and information gain to estimate the merit of each possible splitting test. For multi-class problems, the algorithm builds a binary tree for each possible pair of classes, leading to a forest of trees. Decision nodes and leaves contain naive-Bayes classifiers that play different roles during the induction process. Naive-Bayes classifiers in leaves are used to classify test examples. Naive-Bayes classifiers in inner nodes play two different roles: they can be used as multivariate splitting tests if chosen by the splitting criteria, and they are used to detect changes in the class distribution of the examples that traverse the node. When a change in the class distribution is detected, the entire sub-tree rooted at that node is pruned. The use of naive-Bayes classifiers at leaves to classify test examples, the use of splitting tests based on the outcome of naive-Bayes, and the use of naive-Bayes classifiers at decision nodes to detect changes in the distribution of the examples are all obtained directly from the sufficient statistics required to compute the splitting criteria, without any additional computation. This is a major advantage in the context of high-speed data streams. The methodology was tested with artificial and real-world data sets. The experimental results show very good performance in comparison to a batch decision tree learner, and a high capacity to detect drift in the distribution of the examples.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 14
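
As a rough illustration of two ideas named in the abstract, the sketch below computes the information gain of a binary splitting test from per-class counts and enumerates the class pairs that would each receive their own binary tree. It is not the authors' implementation. All function names are hypothetical, and the routing rule in `trees_for_example` (an example only updates the trees whose pair contains its class) is an assumption that the abstract does not state.

```python
import math
from itertools import combinations


def entropy(counts):
    """Shannon entropy (in bits) of a vector of per-class counts."""
    total = sum(counts)
    if total == 0:
        return 0.0
    return -sum(c / total * math.log2(c / total) for c in counts if c > 0)


def information_gain(left_counts, right_counts):
    """Gain of a binary split, given per-class counts on each side.

    The counts play the role of sufficient statistics: they can be
    updated one example at a time, so no example has to be stored.
    """
    parent = [l + r for l, r in zip(left_counts, right_counts)]
    n = sum(parent)
    if n == 0:
        return 0.0
    n_left, n_right = sum(left_counts), sum(right_counts)
    children = (n_left / n) * entropy(left_counts) \
        + (n_right / n) * entropy(right_counts)
    return entropy(parent) - children


def class_pairs(classes):
    """One binary problem (one tree) per unordered pair of classes."""
    return list(combinations(sorted(classes), 2))


def trees_for_example(label, pairs):
    """Assumed routing rule: train only the trees whose pair holds the label."""
    return [pair for pair in pairs if label in pair]


if __name__ == "__main__":
    pairs = class_pairs(["a", "b", "c"])
    print(pairs)
    print(trees_for_example("a", pairs))
    print(information_gain([5, 0], [0, 5]))
```

With k classes this decomposition yields k(k-1)/2 binary trees, so a three-class problem produces a forest of three trees.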
Related Publications

Of the same authors

Learning in Dynamic Environments: Decision Trees for Data Streams (2004)
Article in International Conference Proceedings Book
João Gama; Pedro Medas

Of the same journal

Selected papers from SBLP 2007: The 11th Brazilian Symposium on Programming Languages J.UCS special issue (2007)
Another Publication in an International Scientific Journal
Bigonha, RS; Musicante, MA; Pardo, A; Garcia, A; Martini, A; Moreira, AF; De Melo, ACV; Du Bois, AR; Santos, A; Camarao, C; Rubira, C; Braga, C; Naumann, D; Haeusler, EH; De Carvalho Junior, FH; Cafezeiro, I; Palsberg, J; Jeuring, J; Saraiva, J; Guimaraes, J... (plus 24 more authors)
Performance Management in Collaborative Networks: a Methodological Proposal (2011)
Article in International Scientific Journal
Ferreira, RP; Silva, JN; Strauhs, FDR; António Lucas Soares
Orchestration of E-Learning Services for Automatic Evaluation of Programming Exercises (2012)
Article in International Scientific Journal
Ricardo Queiros; Jose Paulo Leal
On pipelining sequences of data-dependent loops (2007)
Article in International Scientific Journal
Rui M. M. Rodrigues; João M. P. Cardoso
HC plus: Towards a Framework for Improving Processes in Health Organizations by Means of Security and Data Quality Management (2012)
Article in International Scientific Journal
Caballero, I; Enrique Sanchez, LE; Freitas A; Fernandez Medina, E
