Resumo (PT):
Abstract (EN):
This paper addresses the problem of data mining in Inductive Logic Programming (ILP) motivated by its application in the domain of economics.
ILP systems have been largely applied to data mining classification tasks with a considerable success. The use of ILP systems in regression tasks has been far less successful. Current systems have very limited numerical reasoning capabilities, which limits the application of ILP to discovery of functional relationships of numeric nature.
This paper proposes improvements in numerical reasoning capabilities of ILP systems for dealing with regression tasks. It proposes the use of statistical-based techniques like Model Validation and Model Selection to improve noise handling and it introduces a new search stopping criterium inspired in the PAC learning framework.
We have found these extensions essential to improve on results over machine learning and statistical-based algorithms used in the empirical evaluation study.
Idioma:
Inglês
Tipo (Avaliação Docente):
Científica
Nº de páginas:
15
Tipo de Licença: