Knowledge Extraction
Keywords |
Classification |
Keyword |
OFICIAL |
Artificial Intelligence |
Instance: 2007/2008 - 1S
Cycles of Study/Courses
Teaching language
Portuguese
Objectives
To provide the students with knowledge so that they can use analysis and extraction techniques of large data quantities’ patterns.
Program
Introduction to the knowledge extraction: Concept of Data Mining; Data Mining and Knowledge Discovery process.
Data preparation: Data cleaning; Data Normalization, Reduction and Discretization.
Association Rules: Definition of the association rules research problem. Quality measures of the association rules. Some research algorithms of association rules.
Clustering: Clustering Techniques. Partition clustering algorithms (K-means, K-medoids) and Hierarchical clustering quality. Other algorithms: BIRCH, CURE, DBSCAN.
Web Mining: Data Mining concepts on the Web; Information research on the Web; Research usage patterns on the Web; Structure analysis and research on the Web.
Text Data Mining: definition, application techniques and examples.
Classification: Classification techniques for the analysis of large data quantities; Decision Trees;
Classification and Regression Trees (CART); Pruning principles; Bayesian Classification. Inductive Logic Programming.
Data Display: Display Techniques for the identification of patterns or exceptions in large data quantities.
PKDD: Parallel Knowledge Discovery in Databases – Parallel Processing Techniques for the extraction of patterns in large data quantities.
KDD Applications.
Mandatory literature
Han, Jiawei;
Data mining. ISBN: 1-55860-489-8
Teaching methods and learning activities
Theoretical classes: Exposition of theoretical concepts.
Practical classes: Exercise resolution, discussion of themes presented in the theoretical classes and help on the practical assignments.
Evaluation Type
Distributed evaluation with final exam
Eligibility for exams
The average grade of the distributed evaluation component (assignment and article presentation) must be equal or superior to 6 marks.
Calculation formula of final grade
0.35* Assignment Grade + 0.15* Article Presentation + 0.5* Exam Grade
Special assessment (TE, DA, ...)
The students dismissed from the practical classes must do the practical assignment and the article presentation.
Classification improvement
The distributed classification improvement can only be done in the following year.