| Code: | 2MADSAD01 | Acronym: | ECD I |
| Keywords | |
|---|---|
| Classification | Keyword |
| OFICIAL | Information Technology |
| Active? | Yes |
| Web Page: | http://moodle.up.pt/course/view.php?id=480 |
| Responsible unit: | Agrupamento Científico de Matemática e Sistemas de Informação |
| Course/CS Responsible: | Master in Modeling, Data Analysis and Decision Support Systems |
| Acronym | No. of Students | Study Plan | Curricular Years | Credits UCN | Credits ECTS | Contact hours | Total Time |
|---|---|---|---|---|---|---|---|
| MADSAD | 41 | Bologna Official Syllabus | 1 | - | 7,5 | 56 | 202,5 |
At the end of the semester students should have the knowledge of various Data Mining tasks, the main methods and algorithms for each task, be able to apply these methods to new specific data analysis problems and have the capacity to evaluate, apply a critical posture in relation to results.
Knowledge of various Data Mining tasks, the main methods and algorithms for each task, be able to apply these methods to new specific data analysis problems and have the capacity to evaluate, apply a critical posture in relation to results.
- Machine learning, data mining. - From OTLP to OLAP. Multidimensional databases. - Knowledge: Representation. Generalization and specialization. - Data: Examples and instances of concepts. Attributes and values. Signal and noise. Multi-relational representations. Types of attributes. Data formats for data mining systems. Exploratory data analysis. - Distance-based methods. Algorithm k-NN and its properties. - Probabilistic methods. Bayesian classifiers. - Methods based on search. Decision trees and rules. Covering algorithm. - Evaluation of classification models. Costs. Overfitting. - Pre-processing: Feature selection, discretization, dealing with unknown values and outliers. - Advanced topics of classification. - Regression. Evaluation of regression models. - Frequent patterns, association rules. - Cluster analysis. - Methods for combination of models. Voting methods. Methods based on samples. Hierarchical combination of models. Data Mining tools. - Methodologies of data mining projects (CRISP-DM).
Theoretical and practical classes
| Designation | Weight (%) |
|---|---|
| Exame | 50,00 |
| Participação presencial | 0,00 |
| Trabalho escrito | 50,00 |
| Total: | 100,00 |
Exam 50% Practical work 50% The mark of practical work is calculated as the mean of practical works carried out. The practical work not done has a mark of 0. The minimum mark of the exam is 6.5. The minimum mark of the practical work is 6.5