Machine Learning
Keywords |
Classification |
Keyword |
OFICIAL |
Artificial Intelligence |
Instance: 2022/2023 - 1S
Cycles of Study/Courses
Acronym |
No. of Students |
Study Plan |
Curricular Years |
Credits UCN |
Credits ECTS |
Contact hours |
Total Time |
M.EIC |
152 |
Syllabus |
1 |
- |
6 |
52 |
162 |
Teaching language
English
Objectives
With the increasing digitization of their processes, organizations (companies, government, etc.) now feel the need to extract knowledge from this data to improve the efficiency and effectiveness of these processes (eg to gain competitive advantage). To this end, organizations need to acquire technical skills to develop solutions based on standard approaches to Machine Learning (ML) and Data Mining (DM), but also scientific skills for developing innovative solutions to problems where these standard approaches do not exist.
Thus, the goals of this course are:
- Motivate for the use of ML / DM techniques in decision support.
- Develop the ability to properly utilize these techniques for automatic analysis of large amounts of data.
- Develop the ability to undertake scientific research to develop new approaches to ML / DM.
Percentage Distribution
- Scientific component: 70%
- Technological component: 30%
Learning outcomes and competences
Students should be able to
- Understand the different types of machine learning & data mining (ML&DM) tasks.
- Identify decision support problems that can be represented as ML&DM tasks.
- Understand the phases of a ML&DM project.
- Know the main methods / algorithms for each ML&DM task type and understand the basics of their behaviour.
- Apply these methods to decision support problems.
- Evaluate the results of a ML&DM project.
- Identify opportunities for developing new approaches to ML / DM.
- Develop simple but appropriate scientific work to create new approaches to ML / DM.
Working method
Presencial
Pre-requirements (prior knowledge) and co-requirements (common knowledge)
Although no particular course is required, it is useful to have basic knowledge of:
- statistics
- algorithms
- artificial intelligence
Program
- Introduction to Machine Learning and Data Mining.
- DM Projects: DM methodologies and data preparation.
- Classification: introduction, evaluation (measures and methodologies) and algorithms (rule-, distance- and kernel-based methods; Bayesian methods). Scoring with classification models: approach and evaluation. Common classification issues (unbalanced class distribution and costs).
- Regression: introduction, evaluation (measures; compromise between bias and variance) and algorithms.
- Clustering: Partition (revision of K-means, K -medoids), density and hierarchical algorithms. Evaluation measures.
- Frequent Pattern Discovery: Frequent item set algorithms (APRIORI, Eclat, FP-Growth) and association rules. Evaluation measures (Support, trust, lift, ...). Other types of patterns: sequences and graphs.
- Recommendation systems: introduction, evaluation (measures and methodologies) and algorithms (content based, collaborative filtering, specialized systems).
- Advanced ML algorithms: ensembles, deep learning.
- Anomaly detection: introduction, algorithms.
- Automated machine learning (autoML) and meta learning.
Mandatory literature
João Moreira, Andre Carvalho, Tomás Horvath;
Data Analytics: A General Introduction, Wiley, 2018. ISBN: 978-1-119-29626-3 (https://www.wiley.com/en-aw/A+General+Introduction+to+Data+Analytics-p-9781119296263)
Charu C. Aggarwal;
Data mining. ISBN: 978-3-319-14142-8
Complementary Bibliography
Ian H. Witten, Eibe Frank;
Data mining. ISBN: 1-55860-552-5
Peter Flach;
Machine Learning: The Art and Science of Algorithms that Make Sense of Data, Cambridge University Press, 2012. ISBN: 9781107422223 (http://www.cs.bris.ac.uk/~flach/mlbook/)
Mohammed Zaki and Wagner Meira Jr.;
Data Mining and Analysis: Fundamental Concepts and Algorithms, Cambridge University Press, 2013. ISBN: 9780521766333 (http://www.dcc.ufmg.br/miningalgorithms/DokuWiki/doku.php)
Jiawei Han, Micheline Kamber;
Data mining. ISBN: 1-55860-489-8
Max Kuhn, Kjell Johnson;
Applied Predictive Modeling, Springer New York, 2013. ISBN: 9781461468493
Teaching methods and learning activities
- Theoretical classes and individual study for exposition of concepts.
- Laboratory sessions and data mining project for practical application and consolidation of learned concepts.
Software
Rapid Miner
The R Project for Statistical Computing
Python
Evaluation Type
Distributed evaluation with final exam
Assessment Components
Designation |
Weight (%) |
Exame |
50,00 |
Participação presencial |
0,00 |
Trabalho prático ou de projeto |
50,00 |
Total: |
100,00 |
Amount of time allocated to each course unit
Designation |
Time (hours) |
Estudo autónomo |
60,00 |
Frequência das aulas |
42,00 |
Trabalho laboratorial |
60,00 |
Total: |
162,00 |
Eligibility for exams
The distributed assessment consists of:
In case of a missing component of the evaluation, the respective grade is 0 (zero) values.
Students with worker or equivalent status, who are exempt from class attendance should, at regular intervals to be defined with the teachers, present the progress of their work, as well as do the scheduled presentations together with the regular students.
Calculation formula of final grade
0,5 * DM project + 0,5 * exam
Minimum grade in each componente: 7.0 (out of 20)
Examinations or Special Assignments
The DM project will be developed in groups of 3 students and consists of analyzing a data set and preparing a presentation that describes and discusses both the project and the results obtained.
Special assessment (TE, DA, ...)
Students who are exempt from class attendance must complete all assessment components and should contact the teacher to make any necessary adjustments to the process.
Classification improvement
Grade improvement may be done for the exam in the special season (
recurso) of the year in which the student is approved.
For components which no grade improvement has been done in the year in which the student is approved, improvement may be made in one or more of the components in the following year, during the regular or special season.