
Foundations of Data Science

Code: CC4072     Acronym: CC4072     Level: 400

Keywords
Classification Keyword
Official Computer Science

Instance: 2025/2026 - 1S

Active? Yes
Web Page: https://moodle2526.up.pt/course/view.php?id=7126
Responsible unit: Department of Computer Science
Course/CS Responsible: Master in Computer Science

Cycles of Study/Courses

Acronym  No. of Students  Study Plan                                Curricular Years  Credits UCN  Credits ECTS  Contact hours  Total Time
M:CC     21               Study plan since academic year 2025/2026  1                 -            6             42             162
M:ENSI   4                Official study plan since 2025/2026       1                 -            6             42             162
M:ERSI   11               Official Study Plan since 2021_M:ERSI     1                 -            6             42             162

Teaching Staff - Responsibilities

Teacher Responsibility
Inês de Castro Dutra

Teaching - Hours

Theoretical and practical: 3.23
Type                       Teacher               Classes  Hours
Theoretical and practical  Totals                1        3.231
                           Inês de Castro Dutra           3.231

Teaching language

Suitable for English-speaking students

Objectives

The main objectives of this unit are to introduce the main data science methodologies and to convey knowledge of programming and of tools for data processing and analysis, such as the Python language.

Learning outcomes and competences

This unit should provide students with:
1. theoretical competences in several basic data science methodologies.
2. competences in developing software for data science tasks.
3. practical competences in applying data science techniques to specific problems.

Working method

In person

Program

 
1. Introduction to Data Science
• the CRISP-DM model
• data, models and patterns
• data science tasks

2. Data Pre-Processing
• importing data
• cleaning data
• transforming and creating variables
• dimensionality reduction techniques

3. Exploring and Visualizing Data
• data summarization
• data visualization

4. Descriptive Models
• clustering methods: partitional methods, hierarchical methods
• association rules

5. Predictive Models
• classification and regression tasks
• evaluation metrics
• linear regression, naive Bayes, k-nearest neighbours
• tree-based models: classification and regression trees, pruning methods 
• neural networks and deep learning
• support vector machines
• ensembles: bagging, random forests, boosting, AdaBoost, XGBoost

6. Methodologies for Evaluating and Comparing Models
• evaluation measures
• estimation methods
• significance tests


Mandatory literature

Pang-Ning Tan; Introduction to data mining. ISBN: 9780321420527
Charu C. Aggarwal; Data mining. ISBN: 978-3-319-14142-8
Jiawei Han; Data mining. ISBN: 978-0-12-381479-1

Complementary Bibliography

Peter Flach; Machine learning. ISBN: 978-1-107-42222-3
Andriy Burkov; The Hundred-Page Machine Learning Book, 2019. ISBN: 978-1999579500
Luís Torgo; Data mining with R. ISBN: 978-1-4398-1018-7

Teaching methods and learning activities

Lectures consist of an oral presentation of the syllabus topics, illustrated with concrete data mining case studies.

Keywords

Technological sciences > Technology > Information technology
Physical sciences > Computer science > Modelling tools
Physical sciences > Computer science > Informatics > Applied informatics
Technological sciences > Technology > Computer technology > Software technology

Evaluation Type

Distributed evaluation with final exam

Assessment Components

Designation                                     Weight (%)
Exam                                            70.00
Presentation/discussion of a scientific work    30.00
Total:                                          100.00

Amount of time allocated to each course unit

Designation                                     Time (hours)
Independent study                               84.00
Presentation/discussion of a scientific work    36.00
Class attendance                                42.00
Total:                                          162.00

Eligibility for exams



Calculation formula of final grade

The course evaluation consists of a final exam and a practical assignment at the end of the semester. The assignment will be evaluated through an oral presentation.

The final grade (NF) is the weighted average of the theoretical (exam) and practical (assignment) grades, given by the formula:

NF = 0.7 * Exam + 0.3 * Assignment
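
As an illustration only, the weighting above can be sketched in Python, the language used in the course. The function name `final_grade` and the example grades are hypothetical and not part of the official course material. The sketch assumes both grades are expressed on the same scale.

```python
def final_grade(exam: float, assignment: float) -> float:
    """Return NF = 0.7 * Exam + 0.3 * Assignment.

    Assumption: both grades are given on the same scale.
    """
    return 0.7 * exam + 0.3 * assignment


# Hypothetical grades, chosen only to show the arithmetic.
nf = final_grade(exam=14.0, assignment=16.0)
print(round(nf, 2))
```

Because the exam carries the larger weight, a one-point change in the exam grade moves NF by 0.7 points, while a one-point change in the assignment grade moves it by 0.3 points.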

Examinations or Special Assignments

The practical assignment will be announced at the beginning of the semester and should be completed by the end of the semester.

Special assessment (TE, DA, ...)

Assessment in the special period follows the same scheme as the continuous assessment, with a final exam. The student takes the exam, which is worth 70%, and may optionally present a project to complete the remaining 30%. This project must be delivered and presented within the calendar for the special season.

Classification improvement

The grade of the practical assignment is not subject to improvement.

Students can improve the theoretical grade by taking the appeal (recurso) exam.

Observations

All provided material (e.g. slides, recommended books) is in English and, if there are foreign students, the classes will also be given in English.

The course material will be made available on the corresponding Moodle page.

Copyright 1996-2025 © Faculdade de Ciências da Universidade do Porto
Page created on: 2025-11-21 at 23:25:28