Go to:
Logótipo
You are in:: Start > B4048

Computational Analysis of Molecular Data

Code: B4048     Acronym: B4048

Keywords
Classification Keyword
OFICIAL Biology

Instance: 2016/2017 - 2S

Active? Yes
Responsible unit: Department of Biology
Course/CS Responsible: Master in Biodiversity, Genetics and Evolution

Cycles of Study/Courses

Acronym No. of Students Study Plan Curricular Years Credits UCN Credits ECTS Contact hours Total Time
M:BGE 9 Official Study Plan 1 - 3 21 81
M:GF 7 plano de estudos do Mestrado em Genética Forense a partir de 2013_2014. 1 - 3 21 81

Teaching language

English

Objectives

The aim of this curricular unit is to provide training in conceptual and practical aspects of data analysis of population genomics datasets, with emphasis on applications. This includes an introduction on the main concepts of coalescent, Bayesian, approximate Bayesian (ABC), and likelihood-based approaches. Emphasis will be on interpretation of output from statistical approaches and software programs. The main computational methods to analyze molecular data from high-throughput sequencing and to estimate demography and selection will also be taught.

Learning outcomes and competences

This curricular unit addresses current methodologies of computational analysis using sequence data. The advances in parallel, high-throughput sequencing technologies boosted the generation of sequence data, prompting for new ways of dealing with large volumes of information. In this curricular unit, focus will be put into current statistical methods that are best suited to particular questions but, more importantly, on the understanding of data types/formats, models and underlying algorithms. As the development of computational analysis methods for molecular data is rapidly growing, students will be able to cope with the newest progress in bioinformatics tools, as the curricular unit emphasizes fundamental concepts rather than particular software.

Working method

Presencial

Program


  • Overview and introduction to statistical approaches in population genetics

  • Frequentist, likelihood, and Bayesian approaches

  • The theory of coalescence

  • Estimation of effective populacional (Ne) and Approximate Bayesian Methods

  • Sequence data analysis and quality scores

  • Combining genetics and demography to assess dispersal (and detect selection)

  • Detecting selection: FST-outliers and local adaptation

  • Landscape genetics and spatial statistics

  • Analysis using high-throughput sequencing

  • RAD sequencing, Short read sequence analysis and SNP detection

  • Data mining. File formats and conversion. Fetching data through APIs.

  • Bootstrap, Jackknife and permutation methods

  • Scripting languages (Python, Perl and R)

Mandatory literature

Gascuel O; Mathematics of Evolution and Phylogeny, Oxford University Press, 2007. ISBN: 978-0199231348
Manly BJF; Randomization, Bootstrap and Monte Carlo Methods in Biology, Chapman & Hall, 2006. ISBN: 978-1584885412

Teaching methods and learning activities

Theoretical classes, laboratory work (computational analysis).

Evaluation Type

Distributed evaluation with final exam

Assessment Components

designation Weight (%)
Exame 50,00
Participação presencial 20,00
Trabalho escrito 30,00
Total: 100,00

Eligibility for exams

Conclusion of the laboratory work and respective report.

Attendance of a minimum of 50% of the theoretical course.

Calculation formula of final grade

Report of the laboratory practical work (30/100), written test on topics covered in the theoretical component of the discipline (50/100);

Attendance and participation in class (20/100).
Recommend this page Top
Copyright 1996-2025 © Faculdade de Ciências da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2025-12-09 at 14:35:22 | Privacy Policy | Personal Data Protection Policy | Whistleblowing | Electronic Yellow Book