Go to:
Esta página em português Ajuda Autenticar-se
Bandeira a meia haste - A FEUP encontra-se de luto
You are here: Start > EIC0107

Markup Languages and Document Processing

Code: EIC0107     Acronym: LAPD

Classification Keyword
OFICIAL Information Systems

Instance: 2014/2015 - 2S Ícone do Moodle

Active? Yes
Web Page: http://paginas.fe.up.pt/~jlopes/doku.php/teach/lapd/index
Responsible unit: Department of Informatics Engineering
Course/CS Responsible: Master in Informatics and Computing Engineering

Cycles of Study/Courses

Acronym No. of Students Study Plan Curricular Years Credits UCN Credits ECTS Contact hours Total Time
MIEIC 26 Syllabus since 2009/2010 4 - 6 56 162

Teaching - Hours

Recitations: 3,00
Type Teacher Classes Hour
Recitations Totals 1 3,00
João António Correia Lopes 3,00
Mais informaçõesLast updated on 2015-02-19.

Fields changed: Objectives, Fórmula de cálculo da classificação final, Bibliografia Complementar, Componentes de Avaliação e Ocupação, Programa

Teaching language

Suitable for English-speaking students



The "Markup Languages and Document Processing" unit assumes as its context the widespread use of markup languages for representing semi-structured information and the existence of standardized tools for their treatment.


1. Make the students aware of the multiple non-trivial applications of markup languages;
2. Make the students familiar with the technologies of processing and storage of semi-structured information;
3. Explore the technologies in defining an annotation language and in the development of an application.

Learning outcomes and competences

On completion of this course, the student should be able to:
- Identify the use of markup languages in documents, in data repositories and in applications;
- Create models for XML documents;
- Tell the difference between a standardised language for an application domain and the ad-hoc uses of markup languages;
- Design a markup language to support a document type or the data for an application;
- Evaluate and compare XML-based and other solutions to support application data interchange;
- Design XSL stylesheets to transform documents;
- Interpret the results of document processing with XML-based technologies;
- Use a native XML database and take advantage of its query functions;
- Compare data organisation in markup models with the relational model and translate data between both models;
- Generate a markup model for data in an application domain, store a dataset and query it;
- Compare markup languages to other document and data representations with respect to the support to data preservation along technological change;
- Relate web documents with the metadata that describes or links them;
- Compare semantic web based services with simpler approaches to resource description;
- Develop an XML-based prototype application involving the use of a dialect and document processing.


Working method


Pre-requirements (prior knowledge) and co-requirements (common knowledge)

Students must have basic skills in markup languages and technologies.


- Design of markup languages. Analysis of existing languages for different domains.
- Querying XML documents. The XPath language. The XQuery language.
- XML document transformation with XSLT. XML presentation with XSL-FO.
- XML processing in applications. The standard interfaces DOM and SAX.
- Storage of XML: native XML databases, XML storage in databases object-relational.
- Introduction to the Semantic Web. RDF, OWL, Linked Open Data.


Mandatory literature

Moller, Anders; An introduction to XML and web technologies. ISBN: 0-321-26966-7

Complementary Bibliography

Bradley, Neil; The XML companion. ISBN: 0-201-77059-8
José Carlos Ramalho, Pedro Henriques; XML & XSL. ISBN: 972-722-347-8
Harold, Elliotte Rusty; XML in a nutshell. ISBN: 0-596-00292-0
Grigoris Antoniou and Frank van Harmelen; A Semantic Web Primer, The MIT Press, 2008. ISBN: 9780262018289

Teaching methods and learning activities

Lectures include theoretical presentation of the course subjects and practical sessions where proposed research topics are discussed with the students and practical coursework is reported.




Physical sciences > Computer science > Informatics

Evaluation Type

Distributed evaluation without final exam

Assessment Components

Designation Weight (%)
Teste 30,00
Trabalho laboratorial 70,00
Total: 100,00

Amount of time allocated to each course unit

Designation Time (hours)
Elaboração de projeto 120,00
Frequência das aulas 42,00
Total: 162,00

Eligibility for exams


Calculation formula of final grade

Classification = 70% Practical assignments + 30% Questionnaires, where: Practical assignments = 20% E1 + 30% E2 + 50% E3.

The practical work will be assessed through the documentation submitted up to their due dates established in the course plan, the participation in the presentations and through the developed XML-based application.

The theoretical concepts are evaluated through the individual response to 20 minutes questionnaires that take place during
the classes.

The minimum required to pass the course is 50% in each of the deliverables and presentations of practical assessment (E1 to E3) and 40% on average of the Quiz #1 to Quiz #4 questionnaires related theoretical subjects. There is no recovery exam.

This course, given its nature, is not eligible for evaluation in a single moment, so the practical work evaluation cannot be replaced by taking an exam.

Examinations or Special Assignments

There are no special works or tests.

Internship work/project

The practical work is broken down into three components with the delivery of a small scientific paper and a presentation and discussion.

Special assessment (TE, DA, ...)

Students taking exams under special regimes are expected to previously submit the assignments required for this course.

Classification improvement

Students may improve the mark in the course's next edition.

Recommend this page Top
Copyright 1996-2022 © Faculdade de Engenharia da Universidade do Porto  I Terms and Conditions  I Accessibility  I Index A-Z  I Guest Book
Page generated on: 2022-06-25 at 15:00:59