Information Storage and Retrieval I
Instance: 2009/2010 - 1S
Cycles of Study/Courses
Teaching language
Portuguese
Objectives
On completion of this course, te student should be able to:
-Identify information retrieval tasks performed with specific tools or embedded in services;
-Describe the retrieval tools and their components;
-Distinguish classical information retrieval models, identifying their principles, document models and similarity measures;
-Use text-processing software to index documents and anticipate their results;
-Clearly separate the indexing and search modules in information retrieval tools;
-Perform web retrieval tasks using advanced search modes;
-For a document collection and a retrieval task, create an appropriate document model and specify automatic methods for processing the documents;
-Calculate several reference measures for evaluating retrieval systems;
-Participate in retrieval evaluation efforts, providing relevance judgements for selected topics;
-Relate textual information retrieval with its extensions to voice and image, identifying the open problems.
Program
Information storage and retrieval and its tasks. Information retrieval versus data retrieval. The evolution of information retrieval. The information retrieval process. General features of retrieval systems.
Information retrieval models. Boolean model, vectorial model, probabilistic model.
Processing documents and queries: lexical analysis, stemming, compression. Index construction.
Retrieval using indexes. Term weights and document ranking.
Web information retrieval. Crawling and indexing. Link analysis.
Evaluation in retrieval systems. Test collections, topics and relevance judgements.
Mandatory literature
Baeza-Yates, Ricardo;
Modern information retrieval. ISBN: 0-201-39829-X
Manning, Christopher D.;
Introduction to information retrieval. ISBN: 978-0-521-86571-5
Teaching methods and learning activities
Tutorial classes are accompanied by practical sessions, using selected software. Students present their projects in the scheduled sessions.
Evaluation Type
Distributed evaluation with final exam
Assessment Components
| Description |
Type |
Time (hours) |
Weight (%) |
End date |
| Attendance (estimated) |
Participação presencial |
46,00 |
|
|
| Project: Evaluation of Web Retrieval |
Trabalho escrito |
30,00 |
|
2009-12-11 |
| Practical Exercises |
Teste |
30,00 |
|
2009-12-17 |
|
Total: |
- |
0,00 |
|
Amount of time allocated to each course unit
| Description |
Type |
Time (hours) |
End date |
| Dossier maintenance |
Estudo autónomo |
30 |
2009-12-17 |
| Study |
Estudo autónomo |
53 |
2010-01-08 |
|
Total: |
83,00 |
|
Eligibility for exams
50% in practical evaluation
Calculation formula of final grade
Mark = round(50% * exam + 10% * practical exercises + 30% * mini-project + 10% * course dossier)
Examinations or Special Assignments
Group presentation of projects.
All students have to complete the projects and present them as scheduled.