Information Storage and Retrieval I
Instance: 2011/2012 - 1S 
Cycles of Study/Courses
Teaching language
Portuguese
Objectives
The Information Storage and Retrieval I unit takes as its context the existence of large, heterogeneous collections of information and the need for methods and tools to retrieve information from them.
Goals
1. Make students aware of the difference between structured and unstructured information, and between documents with and without associated descriptions.
2. Familiarize students with the main concepts of textual information retrieval and their application in retrieval tools.
3. Use well-established methods in information retrieval to evaluate retrieval tools.
On completion of this course, the student should be able to:
-Identify information retrieval tasks performed with specific tools or embedded in services;
-Describe the retrieval tools and their components;
-Distinguish classical information retrieval models, identifying their principles, document models and similarity measures;
-Clearly separate the indexing and search modules in information retrieval tools;
-Perform web retrieval tasks using advanced search modes;
-For a document collection and a retrieval task, create an appropriate document model and specify automatic methods for processing the documents;
-Calculate several reference measures for evaluating retrieval systems;
-Participate in retrieval evaluation efforts, providing relevance judgements for selected topics;
-Relate textual information retrieval with its extensions to voice and image, identifying the open problems.
Program
Information storage and retrieval and its tasks. Information retrieval versus data retrieval. The evolution of information retrieval. The information retrieval process. General features of retrieval systems.
Information retrieval models. Boolean model, vector model, probabilistic model.
Processing documents and queries: lexical analysis, stemming, compression. Index construction.
Retrieval using indexes. Term weights and document ranking.
Web information retrieval. Crawling and indexing. Link analysis.
Evaluation in retrieval systems. Test collections, topics and relevance judgements.
Mandatory literature
Ricardo Baeza-Yates, Berthier Ribeiro-Neto; Modern Information Retrieval: The Concepts and Technology behind Search (2nd Edition), Addison-Wesley Professional, 2011. ISBN: 978-0321416919
Manning, Christopher D.; Introduction to Information Retrieval. ISBN: 978-0-521-86571-5
Teaching methods and learning activities
Tutorial classes are accompanied by practical sessions, using selected software. Students present their projects in the scheduled sessions.
Keywords
Physical sciences > Computer science > Informatics
Evaluation Type
Distributed evaluation with final exam
Assessment Components
| Description | Type | Time (hours) | Weight (%) | End date |
|---|---|---|---|---|
| Attendance (estimated) | In-class participation | 46.00 | | |
| Project: Web Retrieval Evaluation | Written assignment | 50.00 | | 2011-12-12 |
| Exercises | Test | 20.00 | | 2011-12-12 |
| Final exam | Exam | 2.00 | | 2012-02-10 |
| | Total: | - | 0.00 | |
Amount of time allocated to each course unit
| Description | Type | Time (hours) | End date |
|---|---|---|---|
| Dossier maintenance | Independent study | 20 | 2011-12-12 |
| Study | Independent study | 51 | 2012-02-10 |
| | Total: | 71.00 | |
Eligibility for exams
Minimum grades for passing the course:
50% practical evaluation (mini-project)
50% exam
Attendance at tutorial and practical sessions is recorded. Class attendance is not mandatory, but credit for the exercises and the course dossier can only be earned by attending.
Calculation formula of final grade
Ordinary students
Mark = round( 40% * exam + 10% * practical exercises + 40% * mini-project + 10% * course dossier )
Students with special "non-attendance" status
Mark = round( 50% * exam + 50% * mini-project )
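As an illustration only, the two formulas above can be sketched in Python. The function names are hypothetical, and the sketch assumes that "round" means rounding halves upwards and that all components are graded on the same scale (the example uses 0 to 20); neither assumption is stated in the course description.

```python
from decimal import Decimal, ROUND_HALF_UP


def _round_half_up(value: Decimal) -> int:
    # Assumption: "round" in the formulas rounds halves upwards (12.5 -> 13).
    return int(value.quantize(Decimal("1"), rounding=ROUND_HALF_UP))


def _d(grade) -> Decimal:
    # Convert via str so that values such as 12.5 stay exact.
    return Decimal(str(grade))


def ordinary_mark(exam, exercises, mini_project, dossier) -> int:
    """Mark for ordinary students: 40% exam, 10% practical exercises,
    40% mini-project, 10% course dossier."""
    total = (
        Decimal("0.40") * _d(exam)
        + Decimal("0.10") * _d(exercises)
        + Decimal("0.40") * _d(mini_project)
        + Decimal("0.10") * _d(dossier)
    )
    return _round_half_up(total)


def non_attendance_mark(exam, mini_project) -> int:
    """Mark for students with non-attendance status: 50% exam, 50% mini-project."""
    total = Decimal("0.50") * _d(exam) + Decimal("0.50") * _d(mini_project)
    return _round_half_up(total)


if __name__ == "__main__":
    # Hypothetical grades, assumed to be on a 0-20 scale.
    print(ordinary_mark(exam=12, exercises=15, mini_project=14, dossier=18))  # 13.7 -> 14
    print(non_attendance_mark(exam=11, mini_project=14))                      # 12.5 -> 13
```

The sketch uses `Decimal` rather than floats so that a weighted sum landing exactly on a half is not shifted by binary rounding error. It does not check the minimum grades listed under "Eligibility for exams".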
Examinations or Special Assignments
None. All students have to complete the projects and present them as scheduled.
Special assessment (TE, DA, ...)
Special exams require that students have completed the practical evaluation in the scheduled periods.
Classification improvement
Exam grades can be improved in the available exam periods. Improving the grades for practical work requires a new enrollment in the course.