| Code: | CINF045 | Acronym: | RI |
| Keywords | |
|---|---|
| Classification | Keyword |
| Official | Information Science |
| Active? | Yes |
| Web Page: | https://moodle.up.pt/course/view.php?id=1890 |
| Responsible unit: | Department of Informatics Engineering |
| Course/CS Responsible: | Bachelor of Arts in Information Science |
| Acronym | No. of Students | Study Plan | Curricular Years | Credits UCN | Credits ECTS | Contact hours | Total Time |
|---|---|---|---|---|---|---|---|
| CINF | 37 | Study plan | 3 | - | 6 | 41 | 162 |
The "Information Retrieval" unit assumes as its context the existence of large collections of information and the need for methods and tools for information retrieval on large heterogeneous collections.
Goals
1. Make students aware of the difference between structured and unstructured information, and between documents that have associated descriptions and those that do not.
2. Familiarize students with the main concepts of textual information retrieval and their application in retrieval tools.
3. Use well-established methods in information retrieval to evaluate retrieval tools.
On completion of this course, the student should be able to:
- Identify information retrieval tasks performed with specific tools or embedded in services;
- Describe retrieval tools and their components;
- Distinguish the classical information retrieval models, identifying their principles, document models and similarity measures;
- Clearly separate the indexing and search modules in information retrieval tools;
- Perform web retrieval tasks using advanced search modes;
- For a given document collection and retrieval task, create an appropriate document model and specify automatic methods for processing the documents;
- Calculate several reference measures for evaluating retrieval systems;
- Participate in retrieval evaluation efforts, providing relevance judgements for selected topics;
- Describe web information retrieval, in particular with respect to document diversity and authority estimation;
- Relate textual information retrieval to its extensions to voice and image, identifying the open problems.
Information retrieval and its tasks. Information retrieval versus data retrieval. The evolution of information retrieval. The information retrieval process. General features of retrieval systems.
Information retrieval models. Boolean model, vector space model, probabilistic model.
Processing documents and queries: lexical analysis, stemming, compression. Index construction.
Retrieval using indexes. Term weights and document ranking.
Web information retrieval. Crawling and indexing. Link analysis.
Evaluation in retrieval systems. Test collections, topics and relevance judgements.
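As an illustration of the indexing and ranking topics listed above, the following is a minimal sketch of an inverted index with Boolean AND retrieval and a simple tf-idf ranking. It is not course material: the function names, the whitespace tokenizer, the toy documents and the particular weighting (raw term frequency times `log(N / df)`, summed over query terms, without length normalization) are all assumptions chosen for brevity.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    # Assumption: lowercase + whitespace split stands in for lexical analysis.
    return text.lower().split()


def build_index(docs):
    # Inverted index: term -> {doc_id: term frequency in that document}.
    index = defaultdict(dict)
    for doc_id, text in docs.items():
        for term, tf in Counter(tokenize(text)).items():
            index[term][doc_id] = tf
    return index


def boolean_and(index, query):
    # Boolean model: documents that contain every query term.
    postings = [set(index.get(term, {})) for term in tokenize(query)]
    return set.intersection(*postings) if postings else set()


def tfidf_rank(index, n_docs, query):
    # Simplified ranking: sum of tf * idf over the query terms.
    scores = defaultdict(float)
    for term in tokenize(query):
        postings = index.get(term, {})
        if not postings:
            continue
        idf = math.log(n_docs / len(postings))
        for doc_id, tf in postings.items():
            scores[doc_id] += tf * idf
    # Highest score first; ties broken by document id.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical toy collection.
docs = {
    "d1": "information retrieval models",
    "d2": "web information retrieval and link analysis",
    "d3": "data retrieval versus information retrieval",
}
index = build_index(docs)
matches = boolean_and(index, "web retrieval")
ranking = tfidf_rank(index, len(docs), "web retrieval")
```

In this toy collection the term "retrieval" occurs in every document, so its idf is zero and only the rarer term "web" affects the ranking.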
Plenary lectures are used to present the course subjects, to discuss selected topics and to hold workshop sessions on the results of student projects. The practical classes are used for small exercises that apply the concepts and techniques introduced in the course. Students present their projects in scheduled sessions at the end of the semester.
The evaluation of the unit includes practical work in the form of an information retrieval evaluation project. The project involves running searches, making relevance judgements on the results and calculating retrieval measures.
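The measures calculated in such a project are not listed here; as a hedged example, precision, recall and average precision are common choices, and the sketch below shows how they could be computed from a ranked result list and a set of relevance judgements. The function names, document identifiers and judgements are hypothetical.

```python
def precision_recall(retrieved, relevant):
    # Precision: fraction of retrieved documents that are relevant.
    # Recall: fraction of relevant documents that were retrieved.
    retrieved_set = set(retrieved)
    hits = len(retrieved_set & relevant)
    precision = hits / len(retrieved_set) if retrieved_set else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


def average_precision(ranking, relevant):
    # Mean of the precision values at each rank where a relevant
    # document appears, divided by the total number of relevant documents.
    hits = 0
    total = 0.0
    for rank, doc_id in enumerate(ranking, start=1):
        if doc_id in relevant:
            hits += 1
            total += hits / rank
    return total / len(relevant) if relevant else 0.0


# Hypothetical results for one topic: ranked output of a search tool
# and the documents judged relevant by the assessors.
ranking = ["d3", "d1", "d7", "d2", "d5"]
relevant = {"d1", "d2", "d9"}

precision, recall = precision_recall(ranking, relevant)
ap = average_precision(ranking, relevant)
```

Here two of the five retrieved documents are relevant and one relevant document is never retrieved, so precision is 2/5, recall is 2/3 and average precision is (1/2 + 2/4) / 3 = 1/3.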
| Designation | Weight (%) |
|---|---|
| Exam | 40.00 |
| Class participation | 10.00 |
| Written work | 10.00 |
| Laboratory work | 40.00 |
| Total: | 100.00 |
| Designation | Time (hours) |
|---|---|
| Project development | 50.00 |
| Independent study | 50.00 |
| Class attendance | 39.00 |
| Presentation/discussion of a scientific paper | 10.00 |
| Written work | 13.00 |
| Total: | 162.00 |
Minimum grades required to pass the course:
50% in the practical evaluation (mini-project)
40% in the exam
Attendance at tutorial and practical sessions is recorded. Class attendance is not mandatory, but credit for the exercises and the course dossier can only be earned by attending.
Ordinary students
Mark = round( 40% * exam + 10% * practical exercises + 40% * mini-project + 10% * participation in tutorial sessions)
Students with special "non-attendance" status
Mark = round( 50% * exam + 50% * mini-project )
The final mini-project mark may differ between members of the same group by plus or minus 2 points, based on the internal assessment carried out by each group.
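The two mark formulas above can be worked through as in the following sketch. It assumes that all components are expressed on the same scale and that `round` means rounding halves up; neither is stated in the source, and the function names and sample component marks are hypothetical. The minimum-grade checks and the per-member project adjustment are left out.

```python
import math


def round_half_up(value):
    # Assumption: "round" means halves go up (Python's built-in round()
    # rounds halves to the nearest even integer instead).
    return math.floor(value + 0.5)


def ordinary_mark(exam, exercises, mini_project, participation):
    # 40% exam + 10% practical exercises + 40% mini-project + 10% participation.
    weighted = 40 * exam + 10 * exercises + 40 * mini_project + 10 * participation
    return round_half_up(weighted / 100)


def non_attendance_mark(exam, mini_project):
    # 50% exam + 50% mini-project.
    weighted = 50 * exam + 50 * mini_project
    return round_half_up(weighted / 100)


# Hypothetical component marks on a common scale.
ordinary = ordinary_mark(exam=12, exercises=15, mini_project=14, participation=18)
special = non_attendance_mark(exam=12, mini_project=14)
```

With these sample values the ordinary formula gives 4.8 + 1.5 + 5.6 + 1.8 = 13.7, which rounds to 14, and the non-attendance formula gives 6 + 7 = 13.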
None. All students have to complete the projects and present them as scheduled.