Abstract (EN):
With the emergence of an information-oriented society, it soon became clear that the massive amount of information being generated required effective ways of indexing and searching. As early as the 1950s, researchers sought ways to implement information retrieval systems. These systems, and in particular text retrieval systems, have evolved considerably and have become part of our daily lives. The fact that virtually all of the internet's text content is now searchable and accessible in less than half a second is paradigmatic of this. The natural next step was to index multimedia content in addition to text. However, multimedia content introduces additional problems to the indexing task. The large amount of information and the complexity of its relations are factors that dramatically increase the difficulty of achieving highly successful indexing and searching results. For instance, until recently, devising a system that could automatically detect and identify people in a complex scene, track them across multiple cameras, and analyse their behaviour in real time would have been far too arduous a task. Though such a system has not yet been fully realized, many recent advances, mostly in computer vision and machine learning, bring us much closer to that technological milestone.
In this dissertation we approach the issue of indexing content obtained from real-world scenes. We define a ``real-world scene'' as any scene captured continuously in public or private spaces by automated and often passive sensors. These scenes are usually captured by multiple sensors of multiple types. The actions portrayed in the captured sequences consist of everyday activities, such as people walking or running, cars passing by or parking, etc. An example application is a surveillance system. Most of the information in surveillance scenarios is conveyed by a sequence of images but, more often than not, important information can also be obtained by analysing other types of data, or modalities -- multimodal scene analysis relies on that premise. We start by analysing the concepts and challenges involved in multimodal analysis of real-world scenes. Three processing areas are considered: object detection, object recognition, and event analysis. With object detection we separate each object both in space and in time and associate a label with it. This label distinguishes objects from one another but carries no semantic knowledge. That is the goal of object recognition, with which we associate an identity with the object from a set of known classes. With event analysis, on the other hand, we identify relevant activities and events defined by the context of the scene under analysis. For each area we survey relevant algorithms and systems, and present original contributions.
Language:
Portuguese
Type (Professor's evaluation):
Scientific
Contact:
lfpt@fe.up.pt
No. of pages:
272