Go to:
Logótipo
Comuta visibilidade da coluna esquerda
Você está em: Start > Publications > View > Automatic speaker segmentation using multiple features and distance measures: A comparison of three approaches
Publication

Publications

Automatic speaker segmentation using multiple features and distance measures: A comparison of three approaches

Title
Automatic speaker segmentation using multiple features and distance measures: A comparison of three approaches
Type
Article in International Conference Proceedings Book
Year
2006
Authors
Margarita Kotti
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Luis Gustavo P M Martins
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. View Authenticus page Without ORCID
Emmanouil Benetos
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Jaime S Cardoso
(Author)
FCUP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page
Constantine Kotropoulos
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Conference proceedings International
Pages: 1101-1104
IEEE International Conference on Multimedia and Expo (ICME 2006)
Toronto, CANADA, JUL 09-12, 2006
Scientific classification
FOS: Engineering and technology > Electrical engineering, Electronic engineering, Information engineering
CORDIS: Technological sciences > Technology > Computer technology > Speech processing
Other information
Authenticus ID: P-004-R3J
Abstract (EN): This paper addresses the problem of unsupervised speaker change detection. Three systems based on the Bayesian Information Criterion (BIC) are tested. The first system investigates the AudioSpectrumCentroid and the AudioWaveformEnvelope features, implements a dynamic thresholding followed by a fusion scheme, and finally applies BIC. The second method is a real-time one that uses a metric-based approach employing the line spectral pairs and the BIC to validate a potential speaker change point. The third method consists of three modules. In the first module, a measure based on second-order statistics is used; in the second module, the Euclidean distance and T-2 Hotelling statistic are applied; and in the third module, the BIC is utilized. The experiments are carried out on a dataset created by concatenating speakers from the TIMIT database, that is referred to as the TIMIT data set. A comparison between the performance of the three systems is made based on t-statistics.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 4
Documents
We could not find any documents associated to the publication.
Related Publications

Of the same scientific areas

Synthetic speech evaluation : the SUS approach and implementation for Portuguese (2003)
Article in International Conference Proceedings Book
Maria João Almeida de Sá Barros; Diamantino Rui da Silva Freitas; Daniela Filipa Macedo Braga Moreira da Silva; Luís Coelho; António Moura
Processamento linguístico aplicado à síntese da fala (2003)
Article in International Conference Proceedings Book
Helder Filipe Patrício Cabral Ferreira; Daniela Filipa Macedo Braga Moreira da Silva; Diamantino Rui da Silva Freitas
Prediction of Fujisaki model’s phrase commands (2003)
Article in International Conference Proceedings Book
João Paulo Ramos Teixeira; Diamantino Rui da Silva Freitas; Hiroya Fujisaki
Prediction of accent commands for the Fujisaki intonation model (2004)
Article in International Conference Proceedings Book
Hiroya Fujisaki; João Teixeira; Diamantino Rui da Silva Freitas
On the use of prosodic labelling in corpus-based linguistic studies of spontaneous speech (2003)
Article in International Conference Proceedings Book
Daniela Filipa Macedo Braga Moreira da Silva; Diamantino Rui da Silva Freitas; Aldina Marques; João Teixeira

See all (6)

Recommend this page Top
Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2025-07-08 at 07:07:38 | Privacy Policy | Personal Data Protection Policy | Whistleblowing