Go to:
Logótipo
Comuta visibilidade da coluna esquerda
Você está em: Start > Publications > View > Static features in isolated vowel recognition at high pitch
Publication

Publications

Static features in isolated vowel recognition at high pitch

Title
Static features in isolated vowel recognition at high pitch
Type
Article in International Conference Proceedings Book
Year
2008
Conference proceedings International
Pages: 63-68
International Conference on Signal Processing and Multimedia Applications
Oproto, PORTUGAL, JUL 26, 2008
Indexing
Publicação em ISI Web of Knowledge ISI Web of Knowledge - 0 Citations
Publicação em Scopus Scopus - 0 Citations
Other information
Authenticus ID: P-004-4V3
Abstract (EN): Vowel recognition is frequently based on Linear Prediction (LP) analysis and formant estimation techniques. However, the performance of these techniques decreases in the case of female or child speech because at high pitch frequencies (F0) the magnitude spectrum is scarcely sampled making formant estimation unreliable. In this paper we describe the implementation of a perceptually motivated concept of vowel recognition that is based on Perceptual Spectral Clusters (PSC) of harmonic partials. PSC based features were evaluated in automatic recognition tests using the Mahalanobis distance and using a data base of five natural Portuguese vowel sounds uttered by 44 speakers, 27 of whom are child speakers. LP based features and Mel-Frequency Cepstral Coefficients (MFCC) were also included in the tests as a reference. Results show that while the recognition performance of PSC features falls between that of LP based features and that of MFCC coefficients, the normalization of PSC features by F0 increases the performance and approaches that of MFCC coefficients. PSC features are not only amenable to a psychophysical interpretation (as LP based features are) but have also the potential to compete with global shape features such as MFCCs.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 6
Documents
We could not find any documents associated to the publication.
Related Publications

Of the same authors

Processamento Digital de Sinal, Aulas Práticas (2004)
Educational Publication
Francisco Restivo; Aníbal Ferreira
A Review of Voicing Decision in Whispered Speech: From Rules to Machine Learning (2025)
Another Publication in an International Scientific Journal
da Silva, JMPP; Duarte Nunes, G; Aníbal Ferreira

See all (144)

Recommend this page Top
Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2025-07-20 at 09:59:41 | Privacy Policy | Personal Data Protection Policy | Whistleblowing