Static features in real-time recognition of isolated vowels at high pitch

Title
Static features in real-time recognition of isolated vowels at high pitch
Type
Article in International Scientific Journal
Year
2007
Journal
The Journal of the Acoustical Society of America, Vol. 122 No. 4
Pages: 2389-2404
ISSN: 0001-4966
Scientific classification
FOS: Engineering and technology > Other engineering and technologies
Other information
Authenticus ID: P-004-78P
Abstract (EN): This paper addresses the problem of automatic identification of vowels uttered in isolation by female and child speakers. In this case, the magnitude spectrum of voiced vowels is sparsely sampled, since only frequencies at integer multiples of F0 are significant. This degrades the performance of vowel identification techniques that either ignore pitch or rely on global shape models. A new pitch-dependent approach to vowel identification is proposed that emerges from the concept of timbre and that defines perceptual spectral clusters (PSC) of harmonic partials. A representative set of static PSC-related features is estimated, and its performance is evaluated in automatic classification tests using the Mahalanobis distance. Linear prediction features and Mel-frequency cepstral coefficients (MFCC) are used as a reference, and a database of five (Portuguese) natural vowel sounds uttered by 44 speakers (including 27 child speakers) is used for training and testing the Gaussian models. Results indicate that PSC features perform better than plain linear prediction features, but slightly worse than MFCC features. However, PSC features have the potential to take full advantage of the pitch structure of voiced vowels, namely in the analysis of concurrent voices, or by using pitch as a normalization parameter. © 2007 Acoustical Society of America.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 16
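Illustrative note: the abstract describes evaluating features with Gaussian class models and the Mahalanobis distance. The sketch below shows that general idea only, not the paper's implementation. It fits one mean and covariance per vowel class and assigns a feature vector to the class with the smallest squared Mahalanobis distance. The function names, the two-dimensional features, and the training values are all hypothetical placeholders; they are not PSC, linear prediction, or MFCC features and are not taken from the paper's database. A small pure-Python linear solver is used only to keep the example self-contained.

```python
from typing import Dict, List, Sequence, Tuple

Vector = List[float]
Matrix = List[List[float]]


def mean_and_covariance(samples: Sequence[Sequence[float]]) -> Tuple[Vector, Matrix]:
    """Sample mean and unbiased sample covariance of a set of feature vectors."""
    n, d = len(samples), len(samples[0])
    mean = [sum(s[j] for s in samples) / n for j in range(d)]
    cov = [[sum((s[i] - mean[i]) * (s[j] - mean[j]) for s in samples) / (n - 1)
            for j in range(d)] for i in range(d)]
    return mean, cov


def solve(a: Matrix, b: Vector) -> Vector:
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("singular covariance matrix")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        x[i] = (m[i][n] - sum(m[i][j] * x[j] for j in range(i + 1, n))) / m[i][i]
    return x


def mahalanobis_sq(x: Sequence[float], mean: Vector, cov: Matrix) -> float:
    """Squared Mahalanobis distance (x - mean)^T cov^{-1} (x - mean)."""
    diff = [xi - mi for xi, mi in zip(x, mean)]
    return sum(d * y for d, y in zip(diff, solve(cov, diff)))


def classify(x: Sequence[float], models: Dict[str, Tuple[Vector, Matrix]]) -> str:
    """Return the label whose Gaussian model is nearest in Mahalanobis distance."""
    return min(models, key=lambda label: mahalanobis_sq(x, *models[label]))


# Hypothetical 2-D training features for two vowel classes (made-up numbers).
training = {
    "a": [[1.0, 2.0], [1.2, 1.9], [0.8, 2.2], [1.1, 2.1]],
    "i": [[3.0, 0.5], [3.2, 0.4], [2.9, 0.7], [3.1, 0.6]],
}
models = {label: mean_and_covariance(vectors) for label, vectors in training.items()}
```

Calling `classify([1.0, 2.0], models)` then picks the class whose model lies closest to the query vector. In practice the covariance estimate needs enough training vectors per class to be invertible, which is why the solver raises an error on a singular matrix.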
Related Publications

Of the same authors

Processamento Digital de Sinal, Aulas Práticas (2004)
Educational Publication
Francisco Restivo; Aníbal Ferreira
A Review of Voicing Decision in Whispered Speech: From Rules to Machine Learning (2025)
Another Publication in an International Scientific Journal
da Silva, JMPP; Duarte Nunes, G; Aníbal Ferreira

Of the same journal

Relationships between subjective and objective acoustical measures in churches (1997)
Article in International Scientific Journal
António P. Carvalho; António E. Morgado; Luís Henrique
Musicians and non-musicians are equally adept at perceiving masked speech (2015)
Article in International Scientific Journal
Dana Boebinger; Samuel Evans; Stuart Rosen; César F. Lima; Tom Manly; Sophie K. Scott
Evaluation of the successive approximations method for acoustic streaming numerical simulations (2016)
Article in International Scientific Journal
S. O. Catarino; G. Minas; J. M. Miranda