Static features in real-time recognition of isolated vowels at high pitch

Title

Static features in real-time recognition of isolated vowels at high pitch

Type

Article in International Scientific Journal

Date

2007

Authors

Aníbal Ferreira (Author), FEUP

Journal

Title: Journal of the Acoustical Society of America

Vol. 122 No. 4

Pages: 2389-2404

ISSN: 0001-4966

Publisher: Acoustical Society of America

Indexing

ISI Web of Knowledge - 8 Citations

Scopus - 12 Citations

Scientific Classification

FOS: Engineering and technology > Other engineering and technologies

Other Information

Authenticus ID: P-004-78P

DOI: 10.1121/1.2772228

Abstract (EN): This paper addresses the problem of automatic identification of vowels uttered in isolation by female and child speakers. In this case, the magnitude spectrum of voiced vowels is sparsely sampled, since only frequencies at integer multiples of F0 are significant. This impacts negatively on the performance of vowel identification techniques that either ignore pitch or rely on global shape models. A new pitch-dependent approach to vowel identification is proposed that emerges from the concept of timbre and that defines perceptual spectral clusters (PSC) of harmonic partials. A representative set of static PSC-related features is estimated and their performance is evaluated in automatic classification tests using the Mahalanobis distance. Linear prediction features and Mel-frequency cepstral coefficients (MFCC) are used as a reference, and a database of five (Portuguese) natural vowel sounds uttered by 44 speakers (including 27 child speakers) is used for training and testing the Gaussian models. Results indicate that PSC features perform better than plain linear prediction features, but slightly worse than MFCC features. However, PSC features have the potential to take full advantage of the pitch structure of voiced vowels, namely in the analysis of concurrent voices, or by using pitch as a normalization parameter. © 2007 Acoustical Society of America.
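
The abstract mentions classification tests that use the Mahalanobis distance with Gaussian models. The sketch below illustrates that general technique only; it is not the paper's implementation. The feature vectors are synthetic two-dimensional points rather than PSC, linear prediction, or MFCC features, and the labels "a" and "i", the function names, and the small ridge term are all assumptions made for this example.

```python
import numpy as np


def fit_gaussian_models(features, labels):
    """Estimate a mean vector and an inverse covariance matrix per class."""
    label_array = np.asarray(labels)
    models = {}
    for label in sorted(set(labels)):
        x = features[label_array == label]
        mean = x.mean(axis=0)
        cov = np.cov(x, rowvar=False)
        # Assumption: a small ridge keeps the covariance invertible.
        cov = cov + 1e-6 * np.eye(x.shape[1])
        models[label] = (mean, np.linalg.inv(cov))
    return models


def mahalanobis_sq(x, mean, inv_cov):
    """Squared Mahalanobis distance from x to one class model."""
    d = x - mean
    return float(d @ inv_cov @ d)


def classify(x, models):
    """Return the label whose Gaussian model is closest to x."""
    return min(models, key=lambda label: mahalanobis_sq(x, *models[label]))


# Synthetic training data: two well-separated clusters standing in for
# feature vectors of two hypothetical vowel classes.
rng = np.random.default_rng(0)
train = np.vstack([
    rng.normal(loc=[0.0, 0.0], scale=0.5, size=(50, 2)),
    rng.normal(loc=[5.0, 5.0], scale=0.5, size=(50, 2)),
])
train_labels = ["a"] * 50 + ["i"] * 50

models = fit_gaussian_models(train, train_labels)
prediction_near_origin = classify(np.array([0.1, -0.2]), models)
prediction_near_five = classify(np.array([4.8, 5.3]), models)
```

Each class is summarized by its mean and covariance, so a test vector is assigned to the class whose distribution it is closest to once the spread of that class is taken into account.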

Language: English

Type (Faculty Evaluation): Scientific

No. of pages: 16

Documents

No document associated with this publication was found.

Related Publications

By the same authors

Spectral Coding and Post-Processing of High Quality Audio (1998)
Thesis
Aníbal Ferreira

Codificação Perceptual de Áudio Digital Estereofónico (1992)
Thesis
Aníbal Ferreira

20 Band Digital Audio Equalizer, Implementation on an Expanded TMS320C31DSK - (Texas Instruments 1997 DSP Challenge, European Top 20 entries, The European University Programme, Texas Instruments) (1997)
Technical Report
Gabriel Falcão Fernandes; Luis Gustavo Martins; Miguel Falcão; Aníbal Ferreira

Processamento Digital de Sinal, Aulas Práticas (2004)
Educational Publication
Francisco Restivo; Aníbal Ferreira

A Review of Voicing Decision in Whispered Speech: From Rules to Machine Learning (2025)
Other Publication in International Scientific Journal
da Silva, JMPP; Duarte Nunes, G; Aníbal Ferreira

See all (144)

From the same scientific areas

“Limited Activation, Unlimited Potential”: Acomys Cardiac Fibroblasts in Cardiac Regeneration (2025)
Thesis
María Cardona Timoner

3D-printing of magnetic particles containing hydrogels for smart drug release combined with cancer phototherapy (2023)
Thesis
Filipa Celeste Ribeiro da Silva

3D-Printed Microfiber-Reinforced Hydrogels to Modulate Skin Repair and Fibrosis (2024)
Thesis
Solange Maria Soares Carvalho

3DArm Inertial Sensor-based 3D Upper Limb Motion Tracking and Trajectories Reconstruction (2016)
Thesis
Ana Cristina Campos Pereira

3D Uterine Cavity Reconstruction for Computer-Assisted Hysteroscopy (2025)
Thesis
Ana Filipa Pereira Vieira da Rocha Fernandes

See all (7918)

From the same journal

Relationships between subjective and objective acoustical measures in churches (1997)
Article in International Scientific Journal
António P. Carvalho; António E. Morgado; Luís Henrique

Musicians and non-musicians are equally adept at perceiving masked speech (2015)
Article in International Scientific Journal
Dana Boebinger; Samuel Evans; Stuart Rosen; César F. Lima; Tom Manly; Sophie K. Scott

Evaluation of the successive approximations method for acoustic streaming numerical simulations (2016)
Article in International Scientific Journal
S. O. Catarino; G. Minas; J. M. Miranda
