Você está em: Início > Publicações > Visualização > Phonetic-oriented identification of twin speakers using 4-second vowel sounds and a combination of a shift-invariant phase feature (NRD), MFCCs and F0 information

Mapa das Instalações

Publicação

Pesquisa de Publicações

Publicações

Phonetic-oriented identification of twin speakers using 4-second vowel sounds and a combination of a shift-invariant phase feature (NRD), MFCCs and F0 information

Título

Phonetic-oriented identification of twin speakers using 4-second vowel sounds and a combination of a shift-invariant phase feature (NRD), MFCCs and F0 informationExportar publicação no formato APA Exportar publicação no formato EXCEL Exportar publicação no formato RIS

Tipo

Artigo em Livro de Atas de Conferência Internacional

Data

2019

Título

Phonetic-oriented identification of twin speakers using 4-second vowel sounds and a combination of a shift-invariant phase feature (NRD), MFCCs and F0 information

Tipo

Artigo em Livro de Atas de Conferência Internacional

Ano

2019

Autores

Aníbal Ferreira

(Autor)

FEUP

Ver página pessoal Enviar mensagem Pesquisar Publicações do Participante Ver página do Authenticus Ver página ORCID

Ata de Conferência Internacional

Título: Proceedings of the AES International Conference Pesquisar Publicações da Ata de Conferência

2019 AES International Conference on Audio Forensics

18 June 2019 through 20 June 2019

Indexação

ISI Web of Knowledge - 0 Citações

Scopus - 0 Citações

Outras Informações

ID Authenticus: P-00Q-ZRQ

Abstract (EN): Automatic speaker identification typically relies on sophisticated statistical modeling and classification which requires large amounts of data for good performance. However, in actual audio forensics casework, frequently only a few seconds of speech material are available. In this paper, we favor diversity in feature extraction, simple modeling and classification, and constructive combination of congruent classification scores. We use phase, spectral magnitude and F0-related features in speaker identification experiments on a database of 35 speakers most of whom are twins. Using only 4.4 sec. of vowel-like sounds per speaker, we characterize the performance that is reached with individual features and we characterize simple and yet effective ways of classification score fusion. Insights for further research are also presented.

Idioma: Inglês

Tipo (Avaliação Docente): Científica

Nº de páginas: 17

Documentos

Não foi encontrado nenhum documento associado à publicação.

Publicações Relacionadas

Dos mesmos autores

Spectral Coding and Post-Processing of High Quality Audio (1998)
Tese
Aníbal Ferreira

Codificação Perceptual de Áudio Digital Estereofónico (1992)
Tese
Aníbal Ferreira

20 Band Digital Audio Equalizer, Implementation on an Expanded TMS320C31DSK - (Texas Instruments 1997 DSP Challenge, European Top 20 entries, The European University Programme, Texas Instruments) (1997)
Relatório Técnico
Gabriel Falcão Fernandes; Luis Gustavo Martins; Miguel Falcão; Aníbal Ferreira

Processamento Digital de Sinal, Aulas Práticas (2004)
Publicação Didática
Francisco Restivo; Aníbal Ferreira

A Review of Voicing Decision in Whispered Speech: From Rules to Machine Learning (2025)
Outra Publicação em Revista Científica Internacional
da Silva, JMPP; Duarte Nunes, G; Aníbal Ferreira

Ver todas (144)

Recomendar Página Voltar ao Topo

Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto I Termos e Condições I Acessibilidade I Índice A-Z
Página gerada em: 2025-09-19 às 17:15:21 | Política de Privacidade | Política de Proteção de Dados Pessoais | Denúncias | Livro Amarelo Eletrónico