Phonetic-oriented identification of twin speakers using 4-second vowel sounds and a combination of a shift-invariant phase feature (NRD), MFCCs and F0 information

Title

Phonetic-oriented identification of twin speakers using 4-second vowel sounds and a combination of a shift-invariant phase feature (NRD), MFCCs and F0 information

Type

Article in International Conference Proceedings Book

Date

2019

Authors

Aníbal Ferreira

(Author)

FEUP

Conference proceedings International

Title: Proceedings of the AES International Conference

2019 AES International Conference on Audio Forensics

18 June 2019 through 20 June 2019

Indexing

ISI Web of Knowledge - 0 Citations

Scopus - 0 Citations

Other information

Authenticus ID: P-00Q-ZRQ

Abstract (EN): Automatic speaker identification typically relies on sophisticated statistical modeling and classification which requires large amounts of data for good performance. However, in actual audio forensics casework, frequently only a few seconds of speech material are available. In this paper, we favor diversity in feature extraction, simple modeling and classification, and constructive combination of congruent classification scores. We use phase, spectral magnitude and F0-related features in speaker identification experiments on a database of 35 speakers most of whom are twins. Using only 4.4 sec. of vowel-like sounds per speaker, we characterize the performance that is reached with individual features and we characterize simple and yet effective ways of classification score fusion. Insights for further research are also presented.

Language: English

Type (Professor's evaluation): Scientific

No. of pages: 17

Related Publications

Of the same authors

Spectral Coding and Post-Processing of High Quality Audio (1998)
Thesis
Aníbal Ferreira

Codificação Perceptual de Áudio Digital Estereofónico (1992)
Thesis
Aníbal Ferreira

20 Band Digital Audio Equalizer, Implementation on an Expanded TMS320C31DSK - (Texas Instruments 1997 DSP Challenge, European Top 20 entries, The European University Programme, Texas Instruments) (1997)
Technical Report
Gabriel Falcão Fernandes; Luis Gustavo Martins; Miguel Falcão; Aníbal Ferreira

Processamento Digital de Sinal, Aulas Práticas (2004)
Educational Publication
Francisco Restivo; Aníbal Ferreira

A Review of Voicing Decision in Whispered Speech: From Rules to Machine Learning (2025)
Another Publication in an International Scientific Journal
da Silva, JMPP; Duarte Nunes, G; Aníbal Ferreira
