help

Você está em: Start > Publications > View > A customized residual neural network and bi-directional gated recurrent unit-based automatic speech recognition model

Map of Premises

Publication

Publication Search

A customized residual neural network and bi-directional gated recurrent unit-based automatic speech recognition model

Title

A customized residual neural network and bi-directional gated recurrent unit-based automatic speech recognition modelExport publication in the APA format Export publication in the EXCEL format Export publication in the RIS format

Type

Article in International Scientific Journal

Date

2022-04

Title

A customized residual neural network and bi-directional gated recurrent unit-based automatic speech recognition model

Type

Article in International Scientific Journal

Year

2022-04

Authors

Selim Reza

(Author)

Other

View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications Without AUTHENTICUS Without ORCID

Marta Campos Ferreira

(Author)

FEUP

View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page

J.J.M. Machado

(Author)

FEUP

View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page

João Manuel R. S. Tavares

(Author)

FEUP

View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page

Journal

The Journal is awaiting validation by the Administrative Services.

Title: EXPERT SYSTEMS WITH APPLICATIONS Search for Journal Publications

Vol. 215

Pages: 1-10

ISSN: 0957-4174

Indexing

ISI Web of Knowledge - 0 Citations

ISI Web of Science

Scopus - 1 Citation

Clarivate Analytics

Scientific classification

CORDIS: Technological sciences

FOS: Engineering and technology

Associated Projects

Safe Cities - Safe Cities - Inovação para Construir Cidades Seguras

Other information

Authenticus ID: P-00X-FZ7

DOI: 10.1016/j.eswa.2022.119293

Abstract (EN): Speech recognition aims to convert human speech into text and has applications in security, healthcare, commerce, automobiles, and technology, just to name a few. Inserting residual neural networks before recurrent neural network cells improves accuracy and cuts training time by a good margin. Furthermore, layer normalization instead of batch normalization is more effective in model training and performance enhancement. Also, the size of the datasets presents tremendous influences in achieving the best performance. Leveraging these tricks, this article proposes an automatic speech recognition model with a stacked five layers of customized Residual Convolution Neural Network and seven layers of Bi-Directional Gated Recurrent Units, including a logarithmic so f tmax for the model output. Each of them incorporates a learnable per-element affine parameter-based layer normalization technique. The training and testing of the new model were conducted on the LibriSpeech corpus and LJ Speech dataset. The experimental results demonstrate a character error rate (CER) of 4.7 and 3.61% on the two datasets, respectively, with only 33 million parameters without the requirement of any external language model.

Language: English

Type (Professor's evaluation): Scientific

No. of pages: 10

Documents

File name	Description	Size
1-s2.0-S0957417422023119	Paper	2250.21 KB
paper	1st Page	183.26 KB

Related Publications

Of the same authors

Traffic State Prediction Using One-Dimensional Convolution Neural Networks and Long Short-Term Memory (2022)
Article in International Scientific Journal
Selim Reza; Marta Campos Ferreira; José J. M. Machado; João Manuel R. S. Tavares

A multi-head attention-based transformer model for traffic flow forecasting with a comparative analysis to recurrent neural networks (2022)
Article in International Scientific Journal
Selim Reza; Marta Campos Ferreira; José Joaquim M. Machado; João Manuel R. S. Tavares

Of the same scientific areas

Utilização dos Campos de Granitado LASER (SPECKLE) na Medição de Deslocamentos e Deformações num Plano (1984)
Thesis
A. C. Marques Pinho; J. F. Silva Gomes

Utilização de Técnicas Interferométricas na Medição de Deformações (no Plano) em Estruturas Sujeitas a Solicitações Térmicas e Mecânicas (1996)
Thesis
António Teixeira; J. F. Silva Gomes

Utilização da Interferometria de Granitado Laser (ESPI) na Determinação dos Modos e Frequências Próprias de Vibração de Placas (1987)
Thesis
Pedro M. B. Pimentel; J. F. Silva Gomes

Transformação de um Objecto da Industria Extractiva em Sistema. Algumas consequências. (1998)
Thesis
Alexandre Leite

Solicitações Locais em Cascas Finas de Geometria Esférica e Cilíndrica (1993)
Thesis
Nuno Rilo; J. F. Silva Gomes

See all (2148)

Recommend this page Top

Copyright 1996-2024 © Faculdade de Arquitectura da Universidade do Porto I Terms and Conditions I Acessibility I Index A-Z I Guest Book
Page created on: 2024-11-08 at 14:33:12 | Acceptable Use Policy | Data Protection Policy | Complaint Portal