Saltar para:
Logótipo
Você está em: Início > Publicações > Visualização > An Actor-Critic-based adapted Deep Reinforcement Learning model for multi-step traffic state prediction
Mapa das Instalações
FC6 - Departamento de Ciência de Computadores FC5 - Edifício Central FC4 - Departamento de Biologia FC3 - Departamento de Física e Astronomia e Departamento GAOT FC2 - Departamento de Química e Bioquímica FC1 - Departamento de Matemática

An Actor-Critic-based adapted Deep Reinforcement Learning model for multi-step traffic state prediction

Título
An Actor-Critic-based adapted Deep Reinforcement Learning model for multi-step traffic state prediction
Tipo
Artigo em Revista Científica Internacional
Ano
2025-12
Autores
Selim Reza
(Autor)
Outra
Ver página pessoal Sem permissões para visualizar e-mail institucional Pesquisar Publicações do Participante Sem AUTHENTICUS Sem ORCID
Marta Campos Ferreira
(Autor)
FEUP
J.J.M. Machado
(Autor)
FEUP
João Manuel R. S. Tavares
(Autor)
FEUP
Revista
Vol. 184 113783
Páginas: 1-14
ISSN: 1568-4946
Editora: Elsevier
Indexação
Publicação em ISI Web of Knowledge ISI Web of Knowledge - 0 Citações
Publicação em ISI Web of Science ISI Web of Science
Publicação em Scopus Scopus - 0 Citações
Clarivate Analytics
Classificação Científica
CORDIS: Ciências Tecnológicas
FOS: Ciências da engenharia e tecnologias
Outras Informações
ID Authenticus: P-019-YYF
Abstract (EN): Traffic state prediction is critical to decision-making in various traffic management applications. Despite significant advancements in Deep Learning (DL) models, such as Long Short-Term Memory (LSTM), Graph Neural Networks (GNN), and attention-based transformer models, multi-step predictions remain challenging. The state-of-the-art models face a common limitation: the predictions' accuracy decreases as the prediction horizon increases, a phenomenon known as error accumulation. In addition, with the arrival of non-recurrent events and external noise, the models fail to maintain good prediction accuracy. Deep Reinforcement Learning (DRL) has been widely applied to diverse tasks, including optimising intersection traffic signal control. However, its potential to address multi-step traffic prediction challenges remains underexplored. This study introduces an Actor-Critic-based adapted DRL method to explore the solution to the challenges associated with multi-step prediction. The Actor network makes predictions by capturing the temporal correlations of the data sequence, and the Critic network optimises the Actor by evaluating the prediction quality using Q-values. This novel combination of Supervised Learning and Reinforcement Learning (RL) paradigms, along with non-autoregressive modelling, helps the model to mitigate the error accumulation problem and increase its robustness to the arrival of non-recurrent events. It also introduces a Denoising Autoencoder to deal with external noise effectively. The proposed model was trained and evaluated on three benchmark traffic flow and speed datasets. Baseline multi-step prediction models were implemented for comparison based on performance metrics such as Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE). The results reveal that the proposed method outperforms the baselines by achieving average improvements of 0.26 to 21.29% in terms of MAE and RMSE for up to 24 time steps of prediction length on the three used datasets, at the expense of relatively higher computational costs. On top of that, this adapted DRL approach outperforms traditional DRL models, such as Deep Deterministic Policy Gradient (DDPG), in accuracy and computational efficiency.
Idioma: Inglês
Tipo (Avaliação Docente): Científica
Nº de páginas: 14
Documentos
Nome do Ficheiro Descrição Tamanho
paper 1st Page 1103.88 KB
1-s2.0-S1568494625010968 Paper 2283.50 KB
Publicações Relacionadas

Dos mesmos autores

Traffic State Prediction Using One-Dimensional Convolution Neural Networks and Long Short-Term Memory (2022)
Artigo em Revista Científica Internacional
Selim Reza; Marta Campos Ferreira; José J. M. Machado; João Manuel R. S. Tavares
Road Traffic Events Monitoring Using a Multi-Head Attention Mechanism-Based Transformer and Temporal Convolutional Networks (2025)
Artigo em Revista Científica Internacional
Selim Reza; Marta Campos Ferreira; J.J.M. Machado; João Manuel R. S. Tavares
A multi-head attention-based transformer model for traffic flow forecasting with a comparative analysis to recurrent neural networks (2022)
Artigo em Revista Científica Internacional
Selim Reza; Marta Campos Ferreira; José Joaquim M. Machado; João Manuel R. S. Tavares
A customized residual neural network and bi-directional gated recurrent unit-based automatic speech recognition model (2022)
Artigo em Revista Científica Internacional
Selim Reza; Marta Campos Ferreira; J.J.M. Machado; João Manuel R. S. Tavares

Ver todas (6)

Da mesma revista

Novelty detection for multi-label stream classification under extreme verification latency (2023)
Artigo em Revista Científica Internacional
Costa, JD; Júnior; Faria, ER; João Gama; Gama, J; Cerri, R
Improving a simulated soccer team's performance through a Memory-Based Collaborative Filtering approach (2014)
Artigo em Revista Científica Internacional
Pedro Henriques Abreu; Daniel Castro Silva; Fernando Almeida; João Mendes-Moreira
Heuristics for online three-dimensional packing problems and algorithm selection framework for semi-online with full look-ahead (2024)
Artigo em Revista Científica Internacional
Ali, S; Ramos, AG; Maria Antónia Carravilla; José Fernando Oliveira
Glass container production scheduling through hybrid multi-population based evolutionary algorithm (2013)
Artigo em Revista Científica Internacional
toledo, cfm; arantes, md; de oliveira, rrr; almada-lobo, b

Ver todas (10)

Recomendar Página Voltar ao Topo
Copyright 1996-2026 © Faculdade de Ciências da Universidade do Porto  I Termos e Condições  I Acessibilidade  I Índice A-Z
Última actualização: 2016-03-23 I  Página gerada em: 2026-02-28 às 16:50:58 | Política de Privacidade | Política de Proteção de Dados Pessoais | Denúncias | Livro Amarelo Eletrónico