Você está em: Início > Publicações > Visualização > A Reinforcement Learning Based Online Coverage Path Planning Algorithm

Mapa das Instalações

Publicação

Pesquisa de Publicações

Publicações

A Reinforcement Learning Based Online Coverage Path Planning Algorithm

Título

A Reinforcement Learning Based Online Coverage Path Planning AlgorithmExportar publicação no formato APA Exportar publicação no formato EXCEL Exportar publicação no formato RIS

Tipo

Artigo em Livro de Atas de Conferência Internacional

Data

2023

Título

A Reinforcement Learning Based Online Coverage Path Planning Algorithm

Tipo

Artigo em Livro de Atas de Conferência Internacional

Ano

2023

Autores

Carvalho, JP

(Autor)

Outra

A pessoa não pertence à instituição. A pessoa não pertence à instituição. A pessoa não pertence à instituição. Sem AUTHENTICUS Sem ORCID

Aguiar, AP

(Autor)

FEUP

Ver página pessoal Enviar mensagem Pesquisar Publicações do Participante Ver página do Authenticus Ver página ORCID

Ata de Conferência Internacional

Título: 2023 IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS, ICARSC Pesquisar Publicações da Ata de Conferência

Páginas: 81-86

IEEE International Conference on Autonomous Robot Systems and Competitions (ICARSC)

Tomar, PORTUGAL, APR 26-27, 2023

Indexação

ISI Web of Knowledge - 0 Citações

Scopus

Outras Informações

ID Authenticus: P-00Y-MGH

DOI: 10.1109/icarsc58346.2023.10129591

Abstract (EN): Coverage Path Planning (CPP) is a common task in robotics that consists in computing collision-free paths that pass through all the specified points from an area of interest. This task is known to be NP-Hard, and increasingly complex when the agent relies exclusively on sensor information. Reinforcement Learning methods appear as an interesting solution to deal with the complexity of this problem and obtain efficient solutions. This paper presents an online CPP algorithm based on Tabular Temporal Difference Learning methods, for a generic robotic platform with a ranging sensor. The problem is formulated as a Partially Observed Markov Decision Process and an RL scheme that includes a modified policy with a heuristic method is proposed. The presented approach provides a way to mix the concepts of classical algorithms with RL, enabling the tabular algorithm to overcome the shortcomings of the inherent large state space of CPP, and accelerated the training process by optimizing and reducing the policy space. The proposed algorithm is tested and its performance is compared in simulation using different Temporal Difference Learning methods, showing that it can efficiently complete the task with no prior information, with different map sizes, starting positions, and a random number of obstacles.

Idioma: Inglês

Tipo (Avaliação Docente): Científica

Nº de páginas: 6

Documentos

Não foi encontrado nenhum documento associado à publicação.

Publicações Relacionadas

Dos mesmos autores

A Unified Stability Analysis of Safety-Critical Control using Multiple Control Barrier Functions (2025)
Artigo em Revista Científica Internacional
Reis, MF; Carvalho, JP; Aguiar, AP

Model Predictive Control for B-Spline Trajectory Tracking in Omnidirectional Robots (2024)
Artigo em Livro de Atas de Conferência Internacional
Carvalho, JP; António Paulo Moreira; Aguiar, AP

Recomendar Página Voltar ao Topo

Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto I Termos e Condições I Acessibilidade I Índice A-Z
Página gerada em: 2025-09-14 às 09:38:26 | Política de Privacidade | Política de Proteção de Dados Pessoais | Denúncias | Livro Amarelo Eletrónico