Abstract (EN):
Coverage Path Planning (CPP) is a common task in robotics that consists of computing collision-free paths passing through all specified points in an area of interest. The task is known to be NP-hard and becomes even more complex when the agent relies exclusively on sensor information. Reinforcement Learning (RL) methods are a promising way to handle this complexity and obtain efficient solutions. This paper presents an online CPP algorithm based on Tabular Temporal Difference Learning methods for a generic robotic platform equipped with a ranging sensor. The problem is formulated as a Partially Observable Markov Decision Process, and an RL scheme whose policy is modified by a heuristic method is proposed. The presented approach combines concepts from classical algorithms with RL, enabling the tabular algorithm to cope with the inherently large state space of CPP and accelerating training by optimizing and reducing the policy space. The proposed algorithm is tested in simulation and its performance is compared across different Temporal Difference Learning methods, showing that it can efficiently complete the task with no prior information, for different map sizes, starting positions, and a random number of obstacles.
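To make the idea concrete, below is a minimal sketch of tabular one-step Temporal Difference learning (Q-learning) with a heuristically restricted epsilon-greedy policy on a toy grid-coverage task. This is not the paper's implementation: the environment (GridCoverageEnv), the heuristic (heuristic_actions), the partial observation, the reward shaping, and all hyperparameters are illustrative assumptions invented for this example.

```python
# Minimal sketch (not the authors' implementation): tabular Q-learning
# with a heuristically restricted policy on a toy grid-coverage task.
# All names and hyperparameters here are illustrative assumptions.
import random
from collections import defaultdict

ACTIONS = [(0, 1), (0, -1), (1, 0), (-1, 0)]  # right, left, down, up

class GridCoverageEnv:
    """Toy coverage environment: visit every free cell of a small grid."""
    def __init__(self, size=5, obstacles=frozenset()):
        self.size = size
        self.obstacles = obstacles
        self.reset()

    def reset(self):
        self.pos = (0, 0)
        self.visited = {self.pos}
        return self._obs()

    def _obs(self):
        # Partial observation: current cell plus which neighbors are blocked,
        # a stand-in for the ranging-sensor information in the paper.
        blocked = tuple(self._blocked((self.pos[0] + dr, self.pos[1] + dc))
                        for dr, dc in ACTIONS)
        return (self.pos, blocked)

    def _blocked(self, cell):
        r, c = cell
        return not (0 <= r < self.size and 0 <= c < self.size) or cell in self.obstacles

    def step(self, a):
        dr, dc = ACTIONS[a]
        nxt = (self.pos[0] + dr, self.pos[1] + dc)
        if self._blocked(nxt):
            return self._obs(), -1.0, False      # penalize collisions
        self.pos = nxt
        reward = 1.0 if nxt not in self.visited else -0.1  # reward new coverage
        self.visited.add(nxt)
        free = self.size * self.size - len(self.obstacles)
        done = len(self.visited) == free         # full coverage reached
        return self._obs(), reward, done

def heuristic_actions(env):
    # Heuristic policy restriction: never propose an action that drives
    # straight into a wall or obstacle, shrinking the effective policy space.
    return [a for a, (dr, dc) in enumerate(ACTIONS)
            if not env._blocked((env.pos[0] + dr, env.pos[1] + dc))]

def train(episodes=500, alpha=0.1, gamma=0.95, eps=0.1, seed=0):
    random.seed(seed)
    Q = defaultdict(float)                       # tabular values Q[(obs, a)]
    env = GridCoverageEnv()
    for _ in range(episodes):
        obs, done, steps = env.reset(), False, 0
        while not done and steps < 200:
            valid = heuristic_actions(env) or list(range(len(ACTIONS)))
            if random.random() < eps:            # epsilon-greedy over valid actions
                a = random.choice(valid)
            else:
                a = max(valid, key=lambda x: Q[(obs, x)])
            nxt, r, done = env.step(a)
            # One-step temporal-difference (Q-learning) backup
            best_next = max(Q[(nxt, x)] for x in range(len(ACTIONS)))
            Q[(obs, a)] += alpha * (r + gamma * best_next - Q[(obs, a)])
            obs, steps = nxt, steps + 1
    return Q

if __name__ == "__main__":
    Q = train()
    print(f"learned {len(Q)} tabular entries")
```

Restricting the epsilon-greedy choice to heuristic_actions mirrors the abstract's idea of mixing a classical heuristic into the RL policy: the table only has to rank actions the heuristic admits, which shrinks the effective policy space the agent must explore.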
Language:
English
Type (Professor's evaluation):
Scientific
No. of pages:
6