Go to:
Logótipo
Comuta visibilidade da coluna esquerda
Você está em: Start > Publications > View > Convergence Analysis of Reinforcement Learning Algorithms Using Generalized Weak Contraction Mappings
Publication

Publications

Convergence Analysis of Reinforcement Learning Algorithms Using Generalized Weak Contraction Mappings

Title
Convergence Analysis of Reinforcement Learning Algorithms Using Generalized Weak Contraction Mappings
Type
Article in International Scientific Journal
Year
2025
Authors
Belhenniche, A
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Chertovskih, R
(Author)
FEUP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page
Gonçalves, R
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Journal
Title: SymmetryImported from Authenticus Search for Journal Publications
Final page: 750
ISSN: 2073-8994
Publisher: MDPI
Other information
Authenticus ID: P-018-V6R
Abstract (EN): <jats:p>We investigate the convergence properties of policy iteration and value iteration algorithms in reinforcement learning by leveraging fixed-point theory, with a focus on mappings that exhibit weak contractive behavior. Unlike traditional studies that rely on strong contraction properties, such as those defined by the Banach contraction principle, we consider a more general class of mappings that includes weak contractions. Employing Zamfirscu¿s fixed-point theorem, we establish sufficient conditions for norm convergence in infinite-dimensional policy spaces under broad assumptions. Our approach extends the applicability of these algorithms to feedback control problems in reinforcement learning, where standard contraction conditions may not hold. Through illustrative examples, we demonstrate that this framework encompasses a wider range of operators, offering new insights into the robustness and flexibility of iterative methods in dynamic programming.</jats:p>
Language: English
Type (Professor's evaluation): Scientific
Documents
We could not find any documents associated to the publication.
Related Publications

Of the same journal

Unveiling the Dynamics of the Universe (2016)
Another Publication in an International Scientific Journal
Avelino, PP; Barreiro, T; Sofia Carvalho, CS; da Silva, A; Lobo, FSN; Martin Moruno, P; Mimoso, JP; Nunes, NJ; Rubiera Garcia, D; Saez Gomez, D; Sousa, L; Tereno, I; Trindade, A
Symmetry in Complex Systems (2020)
Another Publication in an International Scientific Journal
António Mendes Lopes; Tenreiro Machado, JAT
Sixty Years of the Maximum Principle in Optimal Control: Historical Roots and Content Classification (2024)
Another Publication in an International Scientific Journal
Chertovskih, R; Ribeiro, VM; Gonçalves, R; Aguiar, AP
Occurrence of Chiral Bioactive Compounds in the Aquatic Environment: A Review (2017)
Another Publication in an International Scientific Journal
Cláudia Ribeiro; Ana Rita Ribeiro; Alexandra S. Maia; Maria Elizabeth Tiritan
Chiral Stationary Phases for Liquid Chromatography Based on Chitin-and Chitosan-Derived Marine Polysaccharides (2017)
Another Publication in an International Scientific Journal
Ribeiro, J; M E Tiritan; Pinto, MMM; Carla Fernandes

See all (23)

Recommend this page Top
Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2025-08-07 at 14:46:06 | Privacy Policy | Personal Data Protection Policy | Whistleblowing