Publication

Contextual Direct Policy Search

Title
Contextual Direct Policy Search
Type
Article in International Scientific Journal
Year
2019
Authors
Abbas Abdolmaleki
(Author)
David Simoes
(Author)
Nuno Lau
(Author)
Gerhard Neumann
(Author)
Journal
Vol. 96 No. 2
Pages: 141-157
ISSN: 0921-0296
Publisher: Springer Nature
Indexing
Publication in ISI Web of Knowledge - 0 Citations
Publication in Scopus - 0 Citations
Other information
Authenticus ID: P-00R-ZZ5
Abstract (EN): Stochastic search and optimization techniques are used in a vast number of areas, ranging from refining the design of vehicles and determining the effectiveness of new drugs to developing efficient strategies in games and learning proper behaviors in robotics. However, they specialize in the specific problem they are solving, and if the problem's context changes even slightly, they cannot adapt properly. In fact, they require complete re-learning in order to perform correctly in new, unseen scenarios, regardless of how similar these are to previously learned environments. Contextual algorithms have recently emerged as solutions to this problem. They learn a policy for a task that depends on a given context, such that widely different contexts belonging to the same task are learned simultaneously. That being said, the state-of-the-art proposals in this class of algorithms converge prematurely and simply cannot compete with algorithms that learn a policy for a single context. We describe the Contextual Relative Entropy Policy Search (CREPS) algorithm, which belongs to the aforementioned class of contextual algorithms. We extend it with a technique that allows the algorithm to substantially increase its performance, and we call the result Contextual Relative Entropy Policy Search with Covariance Matrix Adaptation (CREPS-CMA). We propose two variants and demonstrate their behavior on a set of classic contextual optimization problems and on complex simulated robot tasks.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 17