Go to:
Logótipo
Você está em: Start > Publications > View > Robust biped locomotion using deep reinforcement learning on top of an analytical control approach
Publication

Robust biped locomotion using deep reinforcement learning on top of an analytical control approach

Title
Robust biped locomotion using deep reinforcement learning on top of an analytical control approach
Type
Article in International Scientific Journal
Year
2021
Authors
Kasaei, M
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Abreu, M
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. View Authenticus page Without ORCID
lau, n
(Author)
Other
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page Without ORCID
Pereira, A
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Journal
Vol. 146
ISSN: 0921-8890
Publisher: Elsevier
Indexing
Publicação em ISI Web of Knowledge ISI Web of Knowledge - 0 Citations
Other information
Authenticus ID: P-00V-KPY
Abstract (EN): This paper proposes a modular framework to generate robust biped locomotion using a tight coupling between an analytical walking approach and deep reinforcement learning. This framework is composed of six main modules which are hierarchically connected to reduce the overall complexity and increase its flexibility. The core of this framework is a specific dynamics model which abstracts a humanoid's dynamics model into two masses for modeling upper and lower body. This dynamics model is used to design an adaptive reference trajectories planner and an optimal controller which are fully parametric. Furthermore, a learning framework is developed based on Genetic Algorithm (GA) and Proximal Policy Optimization (PPO) to find the optimum parameters and to learn how to improve the stability of the robot by moving the arms and changing its center of mass height. A set of simulations are performed to validate the performance of the framework using the official RoboCup 3D League simulation environment. The results validate the performance of the framework, not only in creating a fast and stable gait but also in learning to improve the upper body efficiency.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 13
Documents
We could not find any documents associated to the publication.
Related Publications

Of the same authors

Learning hybrid locomotion skills-Learn to exploit residual actions and modulate model-based gait control (2023)
Article in International Scientific Journal
Kasaei, M; Abreu, M; lau, n; Pereira, A; reis, lp; Li, ZB

Of the same journal

Visual motion perception for mobile robots through dense optical flow fields (2017)
Article in International Scientific Journal
Pinto, AM; Paulo Gomes da Costa; Correia, M. V.; Aníbal Castilho Coimbra de Matos; António Paulo Moreira
Urban@CRAS dataset: Benchmarking of visual odometry and SLAM techniques (2018)
Article in International Scientific Journal
Ana Rita Gaspar; Alexandra Nunes; Andry Maykol Pinto; Aníbal Matos
TEFu-Net: A time-aware late fusion architecture for robust multi-modal ego-motion estimation (2024)
Article in International Scientific Journal
Agostinho, L; Pereira, D; Hiolle, A; Pinto, A
Robust 3/6 DoF self-localization system with selective map update for mobile robot platforms (2016)
Article in International Scientific Journal
Costa, CM; Sobreira, HM; Armando Jorge Sousa; Germano Veiga
Particle filter refinement based on clustering procedures for high-dimensional localization and mapping systems (2021)
Article in International Scientific Journal
André Silva Aguiar; Filipe Neves Santos; Héber Sobreira; José Boaventura Cunha; Armando Jorge Sousa

See all (15)

Recommend this page Top
Copyright 1996-2024 © Faculdade de Arquitectura da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z  I Guest Book
Page created on: 2024-11-08 at 19:54:49 | Acceptable Use Policy | Data Protection Policy | Complaint Portal