Abstract (EN):
In recent years, the artificial intelligence community has made great strides in applying deep reinforcement learning to games and similar environments. From Atari to board games, and from motor control to puzzle solving, fairly generic deep learning algorithms can now learn strong policies simply by playing from experience, with minimal knowledge of the specific domain. However, these algorithms demand considerable time and hardware to achieve the results reported in the literature; some would take years to reach state-of-the-art performance on commodity hardware. Moreover, the learning environments themselves can slow down the learning process if they have not been optimized for performance. In this paper, we evaluate a complex existing environment and propose a performance-oriented version, which we call GeoFriends2. We describe the motivation behind the creation of our version and how it is suitable for both single- and multi-agent reinforcement learning. We then use Asynchronous Deep Learning to create complex policies that can serve as baselines for future research on this environment. We also describe a set of techniques that speed up the learning process so that experiments can be run on commodity hardware in hours rather than weeks, using much simpler network architectures. © 2018 IEEE.
Language:
English
Type (Professor's evaluation):
Scientific