Guided Deep Reinforcement Learning in the GeoFriends2 Environment

Title

Guided Deep Reinforcement Learning in the GeoFriends2 Environment

Type

Article in International Conference Proceedings

Date

2018

Authors

Simoes, D

(Author)

Other

lau, n

(Author)

Other

reis, lp

(Author)

FEUP

International Conference Proceedings

Title: Proceedings of the International Joint Conference on Neural Networks

2018 International Joint Conference on Neural Networks, IJCNN 2018

8 July 2018 through 13 July 2018

Indexing

Scopus - 4 citations

Other Information

Authenticus ID: P-00P-W9C

DOI: 10.1109/ijcnn.2018.8489372

Abstract (EN): In recent years, the artificial intelligence community has taken big strides in the application of reinforcement learning to games or similar environments using deep learning. From Atari to board games, including motor control or riddle solving, fairly generic deep learning algorithms can now achieve great policies by simply learning to play from experience, and minimal knowledge of the specific domain. However, these algorithms are very demanding in terms of time and hardware in order to achieve the results reported in the literature. So much so, that some algorithms would take years to achieve state-of-the-art performance in commodity hardware. Not only that, but even the learning environments can hinder the speed of the learning process, if they have not been performance optimized. In this paper, we evaluate a complex existing environment, and propose a performance-oriented version, which we call GeoFriends2. We describe the motivation behind the creation of our version, and how it is suitable for both single- and multi-agent reinforcement learning. We then use Asynchronous Deep Learning to create complex policies that can act as baselines for future research on this environment. We also describe a set of techniques that speed up the learning process such that tests can be run with commodity hardware in hours, and not weeks, and using much simpler network architectures. © 2018 IEEE.

Language: English

Type (Faculty Evaluation): Scientific

Documents

No document associated with this publication was found.

Related Publications

By the same authors

Multi-agent actor centralized-critic with communication (2020)
Article in International Scientific Journal
Simoes, D; lau, n; reis, lp

MULTI AGENT DEEP LEARNING WITH COOPERATIVE COMMUNICATION (2020)
Article in International Scientific Journal
Simoes, D; lau, n; reis, lp

Exploring communication protocols and centralized critics in multi-agent deep learning (2020)
Article in International Scientific Journal
Simoes, D; lau, n; reis, lp

Multi-agent Double Deep Q-Networks (2017)
Article in International Conference Proceedings
Simoes, D; lau, n; reis, lp

Learning Low-Level Behaviors and High-Level Strategies in Humanoid Soccer (2020)
Article in International Conference Proceedings
Simoes, D; Amaro, P; Maria Teresa Andrade; lau, n; reis, lp
