Mixed-Policy Asynchronous Deep Q-Learning

Title

Mixed-Policy Asynchronous Deep Q-Learning

Type

Article in International Conference Proceedings

Date

2017

Authors

Simões, D

(Author)

Other (the person does not belong to the institution)

Lau, N

(Author)

FCUP

Reis, LP

(Author)

REIT

International Conference Proceedings

Title: Advances in Intelligent Systems and Computing

Pages: 129-140

3rd Iberian Robotics Conference, ROBOT 2017

22 November 2017 through 24 November 2017

Indexing

Scopus - 8 citations

Other Information

Authenticus ID: P-00N-N4Z

DOI: 10.1007/978-3-319-70836-2_11

Abstract (EN): There are many open issues and challenges in the reinforcement learning field, such as handling high-dimensional environments. Function approximators, such as deep neural networks, have been successfully used in both single- and multi-agent environments with high-dimensional state spaces. The multi-agent learning paradigm faces even more problems, due to the effect of several agents learning simultaneously in the environment. One of its main concerns is how to learn mixed policies that prevent opponents from exploiting them in competitive environments, achieving a Nash equilibrium. We propose an extension of several algorithms able to achieve Nash equilibria in single-state games to the deep-learning paradigm. We compare their deep-learning and table-based implementations, and demonstrate how WPL is able to achieve an equilibrium strategy in a complex environment, where agents must find each other in an infinite-state game and play a modified version of the Rock Paper Scissors game. © Springer International Publishing AG 2018.
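
As an illustration of why mixed policies matter in competitive games, the following minimal Python sketch measures how much a best-responding opponent can gain against a given mixed policy. It assumes the standard Rock Paper Scissors payoffs (the abstract refers to a modified version whose payoffs are not given in this record), and the function names `expected_payoff` and `exploitability` are hypothetical, chosen only for this example. It is not the paper's algorithm and does not implement WPL or any deep-learning component.

```python
# Standard Rock Paper Scissors payoffs for the row player.
# ASSUMPTION: the paper plays a modified version; these payoffs are the
# textbook ones and are used here purely for illustration.
# Rows and columns are ordered rock, paper, scissors.
PAYOFF = [
    [0, -1, 1],
    [1, 0, -1],
    [-1, 1, 0],
]


def expected_payoff(row_policy, col_policy):
    """Expected payoff to the row player when both sides play mixed policies."""
    return sum(
        row_policy[i] * col_policy[j] * PAYOFF[i][j]
        for i in range(3)
        for j in range(3)
    )


def exploitability(row_policy):
    """Largest expected gain a best-responding opponent can extract."""
    # The game is zero-sum, so the opponent's payoff is the negative of the
    # row player's payoff. A best response always exists among pure actions,
    # so checking the three pure opponent actions is enough.
    return max(
        -sum(row_policy[i] * PAYOFF[i][j] for i in range(3))
        for j in range(3)
    )


uniform = [1 / 3, 1 / 3, 1 / 3]
rock_heavy = [0.5, 0.25, 0.25]

print(exploitability(uniform))     # no opponent action gains anything
print(exploitability(rock_heavy))  # playing paper gains 0.25 per round
```

Under these assumed payoffs, the uniform policy cannot be exploited, which is what makes it the equilibrium strategy, whereas any policy that favours one action gives the opponent a profitable pure response.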

Language: English

Type (Faculty Evaluation): Scientific

Documents

No document associated with the publication was found.

Related Publications

By the same authors

Learning a Humanoid Kick with Controlled Distance (2016)
Article in International Conference Proceedings
Abdolmaleki, A; Simões, D; Lau, N; Reis, LP; Neumann, G
