Regularized Covariance Estimation for Weighted Maximum Likelihood Policy Search Methods

Type

Article in International Conference Proceedings

Date

2015

Authors

Abdolmaleki, A

(Author)

Other (the person does not belong to the institution)

Lau, N

(Author)

FCUP

Reis, LP

(Author)

REIT

Neumann, G

(Author)

Other (the person does not belong to the institution)

International Conference Proceedings

Title: IEEE-RAS International Conference on Humanoid Robots

Pages: 154-159

15th IEEE RAS International Conference on Humanoid Robots, Humanoids 2015

3 November 2015 through 5 November 2015

Indexing

ISI Web of Knowledge - 3 Citations

Scopus - 12 Citations

Other Information

Authenticus ID: P-00K-AP3

DOI: 10.1109/HUMANOIDS.2015.7363529

Abstract (EN): Many episode-based (or direct) policy search algorithms maintain a multivariate Gaussian distribution as the search distribution over the parameter space of some objective function. One class of algorithms, such as episodic REPS, PoWER or PI2, uses a weighted maximum likelihood estimate (WMLE) to update the mean and covariance matrix of this distribution in each iteration. However, due to the high dimensionality of covariance matrices and the limited number of samples, the WMLE is an unreliable estimator. The use of the WMLE leads to overfitted covariance estimates, and hence the variance/entropy of the search distribution decreases too quickly, which may cause premature convergence. To alleviate this problem, the estimated covariance matrix can be regularized in different ways, for example by using a convex combination of the diagonal covariance estimate and the sample covariance estimate. In this paper, we propose a new covariance matrix regularization technique for policy search methods that uses the convex combination of the sample covariance matrix and the old covariance matrix used in the last iteration. The combination weighting is determined by specifying the desired entropy of the new search distribution. With this mechanism, the entropy of the search distribution can be gradually decreased without damage from the maximum likelihood estimate.
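
The idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the convention that `lam` weights the sample covariance, the bisection search, and the fallback rules at the two endpoints are all assumptions made for illustration. The sketch computes a weighted maximum likelihood covariance, then searches for the blend weight at which the convex combination with the old covariance reaches a desired Gaussian entropy. Bisection is valid here because the log-determinant is concave along the line between two covariance matrices, so the entropy crosses the target at most once when the old covariance lies above it and the sample estimate below it.

```python
import numpy as np


def gaussian_entropy(cov):
    """Differential entropy (in nats) of a Gaussian with covariance `cov`."""
    dim = cov.shape[0]
    _, logdet = np.linalg.slogdet(cov)  # -inf for an exactly singular matrix
    return 0.5 * (dim * (1.0 + np.log(2.0 * np.pi)) + logdet)


def weighted_mle(samples, weights):
    """Weighted maximum likelihood mean and covariance of `samples` (n x d)."""
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()                      # normalize weights to sum to one
    mean = w @ samples
    centered = samples - mean
    cov = (w[:, None] * centered).T @ centered
    return mean, cov


def regularize_covariance(cov_sample, cov_old, target_entropy, n_iter=60):
    """Blend (1 - lam) * cov_old + lam * cov_sample to reach `target_entropy`.

    Returns (cov_new, lam). The endpoint fallbacks are choices of this
    sketch, not something stated in the source.
    """
    def blend(lam):
        return (1.0 - lam) * cov_old + lam * cov_sample

    if gaussian_entropy(cov_sample) >= target_entropy:
        return cov_sample, 1.0           # sample estimate is not below target
    if gaussian_entropy(cov_old) <= target_entropy:
        return cov_old, 0.0              # assumed fallback: keep old covariance

    # Invariant: entropy(blend(lo)) >= target > entropy(blend(hi)).
    lo, hi = 0.0, 1.0
    for _ in range(n_iter):
        mid = 0.5 * (lo + hi)
        if gaussian_entropy(blend(mid)) >= target_entropy:
            lo = mid
        else:
            hi = mid
    return blend(lo), lo
```

In an iterative search loop, one would typically lower `target_entropy` by a small fixed amount per iteration, so that the search distribution shrinks on a schedule rather than at the pace dictated by a noisy sample estimate. That schedule is likewise an assumption of this sketch.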

Language: English

Type (Faculty Evaluation): Scientific

No. of pages: 6

Documents

No document associated with this publication was found.

Related Publications

By the same authors

Contextual Policy Search for Linear and Nonlinear Generalization of a Humanoid Walking Controller (2016)
Article in International Scientific Journal
Abdolmaleki, A; Lau, N; Reis, LP; Peters, J; Neumann, G

Stochastic Search In Changing Situations (2017)
Article in International Conference Proceedings
Abdolmaleki, A; Simões, DA; Lau, N; Reis, LP; Price, B; Neumann, G

Non-Parametric Contextual Stochastic Search (2016)
Article in International Conference Proceedings
Abdolmaleki, A; Lau, N; Reis, LP; Neumann, G

Model-Based Relative Entropy Stochastic Search (2016)
Article in International Conference Proceedings
Abdolmaleki, A; Lioutikov, R; Lau, N; Reis, LP; Peters, J; Neumann, G

Learning a Humanoid Kick with Controlled Distance (2016)
Article in International Conference Proceedings
Abdolmaleki, A; Simões, D; Lau, N; Reis, LP; Neumann, G
