Go to:
Logótipo
Comuta visibilidade da coluna esquerda
Você está em: Start > Publications > View > Regularized Covariance Estimation for Weighted Maximum Likelihood Policy Search Methods
Publication

Publications

Regularized Covariance Estimation for Weighted Maximum Likelihood Policy Search Methods

Title
Regularized Covariance Estimation for Weighted Maximum Likelihood Policy Search Methods
Type
Article in International Conference Proceedings Book
Year
2015
Authors
Abdolmaleki, A
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
lau, n
(Author)
FCUP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page Without ORCID
Neumann, G
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Conference proceedings International
Pages: 154-159
15th IEEE RAS International Conference on Humanoid Robots, Humanoids 2015
3 November 2015 through 5 November 2015
Other information
Authenticus ID: P-00K-AP3
Abstract (EN): Many episode-based (or direct) policy search algorithms, maintain a multivariate Gaussian distribution as search distribution over the parameter space of some objective function. One class of algorithms, such as episodic REPS, PoWER or PI2 uses, a weighted maximum likelihood estimate (WMLE) to update the mean and covariance matrix of this distribution in each iteration. However, due to high dimensionality of covariance matrices and limited number of samples, the WMLE is an unreliable estimator. The use of WMLE leads to overfitted covariance estimates, and, hence the variance/entropy of the search distribution decreases too quickly, which may cause premature convergence. In order to alleviate this problem, the estimated covariance matrix can be regularized in different ways, for example by using a convex combination of the diagonal covariance estimate and the sample covariance estimate. In this paper, we propose a new covariance matrix regularization technique for policy search methods that uses the convex combination of the sample covariance matrix and the old covariance matrix used in last iteration. The combination weighting is determined by specifying the desired entropy of the new search distribution. With this mechanism, the entropy of the search distribution can be gradually decreased without damage from the maximum likelihood estimate.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 6
Documents
We could not find any documents associated to the publication.
Related Publications

Of the same authors

Contextual Policy Search for Linear and Nonlinear Generalization of a Humanoid Walking Controller (2016)
Article in International Scientific Journal
Abdolmaleki, A; lau, n; reis, lp; Peters, J; Neumann, G
Stochastic Search In Changing Situations (2017)
Article in International Conference Proceedings Book
Abdolmaleki, A; Simães, DA; lau, n; reis, lp; Price, B; Neumann, G
Non-Parametric Contextual Stochastic Search (2016)
Article in International Conference Proceedings Book
Abdolmaleki, A; lau, n; reis, lp; Neumann, G
Model-Based Relative Entropy Stochastic Search (2016)
Article in International Conference Proceedings Book
Abdolmaleki, A; Lioutikov, R; lau, n; reis, lp; Peters, J; Neumann, G
Learning a Humanoid Kick with Controlled Distance (2016)
Article in International Conference Proceedings Book
Abdolmaleki, A; Simões, D; lau, n; reis, lp; Neumann, G

See all (10)

Recommend this page Top
Copyright 1996-2025 © Faculdade de Direito da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2025-08-05 at 19:33:08 | Privacy Policy | Personal Data Protection Policy | Whistleblowing