Kernelizing LSPE(λ)

Jung, T. and Polani, D. (2007) Kernelizing LSPE(λ). In: Procs of the 2007 Symposium on Approximate Dynamic Programming & Reinforcement Learning (ADPRL 2007). Institute of Electrical and Electronics Engineers (IEEE), pp. 338-345. ISBN 1-4244-0706-0

We propose the use of kernel-based methods as the underlying function approximator in the least-squares based policy evaluation framework of LSPE(λ) and LSTD(λ). In particular, we present the ‘kernelization’ of model-free LSPE(λ). The kernelization is made computationally feasible by the subset of regressors approximation, which approximates the kernel using a vastly reduced number of basis functions. The core of our proposed solution is an efficient recursive implementation with automatic supervised selection of the relevant basis functions. The LSPE method is well suited to optimistic policy iteration and can thus be used in the context of online reinforcement learning. We use the high-dimensional Octopus benchmark to demonstrate this.
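
As a rough illustration of the subset of regressors idea mentioned in the abstract, the following Python sketch approximates a kernel value k(x, y) by k_m(x)^T K_mm^{-1} k_m(y), using only a small set of m basis points. It is a minimal sketch under stated assumptions, not the paper's implementation: the Gaussian kernel, the hand-picked basis points, the jitter term and all function names (`rbf`, `solve`, `sr_kernel`) are hypothetical choices made for this example, and the recursive updates and automatic basis selection described in the abstract are not reproduced.

```python
import math

def rbf(x, y, width=1.0):
    """Gaussian kernel between two points (assumed kernel choice)."""
    sq = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-sq / (2.0 * width ** 2))

def solve(A, b):
    """Solve A z = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [list(row) + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    z = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * z[j] for j in range(i + 1, n))
        z[i] = (M[i][n] - s) / M[i][i]
    return z

def sr_kernel(x, y, basis, jitter=1e-10):
    """Subset of regressors approximation k_m(x)^T K_mm^{-1} k_m(y)."""
    # Kernel matrix over the m basis points; jitter is an assumed
    # numerical safeguard added to the diagonal.
    K_mm = [[rbf(bi, bj) + (jitter if i == j else 0.0)
             for j, bj in enumerate(basis)]
            for i, bi in enumerate(basis)]
    k_x = [rbf(b, x) for b in basis]  # kernel between basis and x
    k_y = [rbf(b, y) for b in basis]  # kernel between basis and y
    z = solve(K_mm, k_y)              # z = K_mm^{-1} k_m(y)
    return sum(a * c for a, c in zip(k_x, z))

# Hypothetical basis points, fixed by hand for this example.
basis = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
x, y = (0.5, 0.5), (0.2, 0.8)
exact = rbf(x, y)
approx = sr_kernel(x, y, basis)
print(exact, approx)
```

When one of the two arguments is itself a basis point, the approximation reproduces the exact kernel value up to the jitter. For other points the quality depends on how well the basis covers the input space, which is why the choice of basis functions matters.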

Item Type	Book Section
Date Deposited	15 May 2025 16:22
Last Modified	30 May 2025 23:10

902107.pdf (Published Version)
