City Research Online - Value-Gradient Learning

Fairbank, M. & Alonso, E. (2012). Value-Gradient Learning. Paper presented at the WCCI 2012 IEEE World Congress on Computational Intelligence, 10-06-2012 - 15-06-2012, Brisbane, Australia. doi: 10.1109/IJCNN.2012.6252791

Abstract

We describe an Adaptive Dynamic Programming algorithm, VGL(λ), for learning a critic function over a large continuous state space. The algorithm, which requires a learned model of the environment, extends Dual Heuristic Dynamic Programming to include a bootstrapping parameter analogous to that used in the reinforcement learning algorithm TD(λ). We provide on-line and batch mode implementations of the algorithm, and summarise its theoretical relationships to, and the motivations for using it over, its precursor algorithms Dual Heuristic Dynamic Programming and TD(λ). Experiments on control problems using a neural network and a greedy policy are provided.

Publication Type:	Conference or Workshop Item (Paper)
Additional Information:	© 2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
Publisher Keywords:	Value-Gradient Learning, Dual Heuristic Dynamic Programming, DHP, Adaptive Dynamic Programming
Subjects:	Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Departments:	School of Science & Technology > Computer Science

Text:	Accepted Version (334kB)

Creators:	Fairbank, M. and Alonso, E.
Event Title:	WCCI 2012 IEEE World Congress on Computational Intelligence
Event Type:	Conference
Event Location:	Brisbane, Australia
Event Dates:	10-06-2012 - 15-06-2012
Status:	Published
Refereed:	Yes
Publisher:	IEEE Press
URI:	https://openaccess.city.ac.uk/id/eprint/5205
Date available in CRO:	14 Jul 2015 14:34
Date deposited:	25 July 2017
Dates:	2012 (Published)