Simple and Fast Calculation of the Second-Order Gradients for Globalized Dual Heuristic Dynamic Programming in Neural Networks
Fairbank, M., Alonso, E. & Prokhorov, D. (2012). Simple and Fast Calculation of the Second-Order Gradients for Globalized Dual Heuristic Dynamic Programming in Neural Networks. IEEE Transactions on Neural Networks and Learning Systems, 23(10), pp. 1671-1676. doi: 10.1109/tnnls.2012.2205268
Abstract
We derive an algorithm to exactly calculate the mixed second-order derivatives of a neural network's output with respect to its input vector and weight vector. This calculation is necessary for the adaptive dynamic programming (ADP) algorithms globalized dual heuristic programming (GDHP) and value-gradient learning. The algorithm calculates the inner product of this second-order matrix with a given fixed vector in time that is linear in the number of weights in the neural network. We use a “forward accumulation” of the derivative calculations, which produces a much more elegant and easy-to-implement solution than has previously been published for this task. In doing so, the algorithm makes GDHP simple to implement and efficient, bridging the gap between the widely used DHP and GDHP ADP methods.
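As a concrete illustration of the quantity the abstract describes — the mixed second-order matrix ∂²J/∂w∂x of a scalar critic output J(x; w), contracted with a given fixed vector v at a cost linear in the number of weights — the sketch below uses JAX's built-in forward- and reverse-mode automatic differentiation. This is not the paper's hand-derived forward-accumulation recursion: the critic architecture, parameter names, and the reverse-over-forward composition are illustrative assumptions that achieve the same linear cost.

```python
import jax
import jax.numpy as jnp

def critic(w, x):
    # Toy single-hidden-layer critic J(x; w). The paper's algorithm targets
    # general feedforward networks; this hypothetical architecture is only
    # for illustration.
    h = jnp.tanh(w["W1"] @ x + w["b1"])
    return jnp.dot(w["w2"], h)

def mixed_second_order_vp(w, x, v):
    # Step 1 (forward mode): one jvp pass gives the scalar
    # s(w) = (dJ/dx) . v, the directional derivative of the
    # output along the fixed vector v.
    def directional_derivative(w_):
        _, s = jax.jvp(lambda x_: critic(w_, x_), (x,), (v,))
        return s
    # Step 2 (reverse mode): one gradient pass over s(w) yields
    # (d^2 J / dw dx) v at a small constant multiple of the cost of
    # one network evaluation, i.e. linear in the number of weights.
    return jax.grad(directional_derivative)(w)

key1, key2 = jax.random.split(jax.random.PRNGKey(0))
n_in, n_hid = 4, 8
w = {"W1": jax.random.normal(key1, (n_hid, n_in)),
     "b1": jnp.zeros(n_hid),
     "w2": jax.random.normal(key2, (n_hid,))}
x = jnp.ones(n_in)
v = jnp.arange(1.0, n_in + 1.0)   # the given fixed vector
print(mixed_second_order_vp(w, x, v))
```

Composing forward over reverse (or, as here, reverse over forward) is one of several ways modern autodiff attains the linear cost; the paper instead derives an explicit forward-accumulation recursion that is propagated alongside the ordinary backpropagation pass.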
Publication Type: | Article
---|---
Additional Information: | (c) 2012 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works.
Publisher Keywords: | Neural Networks, Adaptive Dynamic Programming, Dual Heuristic Programming
Subjects: | Q Science > QA Mathematics > QA75 Electronic computers. Computer science
Departments: | School of Science & Technology > Computer Science