Please use this identifier to cite or link to this item: http://dspace.mediu.edu.my:8181/xmlui/handle/1721.1/6718
Full metadata record
dc.creator: Martin, Martin C.
dc.date: 2004-10-08T20:38:57Z
dc.date: 2004-10-08T20:38:57Z
dc.date: 2003-05-01
dc.date.accessioned: 2013-10-09T02:46:33Z
dc.date.available: 2013-10-09T02:46:33Z
dc.date.issued: 2013-10-09
dc.identifier: AIM-2003-014
dc.identifier: http://hdl.handle.net/1721.1/6718
dc.identifier.uri: http://koha.mediu.edu.my:8181/xmlui/handle/1721
dc.description: This paper presents a novel algorithm for learning in a class of stochastic Markov decision processes (MDPs) with continuous state and action spaces that trades speed for accuracy. A transform of the stochastic MDP into a deterministic one is presented which captures the essence of the original dynamics, in a sense made precise. In this transformed MDP, the calculation of values is greatly simplified. The online algorithm estimates the model of the transformed MDP and simultaneously does policy search against it. Bounds on the error of this approximation are proven, and experimental results in a bicycle riding domain are presented. The algorithm learns near-optimal policies in orders of magnitude fewer interactions with the stochastic MDP, using less domain knowledge. All code used in the experiments is available on the project's web site.
dc.format: 12 p.
dc.format: 1085830 bytes
dc.format: 303781 bytes
dc.format: application/postscript
dc.format: application/pdf
dc.language: en_US
dc.relation: AIM-2003-014
dc.subject: AI
dc.subject: Reinforcement learning
dc.subject: bicycle
dc.subject: policy search
dc.subject: Markov decision processes
dc.title: The Essential Dynamics Algorithm: Essential Results
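The abstract describes the approach only at a high level: transform the stochastic MDP into a deterministic one that captures its essential dynamics, then run policy search against the cheap deterministic model. The sketch below illustrates that general shape, not the paper's actual construction — the 1-D point-mass dynamics, the mean-dynamics transform, and the hill-climbing search are all illustrative assumptions.

```python
import random

# Hypothetical stand-in for a stochastic continuous MDP (the paper's domain
# is bicycle riding): the agent pushes a point toward the origin, and each
# transition is perturbed by Gaussian noise.
def stochastic_step(s, a, rng):
    return s + a + rng.gauss(0.0, 0.5)

# Illustrative "deterministic transform" (an assumption here): replace the
# stochastic transition with its expectation, giving a deterministic MDP
# whose rollouts are cheap to evaluate.
def deterministic_step(s, a):
    return s + a  # E[s + a + noise] = s + a

def reward(s):
    return -s * s  # quadratic penalty for distance from the origin

# Policy: a single-parameter linear controller a = -k * s.
def rollout_return(k, step, horizon=20, s0=5.0, rng=None):
    s, total = s0, 0.0
    for _ in range(horizon):
        a = -k * s
        s = step(s, a, rng) if rng is not None else step(s, a)
        total += reward(s)
    return total

# Policy search against the deterministic model: crude hill climbing on k,
# accepting a perturbed gain only when the deterministic return improves.
def policy_search(iters=100, seed=0):
    rng = random.Random(seed)
    k = 0.0
    best = rollout_return(k, deterministic_step)
    for _ in range(iters):
        cand = k + rng.gauss(0.0, 0.1)
        val = rollout_return(cand, deterministic_step)
        if val > best:
            k, best = cand, val
    return k

k = policy_search()
print(k)  # the deterministic optimum of this toy model is k = 1
```

A policy found this way can then be checked back on the original stochastic MDP, e.g. `rollout_return(k, stochastic_step, rng=random.Random(1))`; the paper's contribution is bounding the error introduced by evaluating against the transformed model instead.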
Appears in Collections: MIT Items

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.