The Essential Dynamics Algorithm: Essential Results

DSpace Home
→
Harvested articles مقالات مستوردة من مؤسسات وجامعات عالمية
→
MIT Items
→
View Item

dc.creator	Martin, Martin C.
dc.date	2004-10-08T20:38:57Z
dc.date	2004-10-08T20:38:57Z
dc.date	2003-05-01
dc.date.accessioned	2013-10-09T02:46:33Z
dc.date.available	2013-10-09T02:46:33Z
dc.date.issued	2013-10-09
dc.identifier	AIM-2003-014
dc.identifier	http://hdl.handle.net/1721.1/6718
dc.identifier.uri	http://koha.mediu.edu.my:8181/xmlui/handle/1721
dc.description	This paper presents a novel algorithm for learning in a class of stochastic Markov decision processes (MDPs) with continuous state and action spaces that trades speed for accuracy. A transform of the stochastic MDP into a deterministic one is presented which captures the essence of the original dynamics, in a sense made precise. In this transformed MDP, the calculation of values is greatly simplified. The online algorithm estimates the model of the transformed MDP and simultaneously does policy search against it. Bounds on the error of this approximation are proven, and experimental results in a bicycle riding domain are presented. The algorithm learns near optimal policies in orders of magnitude fewer interactions with the stochastic MDP, using less domain knowledge. All code used in the experiments is available on the project's web site.
dc.format	12 p.
dc.format	1085830 bytes
dc.format	303781 bytes
dc.format	application/postscript
dc.format	application/pdf
dc.language	en_US
dc.relation	AIM-2003-014
dc.subject	AI
dc.subject	Reinforcement learning
dc.subject	bicycle
dc.subject	policy search
dc.subject	markov decision processes
dc.title	The Essential Dynamics Algorithm: Essential Results

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

MIT Items

Show simple item record

Search DSpace

Advanced Search

Browse

All of DSpace
This Collection
- By Issue Date
- Authors
- Titles
- Subjects

The Essential Dynamics Algorithm: Essential Results

Files in this item

This item appears in the following Collection(s)

Search DSpace

Browse

All of DSpace

This Collection

My Account