DSpace Repository

The Essential Dynamics Algorithm: Essential Results

Show simple item record

dc.creator Martin, Martin C.
dc.date 2004-10-08T20:38:57Z
dc.date 2004-10-08T20:38:57Z
dc.date 2003-05-01
dc.date.accessioned 2013-10-09T02:46:33Z
dc.date.available 2013-10-09T02:46:33Z
dc.date.issued 2013-10-09
dc.identifier AIM-2003-014
dc.identifier http://hdl.handle.net/1721.1/6718
dc.identifier.uri http://koha.mediu.edu.my:8181/xmlui/handle/1721
dc.description This paper presents a novel algorithm for learning in a class of stochastic Markov decision processes (MDPs) with continuous state and action spaces that trades speed for accuracy. A transform of the stochastic MDP into a deterministic one is presented which captures the essence of the original dynamics, in a sense made precise. In this transformed MDP, the calculation of values is greatly simplified. The online algorithm estimates the model of the transformed MDP and simultaneously does policy search against it. Bounds on the error of this approximation are proven, and experimental results in a bicycle riding domain are presented. The algorithm learns near optimal policies in orders of magnitude fewer interactions with the stochastic MDP, using less domain knowledge. All code used in the experiments is available on the project's web site.
dc.format 12 p.
dc.format 1085830 bytes
dc.format 303781 bytes
dc.format application/postscript
dc.format application/pdf
dc.language en_US
dc.relation AIM-2003-014
dc.subject AI
dc.subject Reinforcement learning
dc.subject bicycle
dc.subject policy search
dc.subject markov decision processes
dc.title The Essential Dynamics Algorithm: Essential Results


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account