000 | 03030nam a2200505 i 4500 | ||
---|---|---|---|
001 | 6267343 | ||
003 | IEEE | ||
005 | 20220712204635.0 | ||
006 | m o d | ||
007 | cr |n||||||||| | ||
008 | 151223s1998 maua ob 001 eng d | ||
010 | _z 97026416 (print) | ||
020 |
_z9780262193986 _qprint |
||
020 |
_a9780262257053 _qelectronic |
||
020 |
_z0262193981 _qalk. paper |
||
035 | _a(CaBNVSL)mat06267343 | ||
035 | _a(IDAMS)0b000064818b431d | ||
040 |
_aCaBNVSL _beng _erda _cCaBNVSL _dCaBNVSL |
||
050 | 4 |
_aQ325.6 _b.S88 1998eb |
|
082 | 0 | 0 |
_a006.3/1 _221 |
100 | 1 |
_aSutton, Richard S., _eauthor. _922249 |
|
245 | 1 | 0 |
_aReinforcement learning : _ban introduction / _cRichard S. Sutton and Andrew G. Barto. |
264 | 1 |
_aCambridge, Massachusetts : _bMIT Press, _cc1998. |
|
264 | 2 |
_a[Piscataqay, New Jersey] : _bIEEE Xplore, _c[1998] |
|
300 |
_a1 PDF (xviii, 322 pages) : _billustrations. |
||
336 |
_atext _2rdacontent |
||
337 |
_aelectronic _2isbdmedia |
||
338 |
_aonline resource _2rdacarrier |
||
490 | 1 | _aAdaptive computation and machine learning series | |
504 | _aIncludes bibliographical references (p. [291]-312) and index. | ||
506 | 1 | _aRestricted to subscribers or individual electronic text purchasers. | |
520 | _aReinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives when interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability.The book is divided into three parts. Part I defines the reinforcement learning problem in terms of Markov decision processes. Part II provides basic solution methods: dynamic programming, Monte Carlo methods, and temporal-difference learning. Part III presents a unified view of the solution methods and incorporates artificial neural networks, eligibility traces, and planning; the two final chapters present case studies and consider the future of reinforcement learning. | ||
530 | _aAlso available in print. | ||
538 | _aMode of access: World Wide Web | ||
588 | _aDescription based on PDF viewed 12/23/2015. | ||
650 | 0 |
_aReinforcement learning. _99427 |
|
655 | 0 |
_aElectronic books. _93294 |
|
700 | 1 |
_aBarto, Andrew G. _922250 |
|
710 | 2 |
_aIEEE Xplore (Online Service), _edistributor. _922251 |
|
710 | 2 |
_aMIT Press, _epublisher. _922252 |
|
776 | 0 | 8 |
_iPrint version _z9780262193986 |
830 | 0 |
_aAdaptive computation and machine learning series _921885 |
|
856 | 4 | 2 |
_3Abstract with links to resource _uhttps://ieeexplore.ieee.org/xpl/bkabstractplus.jsp?bkn=6267343 |
942 | _cEBK | ||
999 |
_c72998 _d72998 |