MARC View

000			03030nam a2200505 i 4500
001			6267343
003			IEEE
005			20220712204635.0
006			m o d
007			cr |n|||||||||
008			151223s1998 maua ob 001 eng d
010			_z 97026416 (print)
020			_z9780262193986 _qprint
020			_a9780262257053 _qelectronic
020			_z0262193981 _qalk. paper
035			_a(CaBNVSL)mat06267343
035			_a(IDAMS)0b000064818b431d
040			_aCaBNVSL _beng _erda _cCaBNVSL _dCaBNVSL
050		4	_aQ325.6 _b.S88 1998eb
082	0	0	_a006.3/1 _221
100	1		_aSutton, Richard S., _eauthor. _922249
245	1	0	_aReinforcement learning : _ban introduction / _cRichard S. Sutton and Andrew G. Barto.
264		1	_aCambridge, Massachusetts : _bMIT Press, _cc1998.
264		2	_a[Piscataway, New Jersey] : _bIEEE Xplore, _c[1998]
300			_a1 PDF (xviii, 322 pages) : _billustrations.
336			_atext _2rdacontent
337			_aelectronic _2isbdmedia
338			_aonline resource _2rdacarrier
490	1		_aAdaptive computation and machine learning series
504			_aIncludes bibliographical references (p. [291]-312) and index.
506	1		_aRestricted to subscribers or individual electronic text purchasers.
520			_aReinforcement learning, one of the most active research areas in artificial intelligence, is a computational approach to learning whereby an agent tries to maximize the total amount of reward it receives when interacting with a complex, uncertain environment. In Reinforcement Learning, Richard Sutton and Andrew Barto provide a clear and simple account of the key ideas and algorithms of reinforcement learning. Their discussion ranges from the history of the field's intellectual foundations to the most recent developments and applications. The only necessary mathematical background is familiarity with elementary concepts of probability. The book is divided into three parts. Part I defines the reinforcement learning problem in terms of Markov decision processes. Part II provides basic solution methods: dynamic programming, Monte Carlo methods, and temporal-difference learning. Part III presents a unified view of the solution methods and incorporates artificial neural networks, eligibility traces, and planning; the two final chapters present case studies and consider the future of reinforcement learning.
530			_aAlso available in print.
538			_aMode of access: World Wide Web
588			_aDescription based on PDF viewed 12/23/2015.
650		0	_aReinforcement learning. _99427
655		0	_aElectronic books. _93294
700	1		_aBarto, Andrew G. _922250
710	2		_aIEEE Xplore (Online Service), _edistributor. _922251
710	2		_aMIT Press, _epublisher. _922252
776	0	8	_iPrint version _z9780262193986
830		0	_aAdaptive computation and machine learning series _921885
856	4	2	_3Abstract with links to resource _uhttps://ieeexplore.ieee.org/xpl/bkabstractplus.jsp?bkn=6267343
942			_cEBK
999			_c72998 _d72998