Algorithms for Reinforcement Learning (Record no. 85289)

000 -LEADER
fixed length control field 03369nam a22005295i 4500
001 - CONTROL NUMBER
control field 978-3-031-01551-9
005 - DATE AND TIME OF LATEST TRANSACTION
control field 20240730164105.0
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION
fixed length control field 220601s2010 sz | s |||| 0|eng d
020 ## - INTERNATIONAL STANDARD BOOK NUMBER
ISBN 9783031015519
-- 978-3-031-01551-9
082 04 - CLASSIFICATION NUMBER
Call Number 006.3
100 1# - AUTHOR NAME
Author Szepesvári, Csaba.
245 10 - TITLE STATEMENT
Title Algorithms for Reinforcement Learning
250 ## - EDITION STATEMENT
Edition statement 1st ed. 2010.
300 ## - PHYSICAL DESCRIPTION
Number of Pages XIII, 89 p.
490 1# - SERIES STATEMENT
Series statement Synthesis Lectures on Artificial Intelligence and Machine Learning,
505 0# - FORMATTED CONTENTS NOTE
Remark 2 Markov Decision Processes -- Value Prediction Problems -- Control -- For Further Exploration.
520 ## - SUMMARY, ETC.
Summary, etc Reinforcement learning is a learning paradigm concerned with learning to control a system so as to maximize a numerical performance measure that expresses a long-term objective. What distinguishes reinforcement learning from supervised learning is that only partial feedback is given to the learner about the learner's predictions. Further, the predictions may have long term effects through influencing the future state of the controlled system. Thus, time plays a special role. The goal in reinforcement learning is to develop efficient learning algorithms, as well as to understand the algorithms' merits and limitations. Reinforcement learning is of great interest because of the large number of practical applications that it can be used to address, ranging from problems in artificial intelligence to operations research or control engineering. In this book, we focus on those algorithms of reinforcement learning that build on the powerful theory of dynamic programming. We give a fairly comprehensive catalog of learning problems, describe the core ideas, note a large number of state of the art algorithms, followed by the discussion of their theoretical properties and limitations. Table of Contents: Markov Decision Processes / Value Prediction Problems / Control / For Further Exploration.
856 40 - ELECTRONIC LOCATION AND ACCESS
Uniform Resource Identifier https://doi.org/10.1007/978-3-031-01551-9
942 ## - ADDED ENTRY ELEMENTS (KOHA)
Koha item type eBooks
264 #1 -
-- Cham :
-- Springer International Publishing :
-- Imprint: Springer,
-- 2010.
336 ## -
-- text
-- txt
-- rdacontent
337 ## -
-- computer
-- c
-- rdamedia
338 ## -
-- online resource
-- cr
-- rdacarrier
347 ## -
-- text file
-- PDF
-- rda
650 #0 - SUBJECT ADDED ENTRY--SUBJECT 1
-- Artificial intelligence.
650 #0 - SUBJECT ADDED ENTRY--SUBJECT 1
-- Machine learning.
650 #0 - SUBJECT ADDED ENTRY--SUBJECT 1
-- Neural networks (Computer science) .
650 14 - SUBJECT ADDED ENTRY--SUBJECT 1
-- Artificial Intelligence.
650 24 - SUBJECT ADDED ENTRY--SUBJECT 1
-- Machine Learning.
650 24 - SUBJECT ADDED ENTRY--SUBJECT 1
-- Mathematical Models of Cognitive Processes and Neural Networks.
830 #0 - SERIES ADDED ENTRY--UNIFORM TITLE
-- 1939-4616
912 ## -
-- ZDB-2-SXSC

No items available.