MARC Kaya University Bunseong Library

MARC view
LDR		03293cmm u2200577Ii 4500
001		000000312438
003		OCoLC
005		20230525152248
006		m d
007		cr unu||||||||
008		180731s2018 enka ob 001 0 eng d
019		▼a 1042318736
020		▼a 9781788839303
020		▼a 1788839307
020		▼z 9781788834247
035		▼a 1837369 ▼b (N$T)
035		▼a (OCoLC)1046682461 ▼z (OCoLC)1042318736
037		▼a CL0500000982 ▼b Safari Books Online
040		▼a UMI ▼b eng ▼e rda ▼e pn ▼c UMI ▼d STF ▼d TOH ▼d OCLCF ▼d EBLCP ▼d N$T ▼d MERUC ▼d ZCU ▼d NLE ▼d TEFOD ▼d CEF ▼d 248032
049		▼a MAIN
050	4	▼a Q325.5
072	7	▼a COM ▼x 000000 ▼2 bisacsh
082	04	▼a 006.31 ▼2 23
100	1	▼a Lapan, Maxim, ▼e author.
245	10	▼a Deep reinforcement learning hands-on : ▼b apply modern RL methods, with deep Q-networks, value iteration, policy gradients, TRPO, AlphaGo Zero and more / ▼c Maxim Lapan.
260		▼a Birmingham, UK : ▼b Packt Publishing, ▼c 2018.
300		▼a 1 online resource (1 volume) : ▼b illustrations
336		▼a text ▼b txt ▼2 rdacontent
337		▼a computer ▼b c ▼2 rdamedia
338		▼a online resource ▼b cr ▼2 rdacarrier
500		▼a "Expert insight."
504		▼a Includes bibliographical references and index.
505	0	▼a What is Reinforcement Learning? -- OpenAI Gym -- Deep Learning with PyTorch -- The Cross-Entropy Method -- Tabular Learning and the Bellman Equation -- Deep Q-Networks -- DQN Extensions -- Stocks Trading Using RL -- Policy Gradients - An Alternative -- The Actor-Critic Method -- Asynchronous Advantage Actor-Critic -- Chatbots Training with RL -- Web Navigation -- Continuous Action Space -- Trust Regions - TRPO, PPO, and ACKTR -- Black-Box Optimization in RL -- Beyond Model-Free - Imagination -- AlphaGo Zero.
520		▼a This book is a practical, developer-oriented introduction to deep reinforcement learning (RL). Explore the theoretical concepts of RL, before discovering how deep learning (DL) methods and tools are making it possible to solve more complex and challenging problems than ever before. Apply deep RL methods to training your agent to beat arcade ...
588		▼a Description based on online resource; title from cover (Safari, viewed July 30, 2018).
590		▼a Master record variable field(s) change: 072, 082 - OCLC control number change
650	0	▼a Reinforcement learning.
650	0	▼a Machine learning.
650	0	▼a Natural language processing (Computer science)
650	0	▼a Artificial intelligence.
650	7	▼a Artificial intelligence. ▼2 fast ▼0 (OCoLC)fst00817247
650	7	▼a Machine learning. ▼2 fast ▼0 (OCoLC)fst01004795
650	7	▼a Natural language processing (Computer science) ▼2 fast ▼0 (OCoLC)fst01034365
650	7	▼a Reinforcement learning. ▼2 fast ▼0 (OCoLC)fst01732553
650	7	▼a COMPUTERS / General. ▼2 bisacsh
655	4	▼a Electronic books.
776	08	▼i Print version: ▼a Lapan, Maxim ▼t Deep Reinforcement Learning Hands-On : Apply Modern RL Methods, with Deep Q-Networks, Value Iteration, Policy Gradients, TRPO, AlphaGo Zero and More ▼d Birmingham : Packt Publishing Ltd, c2018 ▼z 9781788834247
856	40	▼3 EBSCOhost ▼u http://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&db=nlabk&AN=1837369
938		▼a EBL - Ebook Library ▼b EBLB ▼n EBL5434975
938		▼a EBSCOhost ▼b EBSC ▼n 1837369
990		▼a Administrator
994		▼a 92 ▼b N$T