Kaya University Bunsung Library

Python Reinforcement Learning : Solve Complex Real-World Problems by Mastering Reinforcement Learning Algorithms Using OpenAI Gym and TensorFlow

Detailed information
Material type	E-Book
Personal authors	Ravichandiran, Sudharsan, author. Saito, Sean, author. Shanmugamani, Rajalingappaa, author. Wenzhuo, Yang, author.
Title/statement of responsibility	Python Reinforcement Learning : Solve Complex Real-World Problems by Mastering Reinforcement Learning Algorithms Using OpenAI Gym and TensorFlow / Sudharsan Ravichandiran, Sean Saito, Rajalingappaa Shanmugamani and Yang Wenzhuo.
Publication	Birmingham : Packt Publishing, Limited, 2019.
Physical description	1 online resource (484 pages)
Local holdings note	Master record variable field(s) change: 050, 082, 650
ISBN	1838640142 9781838640149
General note	Implementation of the Atari emulator
Contents note	Cover; Title Page; Copyright and Credits; About Packt; Contributors; Table of Contents; Preface; Chapter 1: Introduction to Reinforcement Learning; What is RL?; RL algorithm; How RL differs from other ML paradigms; Elements of RL; Agent; Policy function; Value function; Model; Agent environment interface; Types of RL environment; Deterministic environment; Stochastic environment; Fully observable environment; Partially observable environment; Discrete environment; Continuous environment; Episodic and non-episodic environment; Single and multi-agent environment; RL platforms; OpenAI Gym and Universe; DeepMind Lab; RL-Glue; Project Malmo; ViZDoom; Applications of RL; Education; Medicine and healthcare; Manufacturing; Inventory management; Finance; Natural Language Processing and Computer Vision; Summary; Questions; Further reading; Chapter 2: Getting Started with OpenAI and TensorFlow; Setting up your machine; Installing Anaconda; Installing Docker; Installing OpenAI Gym and Universe; Common error fixes; OpenAI Gym; Basic simulations; Training a robot to walk; OpenAI Universe; Building a video game bot; TensorFlow; Variables, constants, and placeholders; Variables; Constants; Placeholders; Computation graph; Sessions; TensorBoard; Adding scope; Summary; Questions; Further reading; Chapter 3: The Markov Decision Process and Dynamic Programming; The Markov chain and Markov process; Markov Decision Process; Rewards and returns; Episodic and continuous tasks; Discount factor; The policy function; State value function; State-action value function (Q function); The Bellman equation and optimality; Deriving the Bellman equation for value and Q functions; Solving the Bellman equation; Dynamic programming; Value iteration; Policy iteration; Solving the frozen lake problem; Value iteration; Policy iteration; Summary; Questions; Further reading; Chapter 4: Gaming with Monte Carlo Methods; Monte Carlo methods; Estimating the value of pi using Monte Carlo; Monte Carlo prediction; First visit Monte Carlo; Every visit Monte Carlo; Let's play Blackjack with Monte Carlo; Monte Carlo control; Monte Carlo exploration starts; On-policy Monte Carlo control; Off-policy Monte Carlo control; Summary; Questions; Further reading; Chapter 5: Temporal Difference Learning; TD learning; TD prediction; TD control; Q learning; Solving the taxi problem using Q learning; SARSA; Solving the taxi problem using SARSA; The difference between Q learning and SARSA; Summary; Questions; Further reading; Chapter 6: Multi-Armed Bandit Problem; The MAB problem; The epsilon-greedy policy; The softmax exploration algorithm; The upper confidence bound algorithm; The Thompson sampling algorithm; Applications of MAB; Identifying the right advertisement banner using MAB; Contextual bandits; Summary; Questions; Further reading; Chapter 7: Playing Atari Games; Introduction to Atari games; Building an Atari emulator; Getting started
Summary	Reinforcement learning and deep reinforcement learning are the trending and most promising branches of artificial intelligence. This Learning Path will enable you to master not only the basic reinforcement learning algorithms but also the advanced deep reinforcement learning algorithms and their limitations.
Subject headings	Python (Computer program language); Reinforcement learning.
Language	English
Other format entry	Print version: Ravichandiran, Sudharsan. Python Reinforcement Learning : Solve Complex Real-World Problems by Mastering Reinforcement Learning Algorithms Using OpenAI Gym and TensorFlow. Birmingham : Packt Publishing, Limited, ©2019. 9781838649777
Direct loan link	http://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&db=nlabk&AN=2108228

Holdings information
No.	Registration no.	Call number	Location	Status	Due date	Reservation	Service	Media information
1	WE00017094	006.31	Kaya University/E-book server (computer server)/	Available for loan

Gimhae Campus | 621-748 | 208 Samgye-ro, Gimhae-si, Gyeongnam | TEL:055-330-1033 | FAX:055-330-1032
Copyright 2012 by Kaya University Bunsung Library. All rights reserved.