Kaya University Bunsung Library

TensorFlow Reinforcement Learning Quick Start Guide : Get up and Running with Training and Deploying Intelligent, Self-Learning Agents Using Python

Detailed information
Material type	E-Book
Personal author	Balakrishnan, Kaushik.
Title/Statement of responsibility	TensorFlow Reinforcement Learning Quick Start Guide : Get up and Running with Training and Deploying Intelligent, Self-Learning Agents Using Python.
Publication	Birmingham : Packt Publishing Ltd, 2019.
Physical description	1 online resource (175 pages)
Local holdings note	Added to collection customer.56279.3
ISBN	1789533449 9781789533446
General note	The A3C algorithm applied to LunarLander
Contents note	Cover; Title Page; Copyright and Credits; Dedication; About Packt; Contributors; Table of Contents; Preface; Chapter 1: Up and Running with Reinforcement Learning; Why RL?; Formulating the RL problem; The relationship between an agent and its environment; Defining the states of the agent; Defining the actions of the agent; Understanding policy, value, and advantage functions; Identifying episodes; Identifying reward functions and the concept of discounted rewards; Rewards; Learning the Markov decision process; Defining the Bellman equation; On-policy versus off-policy learning; On-policy method; Off-policy method; Model-free and model-based training; Algorithms covered in this book; Summary; Questions; Further reading; Chapter 2: Temporal Difference, SARSA, and Q-Learning; Technical requirements; Understanding TD learning; Relation between the value functions and state; Understanding SARSA and Q-Learning; Learning SARSA; Understanding Q-learning; Cliff walking and grid world problems; Cliff walking with SARSA; Cliff walking with Q-learning; Grid world with SARSA; Summary; Further reading; Chapter 3: Deep Q-Network; Technical requirements; Learning the theory behind a DQN; Understanding target networks; Learning about replay buffer; Getting introduced to the Atari environment; Summary of Atari games; Pong; Breakout; Space Invaders; LunarLander; The Arcade Learning Environment; Coding a DQN in TensorFlow; Using the model.py file; Using the funcs.py file; Using the dqn.py file; Evaluating the performance of the DQN on Atari Breakout; Summary; Questions; Further reading; Chapter 4: Double DQN, Dueling Architectures, and Rainbow; Technical requirements; Understanding Double DQN; Coding DDQN and training to play Atari Breakout; Evaluating the performance of DDQN on Atari Breakout; Understanding dueling network architectures; Coding dueling network architecture and training it to play Atari Breakout; Combining V and A to obtain Q; Evaluating the performance of dueling architectures on Atari Breakout; Understanding Rainbow networks; DQN improvements; Prioritized experience replay; Multi-step learning; Distributional RL; Noisy nets; Running a Rainbow network on Dopamine; Rainbow using Dopamine; Summary; Questions; Further reading; Chapter 5: Deep Deterministic Policy Gradient; Technical requirements; Actor-Critic algorithms and policy gradients; Policy gradient; Deep Deterministic Policy Gradient; Coding ddpg.py; Coding AandC.py; Coding TrainOrTest.py; Coding replay_buffer.py; Training and testing the DDPG on Pendulum-v0; Summary; Questions; Further reading; Chapter 6: Asynchronous Methods -- A3C and A2C; Technical requirements; The A3C algorithm; Loss functions; CartPole and LunarLander; CartPole; LunarLander; The A3C algorithm applied to CartPole; Coding cartpole.py; Coding a3c.py; The AC class; The Worker() class; Coding utils.py; Training on CartPole
Summary	This book is an essential guide for anyone interested in Reinforcement Learning. The book provides an actionable reference for Reinforcement Learning algorithms and their applications using TensorFlow and Python. It will help readers leverage the power of algorithms such as Deep Q-Network (DQN), Deep Deterministic Policy Gradients (DDPG), and ...
Subject headings	Python (Computer program language). Artificial intelligence. Machine learning.
Language	English
Other format entry	Print version: Balakrishnan, Kaushik. TensorFlow Reinforcement Learning Quick Start Guide : Get up and Running with Training and Deploying Intelligent, Self-Learning Agents Using Python. Birmingham : Packt Publishing Ltd, ©2019. ISBN 9781789533583
Direct access link	http://search.ebscohost.com/login.aspx?direct=true&scope=site&db=nlebk&db=nlabk&AN=2094787

Holdings information

No.	Registration no.	Call number	Location	Status	Due date	Reservation	Service	Media information
1	WE00017076	005.133	Kaya University / E-book server (computer server)	Available

Gimhae Campus | 621-748 | 208 Samgye-ro, Gimhae-si, Gyeongsangnam-do | TEL: 055-330-1033 | FAX: 055-330-1032
Copyright 2012 by Kaya University Bunsung Library. All rights reserved.