[Reinforcement learning]#1. Introduction

👩‍💻LEARN : ML&Data/Lecture

[Reinforcement learning]#1. Introduction

쟈니유 2023. 3. 30. 16:19

728x90

#1. Introduction

▶️ What is reinforcement learning

특정 State 에 따라 rewards를 정적강화(+n)/부적강화(-m) 을 세팅해서 자동으로 good action으로 행동하게 하는 것

▶️ Mars rover example

(s,a,R(s),s') = state, action, rewards, updated state after take action

▶️ The return in reinforcement learning

Discount factor (감마) : 이동(action)에 대한 비용을 계산하는 것 . 증권에서는 돈의 가치 하락 등을 반영함.

State에 따라 행동에 따른 return 값이 다르므로 이를 행동 가이드에 반영할 수도 있음

To summarize, the return in reinforcement learning is the sum of the rewards that the system gets,
weighted by the discount factor, where rewards in the far future are weighted by the discount factor raised to a higher power.

▶️ Making decisions: Policies in reinforcement learning

Policy(pi)

pi(state) = action

Goal

Find a policy pi that tells you what action to take in every state so as to maximize the return

Markov Decision Process (MPD)

'👩‍💻LEARN : ML&Data > Lecture' 카테고리의 다른 글

[알고리즘 구현으로 배우는 선형대수] #11. 직교 행렬 (0)	2023.04.13
[Reinforcement learning]#2. State-action value function & #3. Continuous state spaces (0)	2023.03.30
[Unsupervised Learning, Recommenders, Reinforcement Learning] #4. Content-based filtering (0)	2023.03.30
[Unsupervised Learning, Recommenders, Reinforcement Learning] #3. Collaborative Filtering (0)	2023.03.30
[Unsupervised Learning, Recommenders, Reinforcement Learning] #2. Anomaly detection (0)	2023.03.28

현재글[Reinforcement learning]#1. Introduction

전방에 정체가 있어 새로운 길로 안내합니다

딥러닝, 미분, 머신러닝을위한수학, coursera, 컨볼루션, 선형회귀, 프로퇴사러, 문과생살아남기, 노잼, 적분, 지도학습, neural network, 코세라, HR Analytics, 경사하강법, HRD, People Analytics, 지금은 개념만 우겨넣자..우선..., 7차교육과정은 미적분을 안배웠어요, 비지도학습,

Today :
Yesterday :

일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

낡고 지친 회사원