Skip to content

Latest commit

 

History

History
37 lines (30 loc) · 1.99 KB

README.md

File metadata and controls

37 lines (30 loc) · 1.99 KB

Reinforcement-learning (In-progress)

I'm trying to implement some reinforcement-learning algorithms. Most of my implementation based on three lectures below:

My codes are like a rewrite from Denny Britz's Repo, But because I can't write such a beautiful code like he does yet :( So I try to implement many of it by myself ;)

Table of contents

0. Environment

1. Dynamic Programming

  • Grid World (Environment, DP-Policy Evaluation, DP-Policy Iteration, DP-Value Iteration)

2. Simple Model-Free

3. Eligibility Traces