Reinforcement Learning Posted on 2021-03-22 | In AI Planning and Decision Making Words count in article: 40 | Reading time ≈ 1 RL: figure out what leads to good result / bad result do something to get the good result Adaptive Dynamic Programming: Learn the model (transition & reward function): supervised learning policy evaluation TD’s advantage over ADP:no need for simulator? Post author: HUANG Liu Post link: https://huangliu0909.github.io/2021/03/22/AI_RL/ Copyright Notice: All articles in this blog are licensed under CC BY-NC-SA 3.0 unless stating additionally.