Deep RL 1 Introduction


These are my notes for CS285, Deep Reinforcement Learning, at UC Berkeley.

The course intro raises a few questions that interest me:

Why ‘Deep’ RL?

Deep learning can handle unstructured environments and complex sensory input, and it adaptively selects features for the task at hand.

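What does it mean for learning to be not end-to-end in RL?

Not end-to-end means that the recognition part (understanding what is happening) and the control part (deciding what action to take) are separate. In end-to-end learning, a single model maps raw sensory input directly to actions, so the features it extracts are shaped by the task.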
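To make the distinction concrete, here is a minimal sketch of the two designs. Everything in it is a hypothetical toy, not something from the course: a 1-D "image" with one bright pixel marking a target, an agent at the centre, and the function names `perceive`, `control`, `pipeline_policy`, and `end_to_end_policy`. The weights are hand-picked stand-ins for what a trained network would learn.

```python
# Toy setup (hypothetical): a 1-D "image" with one bright pixel marking a target.
# The agent sits at the centre and must step LEFT (-1), STAY (0), or RIGHT (+1).

def perceive(pixels):
    """Recognition stage: reduce raw pixels to a hand-designed feature (target index)."""
    return max(range(len(pixels)), key=lambda i: pixels[i])

def control(target_index, agent_index):
    """Control stage: choose an action using only the extracted feature."""
    if target_index > agent_index:
        return 1
    if target_index < agent_index:
        return -1
    return 0

def pipeline_policy(pixels):
    """Not end-to-end: two separately designed stages glued together."""
    return control(perceive(pixels), agent_index=len(pixels) // 2)

def end_to_end_policy(pixels, weights):
    """End-to-end: one parameterized map from raw pixels to an action.

    In deep RL the weights would belong to a neural network trained from
    reward; here they are passed in only to show the interface.
    """
    score = sum(w * p for w, p in zip(weights, pixels))
    return (score > 0) - (score < 0)  # sign of the score: -1, 0, or +1

pixels = [0.0, 0.1, 0.0, 0.0, 0.9]
# Hand-picked weights standing in for learned ones (an assumption for this demo).
weights = [i - len(pixels) // 2 for i in range(len(pixels))]
print(pipeline_policy(pixels), end_to_end_policy(pixels, weights))
```

In the pipeline version, the feature handed to `control` is fixed by whoever wrote `perceive`. In the end-to-end version, there is no hand-designed intermediate feature: whatever the policy extracts from the pixels is determined by its parameters, which training would adjust for the task.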

What can deep learning & reinforcement learning do well now?

Acquire a high degree of proficiency in domains governed by simple, known rules. E.g. Atari, Go.
Learn simple skills from raw sensory inputs, given enough experience. E.g. example needed
Learn by imitating enough human-provided expert behavior. E.g. example needed

What are the challenges of Deep RL?

Efficiency: deep RL is slow and needs a lot of experience to learn.
Transfer learning: how can past experience be reused on new tasks?
It is not clear what the reward function should be.
It is not clear what the role of prediction should be.

It’s hard to truly understand all of the points above without much experience in deep RL, but it’s helpful to keep them in mind throughout the course.


Puyuan Peng
