KL divergence : size of update

In the section [Policy Gradients diagnostics](https://github.com/williamFalcon/DeepRLHacks#policy-gradient-diagnostics), it is mentioned :

> If KL is .01 then very small.
    If 10 then too much.

I couldn't find the given values in the [slides](http://rll.berkeley.edu/deeprlcourse/docs/nuts-and-bolts.pdf) of the original talk. Where these values are from ?

---

Also, in the previous part (about entropy), you mentioned how to fix the problem with fast-dropping entropy, _and it is really helpful_.

**Does anyone know how to fix the problem of KL divergence being too low ?**


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

KL divergence : size of update #3

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

KL divergence : size of update #3

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions