Example for lagrangian constraints 

The documentation of this library is very limited and it is hard to parse how it should be applied for different applications. For instance, in Many RL algorithms such as [PPO](https://arxiv.org/pdf/1707.06347.pdf) and [MPO](https://arxiv.org/pdf/1806.06920.pdf), an entropy regularization terms or the KL constraints between policies employed to stabilise standard RL objectives. Is it possible to give a practical example of how one can use this library for a dual problem with lagrangian multipliers for such applications?

Many thanks.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Example for lagrangian constraints #51

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Example for lagrangian constraints #51

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions