Formalizing Markov Decision Processes in Lean

Verified Lean algorithms for solving tabular MDPs and proving their properties. The focus of this project is on two main goals:

Basic algorithms that can solve robust and risk-averse MDPs of moderate size.
Proofs of correctness of algorithms and fundamental MDP properties which can be used independently to prove structural results, such as the optimality of certain policy class.

Status

Basic probability properties

Definitions: probability space and definition
Definitions: probability, expectation, conditional properties
Independent of random variables
Tower property, law of the unconscious statistician
Quantile definition and basic properties
Quantile under monotone transformation

Value at Risk

Definition (non-constructive)
Practical implementation O(n^2) and correctness
Fast practical implementation O(n log n) and correctness
VaR is positively homogeneous and monotone
VaR is translation (cash) invariant
VaR under monotone transformation

MDP: Basics

Definition of MDP
Definition of policies (history, Markov, stationary)
Definition of value function (history-dependent)

Finite Horizon

Histories and manipulation
Probability space over histories
Return and optimal return using histories
History-dependent value function and dynamic program
Markov optimal value function and optimal policy
DP algorithms

Risk-averse finite horizon

History-dependent utility functions
Augmented value function dynamic program
VaR computation from utility function
VaR DP decomposition as in Hau et al., 2023

Discounted infinite horizon

Average reward horizon

Lean Resources

Most useful

Overview of tactics: https://github.com/madvorak/lean4-tactics
Comprehensive list of tactics: https://seasawher.github.io/mathlib4-help/tactics/
Loogle: https://loogle.lean-lang.org/
Moogle: https://www.moogle.ai/

Others

Blueprint: https://github.com/PatrickMassot/leanblueprint
Lean packages and extensions: https://reservoir.lean-lang.org/
Notations: https://github.com/leanprover-community/lean4-mode/blob/master/data/abbreviations.json
Resource for Probability: https://korivernon.com/documents/MathematicalStatisticsandDataAnalysis3ed.pdf

Name		Name	Last commit message	Last commit date
Latest commit History 333 Commits
.github/workflows		.github/workflows
MDPLib		MDPLib
blueprint/src		blueprint/src
home_page		home_page
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
MDPLib.lean		MDPLib.lean
Main.lean		Main.lean
README.md		README.md
lake-manifest.json		lake-manifest.json
lakefile.toml		lakefile.toml
lean-toolchain		lean-toolchain

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Formalizing Markov Decision Processes in Lean

Status

Basic probability properties

Value at Risk

MDP: Basics

Finite Horizon

Risk-averse finite horizon

Discounted infinite horizon

Average reward horizon

Lean Resources

Most useful

Others

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Formalizing Markov Decision Processes in Lean

Status

Basic probability properties

Value at Risk

MDP: Basics

Finite Horizon

Risk-averse finite horizon

Discounted infinite horizon

Average reward horizon

Lean Resources

Most useful

Others

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages