🎯
Focusing
Popular repositories Loading
-
PYthon_Algorithms_Library
PYthon_Algorithms_Library PublicMigrating popular data structures and algorithms in C++ to Python, such as linked list, balanced search tree, heap with a support of updating and deletion, as well as other commonly used small func…
Python 4
-
TRePO-Response-Level_Rewards_Are_All_You_Need_for_Online_Reinforcement_Learning
TRePO-Response-Level_Rewards_Are_All_You_Need_for_Online_Reinforcement_Learning PublicResponse-Level Rewards Are Equivalent to Token-Level Rewards: Mathematical Principles for Online Reinforcement Learning
-
AgentTrainer.LLM
AgentTrainer.LLM PublicMulti-turn Agent RL training in OpenRLHF, an AgentFlow reimplementation.
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.
