SummerRainET2008

Follow

🎯

Focusing

Tian Xia SummerRainET2008

🎯

Focusing

Follow

NLP, LLM, agent

1 follower · 0 following

Achievements

Achievements

Popular repositories Loading

PYthon_Algorithms_Library PYthon_Algorithms_Library Public

Migrating popular data structures and algorithms in C++ to Python, such as linked list, balanced search tree, heap with a support of updating and deletion, as well as other commonly used small func…

Python 4
TopSpin TopSpin Public

A PyTorch based high-level Deep Learning training framework. Seamlessly switch between single-GPU and multi-server; Supports gradient accumulation, learning rate warmup and decay, early stop, mixed…

Python 4
TRePO-Response-Level_Rewards_Are_All_You_Need_for_Online_Reinforcement_Learning TRePO-Response-Level_Rewards_Are_All_You_Need_for_Online_Reinforcement_Learning Public

Response-Level Rewards Are Equivalent to Token-Level Rewards: Mathematical Principles for Online Reinforcement Learning
AgentTrainer.LLM AgentTrainer.LLM Public

Multi-turn Agent RL training in OpenRLHF, an AgentFlow reimplementation.

Python