Skip to content
View SummerRainET2008's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report SummerRainET2008

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories Loading

  1. PYthon_Algorithms_Library PYthon_Algorithms_Library Public

    Migrating popular data structures and algorithms in C++ to Python, such as linked list, balanced search tree, heap with a support of updating and deletion, as well as other commonly used small func…

    Python 4

  2. TopSpin TopSpin Public

    A PyTorch based high-level Deep Learning training framework. Seamlessly switch between single-GPU and multi-server; Supports gradient accumulation, learning rate warmup and decay, early stop, mixed…

    Python 4

  3. TRePO-Response-Level_Rewards_Are_All_You_Need_for_Online_Reinforcement_Learning TRePO-Response-Level_Rewards_Are_All_You_Need_for_Online_Reinforcement_Learning Public

    Response-Level Rewards Are Equivalent to Token-Level Rewards: Mathematical Principles for Online Reinforcement Learning

  4. AgentTrainer.LLM AgentTrainer.LLM Public

    Multi-turn Agent RL training in OpenRLHF, an AgentFlow reimplementation.

    Python