ShenzhiYang2000

Follow

🎯

Focusing

Shenzhi Yang ShenzhiYang2000

🎯

Focusing

Follow

Ph.D. Student@ZJU

16 followers · 139 following

Zhejiang University
https://shenzhiyang2000.github.io/

Achievements

Achievements

Pinned Loading

TRAPO TRAPO Public

Official Repository of "[ICLR26] TRAPO: A Semi-Supervised Reinforcement Learning Framework for Boosting LLM Reasoning"

Python 25 1
OLR OLR Public

Can LLMs Learn to Reason Robustly under Noisy Supervision?

Python 11
OPRD OPRD Public

Code for "OPRD: On-Policy Representation Distillation"

Python 3