Popular repositories Loading
-
deepritz_martensitic
deepritz_martensitic Public本项目旨在使用deepritz方法复现学术论文 "Branching of twins near an austenite—twinned-martensite interface" 中的核心模拟结果。/The objective of this project is to reproduce the key findings from the seminal paper, "Branchi…
Python 1
-
cs336_spring2025_assignment1
cs336_spring2025_assignment1 PublicImplementation of a Decoder-only Transformer language model from scratch for CS336, featuring a byte-level BPE tokenizer, RoPE, Multi-Head Self-Attention and SwiGLU FFN. Trained on TinyStories with…
Python 1
-
cs336_spring2025_assignment5
cs336_spring2025_assignment5 PublicCS336 作业 5:基于 Qwen2.5 模型的 LLM 对齐与推理强化学习。完整实现了监督微调(SFT)与组相对策略优化(GRPO)算法,并在 GSM8K 数据集上完成零样本、在策与离策的训练与评估对比。
Python 1
If the problem persists, check the GitHub status page or contact support.