tinylora

Here are 2 public repositories matching this topic...

Chi-Shan0707 / TinyLoRA-GRPO-Coder

Inspired by 《Learning to Reason in 13 parameters》, use TinyLoRA+GRPO(32 parameters) to fine-tune Qwen2.5-Coder-3B-Instruct(or other models) to accomplish competitive programming.

python cpp rl cpp17 deepcoder peft good-first-issue good-first-pr good-first-contribution qwen2-5 grpo qwen-coder tinylora learning-to-reason-in-13-parameters code-contests deepmind-code-contests

Updated Mar 11, 2026
Python

Chi-Shan0707 / Qwen4Luogu-RL

Star

This repo can work. But I make some updates in a new repo. Please see more in https://github.com/Chi-Shan0707/TinyLoRA-Qwen-Coder

cpp fudan-university rl lora fudan luogu good-first-issue modelscope qwen grpo qwen-coder tinylora learning-to-reason-in-13-parameters

Updated Feb 11, 2026
Python

Improve this page

Add a description, image, and links to the tinylora topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the tinylora topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly