feat(mini_swe_env): add SWE-Gym async GRPO environment with Pi interception and HF Space deployment #695

Open
rycerzes wants to merge 83 commits into huggingface:main from rycerzes:feat/mini_swe_env

refactor: loss_mask/prefix-merging fix

c6612b3

Workflow runs completed with no jobs