-
Notifications
You must be signed in to change notification settings - Fork 533
Pull requests: allenai/open-instruct
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Fix DataPreparationActor hanging on shutdown
#1611
opened Apr 14, 2026 by
hamishivi
Collaborator
Loading…
3 tasks done
Warn about checkpoint disk space only on the first checkpoint
#1608
opened Apr 13, 2026 by
mnoukhov
Contributor
Loading…
Use Ray to validate that allocated gpus correspond to requeusted # of GPUs
#1606
opened Apr 13, 2026 by
mnoukhov
Contributor
Loading…
Auto-update CHANGELOG.md from PR descriptions on merge
#1596
opened Apr 9, 2026 by
finbarrtimbers
Collaborator
Loading…
2 of 3 tasks
Stabilize GRPO LLM judge calls by routing them through the guarded LiteLLM helper
#1587
opened Apr 3, 2026 by
taivu1998
Loading…
grpo_fast: harden single-node startup resource checks and diagnostics
#1586
opened Apr 3, 2026 by
taivu1998
Loading…
Changes
DataPreparationActor so that we can configure it into a replay buffer
#1583
opened Apr 2, 2026 by
finbarrtimbers
Collaborator
Loading…
Fix GRPO rank_microbatch_size units
#1557
opened Mar 24, 2026 by
finbarrtimbers
Collaborator
Loading…
1 task
Rename num_unique_prompts_rollout and num_samples_per_prompt_rollout
#1538
opened Mar 19, 2026 by
finbarrtimbers
Collaborator
Loading…
Migrate to vLLM native weight transfer API
#1515
opened Mar 6, 2026 by
finbarrtimbers
Collaborator
Loading…
Port vLLM v1 AsyncMPClient weight-update flow
#1506
opened Mar 2, 2026 by
mnoukhov
Contributor
Loading…
Remove cast from vLLM tool definitions typing
#1504
opened Mar 2, 2026 by
finbarrtimbers
Collaborator
Loading…
Rename TIS ratio cap, add low bound and hard filter flag
#1503
opened Mar 2, 2026 by
finbarrtimbers
Collaborator
Loading…
Require checkpoint on Beaker restarts for DPO and GRPO training
codex
#1469
opened Feb 10, 2026 by
finbarrtimbers
Collaborator
Loading…
Adds Olmo-core SFT script that matches
finetune.py's interface and Olmo-core's efficiency
#1327
opened Jan 7, 2026 by
finbarrtimbers
Collaborator
Loading…
2 of 3 tasks
ProTip!
Updated in the last three days: updated:>2026-04-11.