long-horizon

Star

Here are 16 public repositories matching this topic...

microsoft / delegate52

Star

Code that accompanies the paper release for "LLMs Corrupt Your Documents When You Delegate"

simulation delegation llms long-horizon

Updated Apr 20, 2026
Python

qiqihezh / agentic-grpo-longhorizon

Star

Fixing GRPO training collapse in long-horizon multi-tool agents. A lightweight PRM-Lite + LATA joint approach achieves +37% over vanilla GRPO on τ-bench airline (50-task, multi-turn).

reinforcement-learning long-horizon qwen agentic-ai tool-calling process-reward-model grpo tau-bench multi-turn-agents

Updated May 11, 2026
Python

avanturist322 / awesome-memory-vla

Star

🧠 Awesome Memory-VLA: A curated list of Visual-Language-Action models with memory

robotics memory vla pomdp vlm embodied-ai long-horizon visual-language-models long-context-modeling visual-language-action-models memory-vlm memory-vla

Updated May 3, 2026

abundant-ai / long-horizon

Star

SWE-Marathon: an ultra long-horizon SWE benchmark

benchmark terminal swe long-horizon

Updated May 16, 2026
Rust

kwanyoungpark / MAC

Star

Code for Scalable Offline Model-Based RL with Action chunking

reinforcement-learning model-based-reinforcement-learning offline-reinforcement-learning long-horizon action-chunking

Updated Feb 20, 2026
Python

kwanyoungpark / LEQ

Star

Code for Tackling Long-Horizon Tasks with Model-based Offline Reinforcement Learning

reinforcement-learning model-based-reinforcement-learning offline-reinforcement-learning long-horizon

Updated Feb 6, 2025
Python

TheSeamau5 / envoi

Star

The simplest way to build long-horizon environments

ai rl agents long-horizon agentic-ai long-horizon-ai

Updated Apr 8, 2026
Python

3xcaffeine / frontier-swe-openenv

Star

A family of long-horizon software-engineering environments for OpenEnv, adapted from https://github.com/Proximal-Labs/frontier-swe

rl-environment long-horizon openenv agent-harness

Updated Apr 26, 2026
C

mturan33 / isaac-g1-hierarchical

Star

VLM-RL Hierarchical Loco-Manupilation For Long-Horizon Tasks With G1 robot in Isaac Lab/Sim

vlm g1 ppo semantic-map isaacsim loco-manipulation isaac-sim unitree long-horizon long-horizon-robotic-manipulation isaac-lab isaaclab unitree-g1 long-horizon-ai long-horizon-manipulation long-horizon-intelligence long-horizon-tasks

Updated Apr 28, 2026
Python

OtherPowers / clawdbot

Star

OpenClaw humanity infusions OtherPowers Creative Intelligence Agency. 🦞

intelligence agency creative long-horizon walksonthebeach age-of-aquarius-tech sgidoula

Updated Feb 24, 2026
TypeScript

Aditya-Ranjan1234 / Long-Horizon-Memory-V2

Star

A real-world inspired environment for selective context retention under noise. It evaluates an LLM's ability to manage a fixed-capacity memory buffer, retaining high-value information while filtering out distractors

learning environment context retention long-horizon

Updated Apr 25, 2026
Jupyter Notebook

Aditya-Ranjan1234 / Long-Horizon-Memory-V2-Dashboard

Star

Dashboard for real-world inspired environment for selective context retention under noise. It evaluates an LLM's ability to manage a fixed-capacity memory buffer, retaining high-value information while filtering out distractors

monitoring dashboard reinforment-learning long-horizon

Updated Apr 25, 2026
Python

Ten-Trillion-Triangles / TPipe

Star

TPipe is the agent operating environment for deterministic, multimodal AI systems. Built Kotlin-first, it composes runtime substrates into governed pipelines with rich tracing, disciplined context and token control, native function binding, and provider-agnostic execution for long-running, headless agents.

orchestration multi-agent governance determ ai-agents llm long-horizon operating-en headless-ag

Updated May 14, 2026
Kotlin

headcrabz / horizonX

Star

Long-horizon agent execution harness — reliable autonomous runs for Claude Code, Codex, OpenHands, and custom agents. Goal graphs, spin detection, HITL gates, fork/merge, 8 strategies, 6 validators.

python openai agents codex llm long-horizon claude-code agent-harness

Updated May 1, 2026
Python

broomva / persist

Sponsor

Star

bstack P12 — Persistent Loop Discipline. Cross-context restart loop with state in the filesystem. Closes the long-horizon context-rot failure mode. Composes with bstack P5/P6/P7/P10/P11.

long-horizon claude-code agent-skill ralph-loop bstack broomva context-restart

Updated May 6, 2026
Python

hayoungjungg / SciConBench

Star

Official repository for the paper: Can AI Agents Synthesize Scientific Conclusions?

benchmark ai-agents long-form long-horizon agentic-workflow scientific-conclusion-synthesis clean-room-evaluation

Updated May 9, 2026

Improve this page

Add a description, image, and links to the long-horizon topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the long-horizon topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

long-horizon

Here are 16 public repositories matching this topic...

microsoft / delegate52

qiqihezh / agentic-grpo-longhorizon

avanturist322 / awesome-memory-vla

abundant-ai / long-horizon

kwanyoungpark / MAC

kwanyoungpark / LEQ

TheSeamau5 / envoi

3xcaffeine / frontier-swe-openenv

mturan33 / isaac-g1-hierarchical

OtherPowers / clawdbot

Aditya-Ranjan1234 / Long-Horizon-Memory-V2

Aditya-Ranjan1234 / Long-Horizon-Memory-V2-Dashboard

Ten-Trillion-Triangles / TPipe

headcrabz / horizonX

broomva / persist

hayoungjungg / SciConBench

Improve this page

Add this topic to your repo