feat: Nous Research provider support with OAuth auto-refresh by phuongvm · Pull Request #737 · plastic-labs/honcho

phuongvm · 2026-05-29T00:57:23Z

Summary

Adds nous as a supported LLM transport with automatic OAuth credential refresh on 401 AuthenticationError.

Key Changes

New transport: nous added to ModelTransport literal
OAuth refresh module: src/llm/nous_refresh.py — handles token refresh, secure state file management (0700/0600 permissions), and agent key minting
OpenAIBackend autorefresh: _call_with_autorefresh() helper eliminates DRY violation; used in parse, create, and stream paths
Missing imports fixed: BadRequestError, LengthFinishReasonError now properly imported
Config: NOUS_API_KEY, NOUS_BASE_URL (defaults to https://inference-api.nousresearch.com/v1)
Exception class: NousAuthError for OAuth-specific failures

Configuration

NOUS_API_KEY=***
NOUS_BASE_URL=https://inference-api.nousresearch.com/v1

Tests (11 tests)

Credential loading/saving with state file management
OAuth refresh flow (success + failure propagation)
Auto-refresh on 401 (parse, create, stream paths)
Non-Nous backends do NOT trigger refresh on 401

Split from #692 per maintainer request to separate Nous provider support from LLM fallback.

Summary by CodeRabbit

New Features
- Added Nous as a supported LLM provider with configurable API key and base URL.
- Automatic Nous credential refresh and agent key minting to maintain/authenticate sessions.
- Improved tool-selection handling for broader provider compatibility.
Bug Fixes
- Backend now transparently retries on Nous authentication failures to reduce spurious errors.
Tests
- Added integration and unit tests covering Nous auto-refresh, OAuth flows, and env/state handling.

Adds 'nous' as a supported LLM transport with automatic OAuth credential refresh on 401 AuthenticationError. - New 'nous' transport in ModelTransport - OAuth refresh module (src/llm/nous_refresh.py) with secure state files - OpenAIBackend autorefresh integration (parse/create/stream paths) - NousAuthError exception class - Config: NOUS_API_KEY, NOUS_BASE_URL defaults - Fix missing imports (BadRequestError, LengthFinishReasonError) - Includes tests for autorefresh and credential loading.

coderabbitai · 2026-05-29T00:57:37Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 2e885a4f-939c-4d54-a9a3-cbee05cd997d

📥 Commits

Reviewing files that changed from the base of the PR and between 9338951 and a9df7fa.

📒 Files selected for processing (5)

src/config.py
src/exceptions.py
src/llm/backends/openai.py
src/llm/nous_refresh.py
tests/llm/test_nous_refresh.py

🚧 Files skipped from review as they are similar to previous changes (5)

src/exceptions.py
src/config.py
tests/llm/test_nous_refresh.py
src/llm/backends/openai.py
src/llm/nous_refresh.py

Walkthrough

This PR integrates Nous as a supported LLM transport by adding OAuth credential auto-refresh to the OpenAI backend: configuration and exception types, a Nous refresh orchestrator module, backend interception and retry on 401 when enabled, and tests for both refresh logic and backend retry behavior.

Changes

Nous credential auto-refresh with OpenAI backend integration

Layer / File(s)	Summary
Configuration and exception setup `src/config.py`, `src/exceptions.py`	`ModelTransport` gains `"nous"` literal; `LLMSettings` adds `NOUS_API_KEY` and `NOUS_BASE_URL` with default URL; `NousAuthError` exception introduced for OAuth failures.
Credentials integration for Nous `src/llm/credentials.py`	`default_transport_api_key` returns `NOUS_API_KEY` when transport is `"nous"`.
OAuth refresh orchestration module `src/llm/nous_refresh.py`	New module implementing atomic state persistence, bootstrap from Hermes/project config, project discovery, `.env` key updates, async HTTP token refresh and agent key minting, and main orchestrator that loads state, refreshes tokens, mints agent keys, persists credentials, updates `.env`, and returns new key or `None` on failure.
OpenAI backend auto-refresh on 401 `src/llm/backends/openai.py`	`OpenAIBackend` constructor gains `is_nous` flag; new `_call_with_autorefresh` helper retries on `AuthenticationError` after credential refresh; parse, create, and stream paths now intercept 401 errors; tool choice `"any"` maps to `"required"`.
Backend auto-refresh integration tests `tests/llm/test_backends/test_nous_autorefresh.py`	Validates 401 interception and single retry for both create and stream paths when `is_nous=True`; confirms no retry when disabled; verifies failed refresh propagates error; asserts streamed responses include expected chunks and usage metadata.
Nous refresh module unit tests `tests/llm/test_nous_refresh.py`	Isolates module via mocking: state persistence (missing file, save/load round-trip), `.env` updates (create, replace, append), project root discovery, and end-to-end refresh with mocked HTTP and injected settings.

Sequence Diagram

sequenceDiagram
  participant Client
  participant OpenAIBackend
  participant refresh_nous_credentials
  participant OpenAI_API
  Client->>OpenAIBackend: complete/stream with is_nous=true
  OpenAIBackend->>OpenAI_API: chat.completions request
  OpenAI_API-->>OpenAIBackend: AuthenticationError (401)
  OpenAIBackend->>refresh_nous_credentials: get new agent_key
  refresh_nous_credentials-->>OpenAIBackend: agent_key
  OpenAIBackend->>OpenAI_API: update client.api_key
  OpenAIBackend->>OpenAI_API: retry chat.completions
  OpenAI_API-->>OpenAIBackend: response
  OpenAIBackend-->>Client: return content

🎯 3 (Moderate) | ⏱️ ~25 minutes

A bunny hops through OAuth gates,
Tokens minted, keys await,
When auth fails with codes that scare,
Refresh magic fills the air! 🐰✨

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 71.88% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (4 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The pull request title accurately and concisely summarizes the main feature: Nous Research provider support with OAuth auto-refresh, which is the central change across all modified and new files.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 5

🧹 Nitpick comments (4)

tests/llm/test_nous_refresh.py (1)
156-159: ⚡ Quick win

Move these imports to the top of the file.

SimpleNamespace, Mock, and sys are imported at the bottom but used in the test at lines 111-141. It only works because names resolve at call time; placing them at module top is clearer and isort-compliant.
♻️ Proposed change
 import json
 import os
+import sys
 from pathlib import Path
-from unittest.mock import AsyncMock, MagicMock, patch
+from types import SimpleNamespace
+from unittest.mock import AsyncMock, MagicMock, Mock, patch
-
-
-# Helper for SimpleNamespace
-from types import SimpleNamespace
-from unittest.mock import Mock
-import sys
As per coding guidelines: "Follow isort conventions with absolute imports preferred in Python code".
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/llm/test_nous_refresh.py` around lines 156 - 159, Move the three
imports (SimpleNamespace, Mock, and sys) from their current bottom location into
the module-level import block at the top of the test file so they are declared
with the other imports and conform to isort conventions; specifically add "from
types import SimpleNamespace", "from unittest.mock import Mock", and "import
sys" to the top import section so tests referencing SimpleNamespace, Mock, and
sys in the test_nous_refresh functions (lines around 111–141) resolve clearly
and remain isort-compliant.
src/llm/nous_refresh.py (2)
50-54: 💤 Low value

Avoid catching blind Exception; narrow to the expected error types.

The bare except Exception here (and the same pattern at lines 101 and 122) swallows unexpected errors. For these file/JSON reads, catch (OSError, json.JSONDecodeError) instead so genuine programming errors surface.
♻️ Proposed change for load_state
     if STATE_FILE.exists():
         try:
             return json.loads(STATE_FILE.read_text())
-        except Exception as exc:
+        except (OSError, json.JSONDecodeError) as exc:
             logger.warning("Failed to parse %s: %s", STATE_FILE, exc)
As per coding guidelines: "Use explicit error handling with appropriate exception types in Python code".
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/llm/nous_refresh.py` around lines 50 - 54, Replace the broad except
Exception handlers around state file reading/parsing with explicit exceptions to
avoid swallowing unrelated errors: catch (OSError, json.JSONDecodeError) for the
blocks that call STATE_FILE.read_text() and json.loads(...) (the current
try/except around STATE_FILE.exists() and the similar handlers later in the
file). Update the except clauses to "except (OSError, json.JSONDecodeError) as
exc" and keep the same logger.warning message, so filesystem and JSON parse
errors are handled but other exceptions still surface.
162-177: 💤 Low value

Handle missing refresh_token in Nous refresh response

In src/llm/nous_refresh.py (lines 162-177), data["refresh_token"] can raise KeyError if the token endpoint omits refresh_token, aborting the refresh even though a new access_token was obtained. Keep the existing refresh_token when it’s absent.
♻️ Proposed change
     data = resp.json()
-    return data["access_token"], data["refresh_token"]
+    return data["access_token"], data.get("refresh_token", refresh_token)
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/llm/nous_refresh.py` around lines 162 - 177, In
refresh_access_token(refresh_token: str) update the return logic to handle a
missing refresh_token in the response: after parsing data = resp.json(), extract
access = data["access_token"] and set new_refresh = data.get("refresh_token",
refresh_token) so that if the token endpoint omits refresh_token we keep the
existing refresh_token parameter; return (access, new_refresh) and avoid raising
KeyError when refresh_token is absent (keep existing variable names: resp, data,
refresh_token, refresh_access_token).
tests/llm/test_backends/test_nous_autorefresh.py (1)
119-180: ⚡ Quick win

Add coverage for the structured parse() auto-refresh branch.

The create and stream paths are exercised, but the structured-output path (response_format=<BaseModel subclass>) has its own 401 refresh branch (openai.py Lines 177-193) wrapped by additional LengthFinishReasonError / BadRequestError handlers. That branch is currently untested, so a regression there (e.g. when consolidating onto _call_with_autorefresh) wouldn't be caught.

Want me to add a test_nous_backend_parse_auto_refresh case that calls complete(..., response_format=SomeModel) with parse raising 401 then succeeding?
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@tests/llm/test_backends/test_nous_autorefresh.py` around lines 119 - 180, Add
a new async test (e.g., test_nous_backend_parse_auto_refresh) that mirrors the
existing stream/create auto-refresh test but exercises the structured-output
(response_format) branch: mock OpenAIBackend.complete to call into the path that
uses response_format=<BaseModel subclass> and arrange for the first parse
attempt to raise an AuthenticationError (401) and subsequent call to succeed;
ensure refresh_nous_credentials is patched to return "new_key", assert
mock_client.api_key updated, that refresh was called once, and validate the
final parsed output and token/done semantics; reference OpenAIBackend.complete,
the parse() call and the refresh_nous_credentials patch, and include handling of
the LengthFinishReasonError/BadRequestError wrappers so the test specifically
triggers the 401 inside parse and then verifies retry behavior.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/config.py`:
- Line 25: The _normalize_model_transport function currently rejects shorthand
forms like "nous/<model>" even though ModelTransport includes "nous"; update
_normalize_model_transport to treat "nous" like the other providers by accepting
and normalizing shorthand transport strings that start with "nous/" (or equal
"nous") to the canonical transport and model parts. Locate the
_normalize_model_transport function and add "nous" to the accepted transport
list/regex and the mapping logic so inputs such as "nous/whatever" are parsed
and returned in the same normalized tuple/format as "anthropic/...",
"openai/...", and "gemini/...".

In `@src/exceptions.py`:
- Around line 136-140: Change NousAuthError to subclass HonchoException instead
of Exception so FastAPI maps it to the HonchoException handler; set its
status_code to 401 and detail to "Nous authentication failed". Locate the class
NousAuthError in src/exceptions.py, replace its base class with HonchoException
and add the status_code and detail attributes (keeping the docstring), ensuring
the exception will be handled by the HonchoException handler.

In `@src/llm/backends/openai.py`:
- Around line 122-144: The helper _call_with_autorefresh is never used and the
refresh-and-retry logic is copy-pasted into the parse/create/stream paths;
remove those inline retry blocks and call
_call_with_autorefresh(use_parse=True/False, params=params) from the parse,
create and stream code paths so all retries go through the single helper; inside
the helper keep the deferred import but change it to an absolute import (from
src.llm.nous_refresh import refresh_nous_credentials) and ensure its behavior
(setting self._client.api_key, updating settings.LLM.NOUS_API_KEY under
contextlib.suppress, logging, and re-raising if refresh fails) matches the
original inline logic so exceptions like LengthFinishReasonError still propagate
to the outer handlers.

In `@src/llm/nous_refresh.py`:
- Around line 78-84: The directory creation for STATE_FILE currently uses
STATE_FILE.parent.mkdir(parents=True, exist_ok=True) but does not enforce the
requested 0o700 permissions; explicitly set the directory mode to 0o700 after
ensuring it exists (e.g., call os.chmod on STATE_FILE.parent to 0o700) so both
newly created and pre-existing directories are tightened, then continue with the
existing atomic write flow (tmp = STATE_FILE.with_suffix(".tmp"),
tmp.write_text(...), os.chmod(tmp, 0o600), tmp.replace(STATE_FILE),
os.chmod(STATE_FILE, 0o600)).

In `@tests/llm/test_nous_refresh.py`:
- Around line 89-153: The test fails because STATE_FILE is bound at import and
load_state() is never stubbed so refresh_nous_credentials() sees no
refresh_token; fix by stubbing load_state or overriding STATE_FILE in the
imported module before calling refresh_nous_credentials(): e.g., in the test set
refresh_mod.load_state to a callable that returns {"refresh_token":
"some_token"} (or monkeypatch refresh_mod.STATE_FILE to point to the
tmp_path/state.json and write that file) so refresh_nous_credentials() has a
refresh_token to use; ensure you patch before awaiting
refresh_nous_credentials() and restore originals after the test.

---

Nitpick comments:
In `@src/llm/nous_refresh.py`:
- Around line 50-54: Replace the broad except Exception handlers around state
file reading/parsing with explicit exceptions to avoid swallowing unrelated
errors: catch (OSError, json.JSONDecodeError) for the blocks that call
STATE_FILE.read_text() and json.loads(...) (the current try/except around
STATE_FILE.exists() and the similar handlers later in the file). Update the
except clauses to "except (OSError, json.JSONDecodeError) as exc" and keep the
same logger.warning message, so filesystem and JSON parse errors are handled but
other exceptions still surface.
- Around line 162-177: In refresh_access_token(refresh_token: str) update the
return logic to handle a missing refresh_token in the response: after parsing
data = resp.json(), extract access = data["access_token"] and set new_refresh =
data.get("refresh_token", refresh_token) so that if the token endpoint omits
refresh_token we keep the existing refresh_token parameter; return (access,
new_refresh) and avoid raising KeyError when refresh_token is absent (keep
existing variable names: resp, data, refresh_token, refresh_access_token).

In `@tests/llm/test_backends/test_nous_autorefresh.py`:
- Around line 119-180: Add a new async test (e.g.,
test_nous_backend_parse_auto_refresh) that mirrors the existing stream/create
auto-refresh test but exercises the structured-output (response_format) branch:
mock OpenAIBackend.complete to call into the path that uses
response_format=<BaseModel subclass> and arrange for the first parse attempt to
raise an AuthenticationError (401) and subsequent call to succeed; ensure
refresh_nous_credentials is patched to return "new_key", assert
mock_client.api_key updated, that refresh was called once, and validate the
final parsed output and token/done semantics; reference OpenAIBackend.complete,
the parse() call and the refresh_nous_credentials patch, and include handling of
the LengthFinishReasonError/BadRequestError wrappers so the test specifically
triggers the 401 inside parse and then verifies retry behavior.

In `@tests/llm/test_nous_refresh.py`:
- Around line 156-159: Move the three imports (SimpleNamespace, Mock, and sys)
from their current bottom location into the module-level import block at the top
of the test file so they are declared with the other imports and conform to
isort conventions; specifically add "from types import SimpleNamespace", "from
unittest.mock import Mock", and "import sys" to the top import section so tests
referencing SimpleNamespace, Mock, and sys in the test_nous_refresh functions
(lines around 111–141) resolve clearly and remain isort-compliant.

🪄 Autofix (Beta)

Fix all unresolved CodeRabbit comments on this PR:

Push a commit to this branch (recommended)
Create a new PR with the fixes

ℹ️ Review info

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 5cbf1bc4-0295-4b81-89fe-d745cf9b81db

📥 Commits

Reviewing files that changed from the base of the PR and between 85239a6 and 9338951.

📒 Files selected for processing (7)

src/config.py
src/exceptions.py
src/llm/backends/openai.py
src/llm/credentials.py
src/llm/nous_refresh.py
tests/llm/test_backends/test_nous_autorefresh.py
tests/llm/test_nous_refresh.py

- Add 'nous' and 'lmstudio' to _normalize_model_transport shorthand (config.py) - NousAuthError subclasses HonchoException with status_code=401 (exceptions.py) - Refactor 3 inline retry blocks into _call_with_autorefresh helper (openai.py) - Narrow except Exception to (OSError, json.JSONDecodeError) (nous_refresh.py) - Set state directory permissions to 0o700 (nous_refresh.py) - Handle missing refresh_token in response with .get() fallback (nous_refresh.py) - Move imports to top of test file, stub load_state (test_nous_refresh.py)

phuongvm mentioned this pull request May 29, 2026

feat: LLM model fallback on failure — fast failover to secondary provider/model #692

Open

coderabbitai Bot reviewed May 29, 2026

View reviewed changes

Comment thread src/config.py

Comment thread src/exceptions.py Outdated

Comment thread src/llm/backends/openai.py

Comment thread src/llm/nous_refresh.py

Comment thread tests/llm/test_nous_refresh.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Nous Research provider support with OAuth auto-refresh#737

feat: Nous Research provider support with OAuth auto-refresh#737
phuongvm wants to merge 2 commits into
plastic-labs:mainfrom
phuongvm:pr-692-nous-provider

phuongvm commented May 29, 2026 •

edited by coderabbitai Bot

Loading

Uh oh!

coderabbitai Bot commented May 29, 2026 •

edited

Loading

❌ Failed checks (1 warning)

Uh oh!

coderabbitai Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

phuongvm commented May 29, 2026 • edited by coderabbitai Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Key Changes

Configuration

Tests (11 tests)

Summary by CodeRabbit

Uh oh!

coderabbitai Bot commented May 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram

❌ Failed checks (1 warning)

Uh oh!

coderabbitai Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

phuongvm commented May 29, 2026 •

edited by coderabbitai Bot

Loading

coderabbitai Bot commented May 29, 2026 •

edited

Loading