Implement structured output generation for both LlamaLanguageModel / MLXLanguageModel #75

eastriverlee · 2025-12-24T15:45:59Z

Related to #27

Copilot

Pull request overview

This PR implements structured output generation for LlamaLanguageModel and MLXLanguageModel by adding constrained token sampling to generate JSON that conforms to a schema. The implementation includes comprehensive tests covering various data types and structures.

Key changes:

Added ConstrainedJSONGenerator that uses token-level sampling to generate schema-conformant JSON
Implemented TokenBackend protocol with adapters for both Llama and MLX models
Enhanced GenerationGuide to store constraint values for min/max on numbers and arrays
Extended GenerationSchema with character validation and schema prompt generation

Reviewed changes

Copilot reviewed 9 out of 9 changed files in this pull request and generated 8 comments.

Show a summary per file

File	Description
Tests/AnyLanguageModelTests/StructuredGenerationTests.swift	Comprehensive test suite covering simple types, nested structs, enums, arrays, and optionals across all supported model types
Tests/AnyLanguageModelTests/GenerableMacroTests.swift	Added round-trip tests for enums, nested structs, and arrays
Sources/AnyLanguageModelMacros/GenerableMacro.swift	Refactored guide extraction to use a structured Constraints type and properly parse numeric ranges and array count constraints
Sources/AnyLanguageModel/StructuredGeneration.swift	New file implementing token-level constrained JSON generation with TokenBackend protocol and ConstrainedJSONGenerator
Sources/AnyLanguageModel/Models/SystemLanguageModel.swift	Updated to use schema-based generation for non-String types and added conversion to FoundationModels.DynamicGenerationSchema
Sources/AnyLanguageModel/Models/MLXLanguageModel.swift	Implemented MLXTokenBackend and structured JSON generation with proper token sampling and repetition penalty handling
Sources/AnyLanguageModel/Models/LlamaLanguageModel.swift	Implemented LlamaTokenBackend and structured JSON generation with batch-based decoding and sampler integration
Sources/AnyLanguageModel/GenerationSchema.swift	Added schemaPrompt() method, character validation for JSON strings, improved node equality checking, and support for constraint propagation
Sources/AnyLanguageModel/GenerationGuide.swift	Made GenerationGuide store actual constraint values (min/max, minCount/maxCount) for use during schema generation

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Sources/AnyLanguageModel/StructuredGeneration.swift

Sources/AnyLanguageModel/GenerationSchema.swift

Sources/AnyLanguageModel/Models/MLXLanguageModel.swift

Sources/AnyLanguageModel/StructuredGeneration.swift

Sources/AnyLanguageModel/Models/LlamaLanguageModel.swift

Sources/AnyLanguageModel/StructuredGeneration.swift

mattt · 2026-01-05T14:27:48Z

@eastriverlee Thank you for your contribution! And thank you for your patience. I'll have a chance to look a this soon.

…nerator

mattt · 2026-01-20T14:31:02Z

@eastriverlee Thanks again for your patience. I just rebased, resolving the conflicts as best I could. I recently merged #59, which takes a slightly different approach for schema conversion. I'm working to harmonize these implementations now...

…rompt via JSONSchema

…hema

mattt · 2026-02-03T11:37:07Z

Nice! We've got CI passing, and everything looking good. Just going to give this one last automated PR review, and we should be good to merge.

Copilot

Pull request overview

Copilot reviewed 12 out of 12 changed files in this pull request and generated 8 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Tests/AnyLanguageModelTests/MLXLanguageModelTests.swift

Sources/AnyLanguageModel/StructuredGeneration.swift

Sources/AnyLanguageModel/Models/MLXLanguageModel.swift

Sources/AnyLanguageModel/StructuredGeneration.swift

Sources/AnyLanguageModel/GenerationSchema.swift

Extract Character extension to separate file

Copilot

Pull request overview

Copilot reviewed 15 out of 15 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Sources/AnyLanguageModel/StructuredGeneration.swift

Sources/AnyLanguageModel/GenerationSchema.swift

…e, .count, .minimumCount, .maximumCount instead of GenerationGuide(minimum:maximum:) Updated GenerableMacro to implement Float/Decimal guide methods to set min/max bounds so constraints are preserved

…inators during structured JSON generation, so the sampler can’t select EOS/EOT mid-structure and return a partial/non-object

…lti-string structures

mattt · 2026-02-03T18:47:20Z

Decided to let this one cook a bit more to make sure the implementation was correct. Kicking off another review after making some changes. Let's see what comes back this time...

Copilot

Pull request overview

Copilot reviewed 17 out of 18 changed files in this pull request and generated 21 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Sources/AnyLanguageModel/StructuredGeneration.swift

Sources/AnyLanguageModel/Models/MLXLanguageModel.swift

Sources/AnyLanguageModel/StructuredGeneration.swift

Sources/AnyLanguageModel/Models/MLXLanguageModel.swift

Copilot

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

mattt · 2026-02-04T09:25:57Z

@eastriverlee Merged! Thanks for your work on this and your patience with my review. Really excited to support guided generation / structured output for MLX and Llama.

mattt requested a review from Copilot January 5, 2026 13:56

Copilot started reviewing on behalf of mattt January 5, 2026 13:56 View session

Copilot AI reviewed Jan 5, 2026

View reviewed changes

eastriverlee and others added 9 commits January 20, 2026 06:01

Fix SystemLanguageModel to pass schema for structured generation

ded134c

Implement logit-constrained structured generation for LlamaLanguageModel

0fa246d

Implement logit-constrained structured generation for MLXLanguageModel

be79024

Fix duplicate type crash in schema generation

887964a

Enforce count + numeric range guides

ecae1bd

Respect temperature for structured generation

9fb930b

Refactor Llama and MLX structured generation to shared constrained ge…

77950e3

…nerator

swift format -i -r .

c8fdb9d

Restore SystemLanguageModel.swift from HEAD of main

200886b

mattt force-pushed the main branch from 63bcd14 to 200886b Compare January 20, 2026 14:29

mattt added 4 commits January 20, 2026 06:41

Align structured generation prompts and defaults, and enrich schema p…

51294c6

…rompt via JSONSchema

Respect schema prompt flag and enhance structured prompts with JSONSc…

2488201

…hema

Add documentation comments to helper methods

04650e1

Refactor tests and fixtures

a5ccc1e

mattt requested a review from Copilot February 3, 2026 11:37

Copilot started reviewing on behalf of mattt February 3, 2026 11:37 View session

Copilot AI reviewed Feb 3, 2026

View reviewed changes

mattt added 4 commits February 3, 2026 04:07

Incorporate feedback from review

996db5a

Conform internal GenerationSchema.Node to Equatable

3659992

Refactor and reorganize GenerationSchema

e76f048

Extract Character extension to separate file

Rework StructuredGeneration adding docs and tests

87ef375

mattt requested a review from Copilot February 3, 2026 13:00

Copilot started reviewing on behalf of mattt February 3, 2026 13:00 View session

Copilot AI reviewed Feb 3, 2026

View reviewed changes

Sources/AnyLanguageModel/StructuredGeneration.swift Show resolved Hide resolved

Sources/AnyLanguageModel/StructuredGeneration.swift Outdated Show resolved Hide resolved

Sources/AnyLanguageModel/GenerationSchema.swift Outdated Show resolved Hide resolved

mattt added 7 commits February 3, 2026 05:15

Incorporate feedback from review

2564d59

Update GenerableMacro to build guides using .minimum, .maximum, .rang…

5e5ed58

…e, .count, .minimumCount, .maximumCount instead of GenerationGuide(minimum:maximum:) Updated GenerableMacro to implement Float/Decimal guide methods to set min/max bounds so constraints are preserved

Lower APIs from package to internal access

42e5934

Change var to let

4768a98

Adjust constrained generation to never allow end tokens as valid term…

dfb2f5d

…inators during structured JSON generation, so the sampler can’t select EOS/EOT mid-structure and return a partial/non-object

Tighten free-string budget so we don’t exhaust the token budget on mu…

c0cc2a3

…lti-string structures

Fix eos token logic

8e46927

mattt requested a review from Copilot February 3, 2026 18:46

Copilot started reviewing on behalf of mattt February 3, 2026 18:46 View session

Copilot AI reviewed Feb 3, 2026

View reviewed changes

mattt added 2 commits February 4, 2026 00:45

Incorporate feedback from review

27f39b6

Fix parsing of guide constraints without a description string

5d10c2a

mattt merged commit dd8522b into mattt:main Feb 4, 2026
3 checks passed

Implement structured output generation for both LlamaLanguageModel / MLXLanguageModel #75

Implement structured output generation for both LlamaLanguageModel / MLXLanguageModel #75

Uh oh!

Conversation

eastriverlee commented Dec 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mattt commented Jan 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mattt commented Jan 20, 2026

Uh oh!

mattt commented Feb 3, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mattt commented Feb 3, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

mattt commented Feb 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

eastriverlee commented Dec 24, 2025 •

edited

Loading

mattt commented Jan 5, 2026 •

edited

Loading