Skip to content

Support dynamic LiteRT-LM LoRA adapters and web parity #264

Description

@leehack

Summary

Split from #173 after initial native LiteRT-LM load-time LoRA support landed.

Current llamadart support covers one default-scale text LoRA adapter supplied through ModelParams.loras when loading a native .litertlm model. The remaining parity work is dynamic/runtime LoRA control and web support.

Remaining work

  • Support runtime setLora, removeLora, and clearLoras for native LiteRT-LM contexts when the runtime exposes stable APIs for dynamic adapter changes.
  • Support adapter stacking and custom scales if the LiteRT-LM runtime defines compatible semantics.
  • Support LiteRT-LM web LoRA if @litert-lm/core exposes equivalent browser runtime APIs.
  • Keep unsupported combinations failing early with clear errors until runtime support exists.
  • Add regression tests and at least one real-model smoke once test artifacts are available.

Current blocker

This needs upstream/native runtime API support beyond the current session-config lora-path setter. Dart should not bind to mangled C++ internals.

Related

Metadata

Metadata

Assignees

No one assigned

    Labels

    blocked-runtimeRequires upstream or native runtime support before Dart package work can complete.enhancementNew feature or requestlitert-lm-parityLiteRT-LM backend parity features

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions