Skip to content

[SCENARIO] Research Sandbox #96

@shchukins

Description

@shchukins

Problem

Automated experimentation can be valuable later, but it must not enter the product core before feedback, snapshots, calibration analytics, and dataset export are stable. Research should stay offline, reproducible, and review-gated.

User-facing flow

  1. Calibration dataset export produces reproducible JSONL/CSV records.
  2. Offline scripts run controlled parameter sweeps or model comparisons.
  3. Backtests produce validation reports.
  4. Human review decides whether deterministic formulas, thresholds, or recommendation rules should change.
  5. Accepted changes are versioned and documented before they affect production outputs.

Definition of done

  • Sandbox runs outside production decision paths.
  • Experiments are reproducible and tied to exported dataset versions.
  • Reports compare model versions against subjective outcomes and data quality.
  • No LLM or automated research loop changes readiness, recommendations, or calibration without review.

Linked existing issues

Migration note

Created from the accepted end-to-end epic proposal in docs/product/END_TO_END_EPICS_PROPOSAL.md. This epic intentionally keeps speculative ML and automated experimentation separate from the deterministic production core.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    Status

    No status

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions