Handle Ambiguous Scene Descriptions in Synthetic Evaluation #5

@creativeessence

Problem

Synthetic queries like:

“2 characters fight each other with guns”

may match multiple scenes in a video. Miners may return a different valid fight scene than the validator’s chosen ground-truth clip, causing unfair penalties.

Why It Matters

This directly affects:

  • Miner fairness
  • Validator scoring integrity
  • Incentive alignment on Bittensor

Proposed Direction

  • Enforce temporal grounding via additional constraints
  • Use scene disambiguation hints (e.g., clothing, location)
  • Penalize only if the semantic mismatch exceeds a threshold
  • Allow multiple valid ground-truth intervals
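A minimal sketch of how the last two bullets could fit together, assuming temporal IoU (intersection over union of time intervals) stands in for the mismatch measure and that the validator can enumerate every valid interval for a query. The names `Interval`, `temporal_iou`, `best_iou`, `is_penalized`, and the `max_mismatch` default are hypothetical and not taken from the existing validator code.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Interval:
    """A clip span in seconds (hypothetical type for illustration)."""
    start: float
    end: float


def temporal_iou(a: Interval, b: Interval) -> float:
    """Intersection over union of two time intervals, in [0, 1]."""
    inter = max(0.0, min(a.end, b.end) - max(a.start, b.start))
    union = (a.end - a.start) + (b.end - b.start) - inter
    return inter / union if union > 0 else 0.0


def best_iou(pred: Interval, ground_truths: Sequence[Interval]) -> float:
    """Score the prediction against its best-matching valid interval."""
    return max((temporal_iou(pred, gt) for gt in ground_truths), default=0.0)


def is_penalized(
    pred: Interval,
    ground_truths: Sequence[Interval],
    max_mismatch: float = 0.5,  # assumed threshold, to be tuned
) -> bool:
    """Penalize only when mismatch (1 - best IoU) exceeds the threshold."""
    return (1.0 - best_iou(pred, ground_truths)) > max_mismatch


# Two fight scenes in the same video both satisfy the query.
valid_scenes = [Interval(10.0, 20.0), Interval(95.0, 105.0)]
miner_pred = Interval(96.0, 106.0)  # miner found the second scene

single_gt_score = temporal_iou(miner_pred, valid_scenes[0])  # current behavior
multi_gt_score = best_iou(miner_pred, valid_scenes)          # proposed behavior
```

With a single ground-truth clip the miner above scores zero despite returning a valid scene; scoring against the best match among all valid intervals avoids that. Taking the maximum rather than the mean keeps a miner from being penalized for scenes it was never asked to return.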

Acceptance Criteria

  • Validator scoring does not unfairly penalize semantically correct alternative scenes.
  • Evaluation metrics are robust to multi-instance events.

Labels: enhancement (New feature or request)
