Question about DITTO model benchmarks mentioned in new paper about the SAMA Model (Open Sourced Instruction-Guided Video Editing model based on Wan2.1) #35

@CCpt5

Hello,

There is a new Wan2.1-based video editing model released this week called SAMA (https://github.com/Cynthiazxy123/SAMA). They used some of your amazing DITTO-1m dataset for training, and mention your model a number of times in reference to their results (Paper: https://arxiv.org/abs/2603.19228).

I assume that, since the true DITTO edit model has not, afaik, been released, the scores they used for DITTO's benchmarks came from either the STYLE or GLOBAL models, correct? If this is an accurate assumption, I'm wondering how you feel the model you trained/partially trained specifically for editing would fare against SAMA?

Thanks again for sharing all of your code and models for Ditto! Ditto-1m is clearly helping others continue research into this exciting field!
