GitHub - Eliot-Shen/DF-LLaVA: DF-LLaVA: Unlocking MLLMs for Synthetic Image Detection via Knowledge Injection and Conflict-Driven Self-Reflection

DF-LLaVA: Unlocking MLLMs for Synthetic Image Detection via Knowledge Injection and Conflict-Driven Self-Reflection

Paper

News

2025.9.23 🤗 We have released the code and classifier weights.
2025.9.18 🔥 We have released DF-LLaVA: Unlocking MLLM's potential for Synthetic Image Detection via Prompt-Guided Knowledge Injection. Check out the paper. We present DF-LLaVA model.

Evaluate image authenticity and obtain comprehensive artifact explanations


DF-LLaVA provides comprehensive artifact-level interpretability with detection accuracy outperforming expert models.

Overview of DF-LLaVA during inference. DF-LLaVA leverages its frozen vision encoder via a binary classifier for initial authenticity estimation. The probabilistic output is used as reference in prompts, based on which DF-LLaVA makes its prediction. The prediction then undergoes a conflict check and a possible self-reflection process from model to ensure its precision and robustness. Finally, artifacts are explained from various perspectives.

[
  {
    "image": "ff++/fake/Deepfakes/c23/frames/071_054/160.png",
    "label": 0,
    "cate": "deepfake",
    "width": 256,
    "height": 256,
    "conversations": [
      {
        "from": "human",
        "value": "<image>Does the image looks real/fake?"
      },
      {
        "from": "gpt",
        "value": "..."
      }
    ],
    "confidence_score": 0.9914323091506958 
  },
]

4.Train the LLaVA

sh ./scripts/train_dfllava.sh

Make sure to set "data_path" to the location of your augmented train.json.

Evaluation

Please download the test data used in the paper from FakeClue, LOKI and DMImage.

BibTeX

@article{Shen2025DFLLaVA,
      title={DF-LLaVA: Unlocking MLLMs for Synthetic Image Detection via Knowledge Injection and Conflict-Driven Self-Reflection}, 
      author={Zhuokang Shen and Kaisen Zhang and Bohan Jia and Heming Jia and Yuan Fang and Zhou Yu and Shaohui Lin},
      journal={arXiv preprint arXiv:2509.14957},
      year={2025}
}

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
images		images
llava.egg-info		llava.egg-info
llava		llava
scripts		scripts
README.md		README.md
pyproject.toml		pyproject.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DF-LLaVA: Unlocking MLLMs for Synthetic Image Detection via Knowledge Injection and Conflict-Driven Self-Reflection

Paper

News

Evaluate image authenticity and obtain comprehensive artifact explanations

Contents

Install

Models

Training

1.Download training data

2.Train the auxiliary classifier

3. Augment the train set

4.Train the LLaVA

Evaluation

BibTeX

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

DF-LLaVA: Unlocking MLLMs for Synthetic Image Detection via Knowledge Injection and Conflict-Driven Self-Reflection

Paper

News

Evaluate image authenticity and obtain comprehensive artifact explanations

Contents

Install

Models

Training

1.Download training data

2.Train the auxiliary classifier

3. Augment the train set

4.Train the LLaVA

Evaluation

BibTeX

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages