Skip to content

[NeurIPS'24 Spotlight] A comprehensive benchmark & codebase for Image manipulation detection/localization.

License

Notifications You must be signed in to change notification settings

scu-zjz/IMDLBenCo

Repository files navigation

OSQ

[NeurIPS'24 Spotlight] IMDL-BenCo: Comprehensive Benchmark and Codebase for Image Manipulation Detection & Localization

Xiaochen Ma †, Xuekang Zhu†, Lei Su†, Bo Du†, Zhuohang Jiang†, Bingkui Tong†, Zeyu Lei†, Xinyu Yang†, Chi-Man Pun, Jiancheng Lv, Jizhe Zhou *


†: joint first author & equal contribution *: corresponding author
🏎️Special thanks to Dr. Wentao Feng for the workplace, computation power, and physical infrastructure support.

Arxiv Documents PyPI Downloads pypi version license

Action-Test Codecov

News

  • [2025/03/11] We have released pre-trained checkpoints on Baidu NetDisk that we utilized to report all metrics in the paper. Please see this section in our documents for details.

  • [2024/12/10] Mesorch Stars, our new IML backbone model, which adopts a parallel CNN+Transformer structure to simultaneously deal with image semantics and non-semantics, is accepted by AAAI 25!!!🎉🎉🎉

  • [2024/12/10] Sparse-ViT Stars, the very first solution of constructing the non-semantic feature extractor through a self-supervised manner in IML is proposed by us and accepted by AAAI 25!!!🎉🎉🎉

  • [2024/09/26] This paper, IMDL-BenCo, has been accepted as Spotlight to NeurIPS 2024 Track Datasets and Benchmarks!!! 🎉🎉🎉

Warning

Existing bug:

  1. The Image-Acc evaluator may not return an accurate value under the current version during multi-GPU inference. Please be careful with it! We are locating the exact issue. Details found in issue: #42. Thanks @iamwangyabin for reporting the issue.

Important

Known Differences with original CAT-Net Protocol

  1. The CAT-Protocol (implementation of default balanced_dataset.py) used in the IMDLBenCo paper differ from the original CAT-Net settings. Several real image datasets are omitted. Please pay special attention! For more details, please check issue #65.

Overview

☑️Welcome to IMDL-BenCo, the first comprehensive IMDL benchmark and modular codebase.

  • This codebase is under long-term maintenance and updating. New features, extra baseline/sota models, and bug fixes will be continuously involved. You can find the corresponding plan here shortly.
  • This repo decomposes the IMDL framework into standardized, reusable components and revises the model construction pipeline, improving coding efficiency and customization flexibility.
  • This repo fully implements or incorporates training code for state-of-the-art models to establish a comprehensive IMDL benchmark.
  • Cite and star if you feel helpful. This will encourage us a lot 🥰.

Quick Start

IMDL-BenCo is a Python library managed on PYPI now, It's easy to install by following the command:

pip install imdlbenco

To verify your installation, you can try the following commands:

benco -v

Of course, the following command is also okay:

benco --version

This repository is under rapid development, thus, you can also use the command above to check if the current version is our latest version.

If everything works well, it should looks like this:

IMDLBenCo codebase version: 0.1.23
        Checking for updates...
        Local version:  0.1.23
        PyPI newest version:  0.1.23
You are using the latest version: 0.1.23.

For further guidance, please click the buttons below for official documentation:

Documents Documents

Documents Documents

We will keep updating the document with tricks and user cases. Please stay tuned!

We also welcome contributors to translate it into other languages.

Awesome Works Using IMDLBenCo

  • OpenSDI: A large dataset for Spotting Diffusion-Generated Images in the Open World and a corresponding SoTA model. CVPR25. GitHub Repo stars
  • Sparse-ViT: A SoTA model constructing the non-semantic feature extractor through a sparse-designed attention transformer. AAAI'25. Stars
  • Mesorch: A SoTA model adopts a parallel CNN+Transformer structure to simultaneously deal with image semantics and non-semantics. AAAI'25. Stars
  • IML-ViT: A pure Vision Transformer based model for IML task, which easy to be extent for further reserch. ArXiv. Stars

About

☑️About the Developers:

Citation

If you find our work valuable and it has contributed to your research or projects, we kindly request that you cite our paper. Your recognition is a driving force for our continuous improvement and innovation🤗.

@article{ma2025imdl,
  title={Imdl-benco: A comprehensive benchmark and codebase for image manipulation detection \& localization},
  author={Ma, Xiaochen and Zhu, Xuekang and Su, Lei and Du, Bo and Jiang, Zhuohang and Tong, Bingkui and Lei, Zeyu and Yang, Xinyu and Pun, Chi-Man and Lv, Jiancheng and others},
  journal={Advances in Neural Information Processing Systems},
  volume={37},
  pages={134591--134613},
  year={2025}
}

Statistics and Star History