[NeurIPS'24 Spotlight] IMDL-BenCo: Comprehensive Benchmark and Codebase for Image Manipulation Detection & Localization
Xiaochen Ma †, Xuekang Zhu†, Lei Su†, Bo Du†, Zhuohang Jiang†, Bingkui Tong†, Zeyu Lei†, Xinyu Yang†, Chi-Man Pun, Jiancheng Lv, Jizhe Zhou *
†: joint first author & equal contribution *: corresponding author
🏎️Special thanks to Dr. Wentao Feng for the workplace, computation power, and physical infrastructure support.
-
[2025/03/11] We have released pre-trained checkpoints on Baidu NetDisk that we utilized to report all metrics in the paper. Please see this section in our documents for details.
-
[2024/12/10] Mesorch
, our new IML backbone model, which adopts a parallel CNN+Transformer structure to simultaneously deal with image semantics and non-semantics, is accepted by AAAI 25!!!🎉🎉🎉
-
[2024/12/10] Sparse-ViT
, the very first solution of constructing the non-semantic feature extractor through a self-supervised manner in IML is proposed by us and accepted by AAAI 25!!!🎉🎉🎉
-
[2024/09/26] This paper, IMDL-BenCo, has been accepted as Spotlight to NeurIPS 2024 Track Datasets and Benchmarks!!! 🎉🎉🎉
Warning
Existing bug:
- The
Image-Acc
evaluator may not return an accurate value under the current version during multi-GPU inference. Please be careful with it! We are locating the exact issue. Details found in issue: #42. Thanks @iamwangyabin for reporting the issue.
Important
Known Differences with original CAT-Net Protocol
- The CAT-Protocol (implementation of default balanced_dataset.py) used in the IMDLBenCo paper differ from the original CAT-Net settings. Several real image datasets are omitted. Please pay special attention! For more details, please check issue #65.
☑️Welcome to IMDL-BenCo, the first comprehensive IMDL benchmark and modular codebase.
- This codebase is under long-term maintenance and updating. New features, extra baseline/sota models, and bug fixes will be continuously involved. You can find the corresponding plan here shortly.
- This repo decomposes the IMDL framework into standardized, reusable components and revises the model construction pipeline, improving coding efficiency and customization flexibility.
- This repo fully implements or incorporates training code for state-of-the-art models to establish a comprehensive IMDL benchmark.
- Cite and star if you feel helpful. This will encourage us a lot 🥰.
IMDL-BenCo is a Python library managed on PYPI now, It's easy to install by following the command:
pip install imdlbenco
To verify your installation, you can try the following commands:
benco -v
Of course, the following command is also okay:
benco --version
This repository is under rapid development, thus, you can also use the command above to check if the current version is our latest version.
If everything works well, it should looks like this:
IMDLBenCo codebase version: 0.1.23
Checking for updates...
Local version: 0.1.23
PyPI newest version: 0.1.23
You are using the latest version: 0.1.23.
For further guidance, please click the buttons below for official documentation:
We will keep updating the document with tricks and user cases. Please stay tuned!
We also welcome contributors to translate it into other languages.
- OpenSDI: A large dataset for Spotting Diffusion-Generated Images in the Open World and a corresponding SoTA model. CVPR25.
- Sparse-ViT: A SoTA model constructing the non-semantic feature extractor through a sparse-designed attention transformer. AAAI'25.
- Mesorch: A SoTA model adopts a parallel CNN+Transformer structure to simultaneously deal with image semantics and non-semantics. AAAI'25.
- IML-ViT: A pure Vision Transformer based model for IML task, which easy to be extent for further reserch. ArXiv.
☑️About the Developers:
- IMDL-BenCo's project leader/supervisor is Associate Professor 🏀Jizhe Zhou (周吉喆), Sichuan University🇨🇳.
- IMDL-BenCo's codebase designer and coding leader is the Research Assitant Xiaochen Ma (马晓晨), Sichuan University🇨🇳.
- IMDL-BenCo is jointly sponsored and advised by Prof. Jiancheng LV (吕建成), Sichuan University 🐼, and Prof. Chi-Man PUN (潘治文), University of Macau 🇲🇴, through the Research Center of Machine Learning and Industry Intelligence, China MOE platform.
If you find our work valuable and it has contributed to your research or projects, we kindly request that you cite our paper. Your recognition is a driving force for our continuous improvement and innovation🤗.
@article{ma2025imdl,
title={Imdl-benco: A comprehensive benchmark and codebase for image manipulation detection \& localization},
author={Ma, Xiaochen and Zhu, Xuekang and Su, Lei and Du, Bo and Jiang, Zhuohang and Tong, Bingkui and Lei, Zeyu and Yang, Xinyu and Pun, Chi-Man and Lv, Jiancheng and others},
journal={Advances in Neural Information Processing Systems},
volume={37},
pages={134591--134613},
year={2025}
}