Image Forgery Datasets
Image Tampering Datasets, Image Manipulations Datasets... A repo intend to collect most of datasets (training and evaluation) for the image forgery detection and localization.
图像篡改检测定位数据集收集
Summary
Dataset name |
mask |
Image Format |
Post-processing |
Forgery types |
Real/Forged Images |
Train/Test Images |
Download |
Paper |
Year |
CASIA v1.0 |
No |
JPEG, TIFF |
|
Splicing, copy move, removal |
800/921 |
|
|
|
|
CASIA v2.0 |
No |
TIFF, JPEG, BMP |
Yes |
Splicing, copy move, removal |
7491/5123 |
715/100 |
http://forensics.idealtest.org/#/ , https://github.com/namtpham/casia1groundtruth , https://github.com/namtpham/casia2groundtruth |
CASIA Image Tampering Detection Evaluation Database |
2013 |
Columbia Gray |
No |
BMP |
No |
Splicing |
933/912 |
1845, 933 Authentic, 912 Spliced, 128 x 128 |
https://www.ee.columbia.edu/ln/dvmm/downloads/AuthSplicedDataSet/AuthSplicedDataSet.htm |
A data set of authentic and spliced image blocks |
2004 |
Columbia Color |
Yes |
TIFF |
No |
Splicing |
183/180 |
125/45 |
Columbia Uncompressed Image Splicing Detection Evaluation Dataset: https://www.ee.columbia.edu/ln/dvmm/downloads/authsplcuncmp/ |
Detecting Image Splicing using Geometry Invariants and Camera Characteristics Consistency |
2006 |
NIST 2016(Nimble Challenge 2016 (NC16)), NC17, MFC18 |
|
JPEG |
Yes |
Splicing, copy move, removal |
560/564 |
184/50 |
NC17: https://www.nist.gov/itl/iad/mig/nimble-challenge-2017-evaluation; MFC18: https://www.nist.gov/itl/iad/mig/media-forensics-challenge-2018; REF: https://mfc.nist.gov/#pills-resources |
MFC Datasets: Large-Scale Benchmark Datasets for Media Forensic Challenge Evaluation |
2019 |
Fantastic Reality |
|
JPEG |
No |
Splicing |
16592/19423 |
12000/1000 |
http://zefirus.org/MAG |
The Point Where Reality Meets Fantasy: Mixed Adversarial Generators for Image Splice Detection |
NeurIPS '19 |
Carvalho |
yes |
PNG |
yes |
splicing |
100/100 |
94, |
http://www.ic.unicamp.br/tjose/files/database-tifs-small-resolution.zip (BROKEN) http://ic.unicamp.br/~rocha/pub/downloads/2014-tiago-carvalho-thesis/tifs-database.zip |
Exposing digital image forgeries by illumination color classification |
TIFS '13 |
Realistic Tampering |
|
TIFF |
|
object insertion , removal. |
|
220 |
https://pkorus.pl/downloads/dataset-realistic-tampering |
Multi-scale analysis strategies in prnu-based tampering localization |
TIFS '16 |
COVERAGE |
|
TIFF |
No |
Copy move |
|
100 pairs, 400 x486 |
https://github.com/wenbihan/coverage |
COVERAGE — A novel database for copy-move forgery detection |
ICIP '16 |
CoMoFoD |
|
|
Yes |
Copy move |
260/260 |
260 sets |
https://www.vcl.fer.hr/comofod/ |
CoMoFoD — New database for copy-move forgery detection |
2013 |
MICC F220/F2000 |
|
|
|
copy move |
110/110, 440/160, 1300/700 |
2200 |
|
A SIFT-Based Forensic Method for Copy–Move Attack Detection and Transformation Recovery |
TIFS '11 |
FAU/Manip |
|
|
Yes |
Copy move |
|
48 |
http://www5.cs.fau.de/research/data/image-manipulation/index.html |
An Evaluation of Popular Copy-Move Forgery Detection Approaches |
TIFS '12 |
Dresden |
|
|
|
|
|
|
http://forensics.inf.tu-dresden.de/ddimgdb/ |
The Dresden Image Database for Benchmarking Digital Image Forensics |
SAC '10 |
GRIP |
|
|
|
Copy move |
|
|
http://www.grip.unina.it/download/prog/CMFD/ |
Efficient dense-field copy-move forgery detection |
TIFS '15 |
IMD2020 |
|
|
Yes |
Splicing, copy move, removal |
35000/35000 |
2010 |
http://staff.utia.cas.cz/novozada/db/ |
An evaluation of popular copy-move forgery detection approaches |
WACV '20 |
In the Wild |
Yes |
|
|
Splicing |
-/201 |
|
https://minyoungg.github.io/selfconsistency/ |
Fighting Fake News: Image Splice Detection via Learned Self-Consistency |
ECCV '18 |
DEFACTO |
|
|
|
Splicing, copy move, removal |
-/229000 |
|
https://defactodataset.github.io/ |
http://www.eurecom.fr/en/publication/5973/download/sec-publi-5973.pdf |
|
PS-battles |
|
|
|
Splicing, copy move, removal |
11142/102028 |
|
https://github.com/tophatraptor/psdetector.git . Link: https://github.com/dbisUnibas/PS-Battles |
The PS-battles dataset—an image collection for image manipulation detection |
CoRR '18 |
Wild Web |
Yes |
PNG |
|
splicing |
90/9657 |
|
|
Detecting image splicing in the wild (WEB) |
ICMEW '15 |
VIPP Synth |
yes |
JPEG |
|
splicing |
4800/4800 |
|
|
Image Forgery Localization via Block-Grained Analysis of JPEG Artifacts |
TIFS '12 |
VIPP Real |
manual |
JPEG |
|
splicing |
69/69 |
|
|
Image Forgery Localization via Block-Grained Analysis of JPEG Artifacts |
TIFS '12 |
Reddit |
|
|
|
|
|
|
|
Image Provenance Analysis at Scale |
TIP '18 |
AbhAS |
|
JPEG |
|
splicing |
45/48 |
|
|
AbhAS: A Novel Realistic Image Splicing Forensics Dataset |
Journal of Applied Security Research '22 |
MISD |
Yes |
JPEG |
Yes |
splicing |
618/300 |
|
https://zenodo.org/records/5525829 |
Multiple Image Splicing Dataset (MISD): A Dataset for Multiple Splicing |
Data '21 |
DSO-1 |
Yes |
JPEG |
|
splicing |
100/100 |
|
https://recodbr.wordpress.com/code-n-data/#dso1_dsi1 |
Exposing digital image forgeries by illumination color classification |
TIFS '13 |
DIS25k |
Yes |
JPG |
deep image harmonization |
splicing |
0/24964 |
21376/3588 |
https://github.com/99eren99/DIS25k Dataset: https://www.kaggle.com/datasets/erentahir/dis25k |
Deep Image Composition Meets Image Forgery |
2024 |
tampCOCO |
Yes |
|
|
|
|
|
https://www.kaggle.com/datasets/qsii24/tampcoco |
CAT-Net |
IJCV '22 |
compRAISE |
Yes |
|
|
|
|
|
https://www.kaggle.com/datasets/qsii24/compraise |
CAT-Net |
IJCV '22 |
CocoGlide |
|
|
|
|
|
|
https://www.grip.unina.it/download/prog/TruFor/CocoGlide.zip |
TruFor: Leveraging all-round clues for trustworthy image forgery detection and localization |
CVPR '23 |
|
|
|
|
|
|
|
|
|
2024 |
Details
CASIA v2
Fantasatic Reality
IMD2020
Paper: IMD2020: A Large-Scale Annotated Dataset Tailored for Detecting Manipulated Images
Project Page: http://staff.utia.cas.cz/novozada/db/
Download: http://staff.utia.cas.cz/novozada/db/
NC16 Splicing
Appear in: [1]P. Zhuang, H. Li, S. Tan, B. Li, and J. Huang, “Image Tampering Localization Using a Dense Fully Convolutional Network,” Ieee T Inf Foren Sec, vol. 16, pp. 2986–2999, 2021, doi: 10.1109/tifs.2021.3070444.
-
Artificial PS dataset with post-processing on boundary (PS-boundary)
-
Artificial PS dataset with arbitrary post-processing (PS-arbitrary)
-
NIST 2016 dataset (NIST-2016)
Image Splicing
Train
Dresden(16961 images)
Paper: The 'Dresden Image Database' for benchmarking digital image forensics SAC '10
Official: http://forensics.inf.tu-dresden.de/dresden_image_database/ (Cannot be ACCESSED)
DEFACTO database
Vision(34427 images)
Socrates(8742 images)
FODB (23106 images)
Kaggle (2750 images)
Test
Columbia (363 images, 180 spliced)
https://www.ee.columbia.edu/ln/dvmm/newDownloads.htm
Columbia Uncompressed Image Splicing DetectionColumbia Uncompressed Image Splicing Detection: https://www.dropbox.com/sh/786qv3yhvc7s9ki/AACbEEzGPrD3_y38bpWHzgdqa?dl=0
Columbia Image Splicing Detection Evaluation Dataset: https://www.dropbox.com/s/bo10et4p1zg08aj/ImSpliceDataset.rar?dl=0
Carvalho/DSO (200 images, 100 spliced)
Realistic Tampering (RT)/Korus (440 images, 220 spliced)
Description: This dataset contains 220 realistic forgeries created by hand in modern photo-editing software (GIMP and Affinity Photo) and covers various challenging tampering scenarios involving both object insertion and removal...
Citation: Pawel Korus and Jiwu Huang. Multi-scale analysis strategiesin prnu-based tampering localization. Trans. Info. For. Sec.,12(4):809–824, apr 2017.
Page: https://pkorus.pl/downloads/dataset-realistic-tampering
Download: realistic-tampering-dataset.zip via google drive (1.7 GB) you'll need request access
Copy Move
GRIP
CPH
GRIP
CMH
CoMoFoD
Small image category database: 512 x 512, 200 image sets, 40 images per transformation type, total number of images with post-processed images = 10400. Large image category database (please contact us for downloading): 3000 x 2000, 60 image sets, 10/20 images per transformation type, total number of images with postprocessed images = 3120.
Removal
Image Inpainting
DeepFake
Useful Link
- Information Forensics and Security
- Forensics Dataset
- Pattern Recognition Lab, FAU, https://lme.tf.fau.de/category/dataset/
Name |
Paper |
Project |
Download |
Type |
Remarks |
Columbia |
|
|
|
|
|
CASIA |
|
|
|
|
|
MICC |
|
|
|
|
|
IMD |
|
|
|
|
|
CoMoFoD |
|
|
|
|
|
COVERAGE |
|
|
|
|
|
GRIP |
|
|
|
|
|
FAU |
|
|
|
|
|
|
|
|
|
|
|