greatzh / Image-Forgery-Datasets-List

Available datasets for training and testing the method about Image Forgery Detection and Localization
43 stars 2 forks source link

Image Forgery Datasets

Image Tampering Datasets, Image Manipulations Datasets... A repo intend to collect most of datasets (training and evaluation) for the image forgery detection and localization.

图像篡改检测定位数据集收集

Summary

Dataset name mask Image Format Post-processing Forgery types Real/Forged Images Train/Test Images Download Paper Year
CASIA v1.0 No JPEG, TIFF Splicing, copy move, removal 800/921
CASIA v2.0 No TIFF, JPEG, BMP Yes Splicing, copy move, removal 7491/5123 715/100 http://forensics.idealtest.org/#/ , https://github.com/namtpham/casia1groundtruth , https://github.com/namtpham/casia2groundtruth CASIA Image Tampering Detection Evaluation Database 2013
Columbia Gray No BMP No Splicing 933/912 1845, 933 Authentic, 912 Spliced, 128 x 128 https://www.ee.columbia.edu/ln/dvmm/downloads/AuthSplicedDataSet/AuthSplicedDataSet.htm A data set of authentic and spliced image blocks 2004
Columbia Color Yes TIFF No Splicing 183/180 125/45 Columbia Uncompressed Image Splicing Detection Evaluation Dataset: https://www.ee.columbia.edu/ln/dvmm/downloads/authsplcuncmp/ Detecting Image Splicing using Geometry Invariants and Camera Characteristics Consistency 2006
NIST 2016(Nimble Challenge 2016 (NC16)), NC17, MFC18 JPEG Yes Splicing, copy move, removal 560/564 184/50 NC17: https://www.nist.gov/itl/iad/mig/nimble-challenge-2017-evaluation; MFC18: https://www.nist.gov/itl/iad/mig/media-forensics-challenge-2018; REF: https://mfc.nist.gov/#pills-resources MFC Datasets: Large-Scale Benchmark Datasets for Media Forensic Challenge Evaluation 2019
Fantastic Reality JPEG No Splicing 16592/19423 12000/1000 http://zefirus.org/MAG The Point Where Reality Meets Fantasy: Mixed Adversarial Generators for Image Splice Detection NeurIPS '19
Carvalho yes PNG yes splicing 100/100 94, http://www.ic.unicamp.br/tjose/files/database-tifs-small-resolution.zip (BROKEN) http://ic.unicamp.br/~rocha/pub/downloads/2014-tiago-carvalho-thesis/tifs-database.zip Exposing digital image forgeries by illumination color classification TIFS '13
Realistic Tampering TIFF object insertion , removal. 220 https://pkorus.pl/downloads/dataset-realistic-tampering Multi-scale analysis strategies in prnu-based tampering localization TIFS '16
COVERAGE TIFF No Copy move 100 pairs, 400 x486 https://github.com/wenbihan/coverage COVERAGE — A novel database for copy-move forgery detection ICIP '16
CoMoFoD Yes Copy move 260/260 260 sets https://www.vcl.fer.hr/comofod/ CoMoFoD — New database for copy-move forgery detection 2013
MICC F220/F2000 copy move 110/110, 440/160, 1300/700 2200 A SIFT-Based Forensic Method for Copy–Move Attack Detection and Transformation Recovery TIFS '11
FAU/Manip Yes Copy move 48 http://www5.cs.fau.de/research/data/image-manipulation/index.html An Evaluation of Popular Copy-Move Forgery Detection Approaches TIFS '12
Dresden http://forensics.inf.tu-dresden.de/ddimgdb/ The Dresden Image Database for Benchmarking Digital Image Forensics SAC '10
GRIP Copy move http://www.grip.unina.it/download/prog/CMFD/ Efficient dense-field copy-move forgery detection TIFS '15
IMD2020 Yes Splicing, copy move, removal 35000/35000 2010 http://staff.utia.cas.cz/novozada/db/ An evaluation of popular copy-move forgery detection approaches WACV '20
In the Wild Yes Splicing -/201 https://minyoungg.github.io/selfconsistency/ Fighting Fake News: Image Splice Detection via Learned Self-Consistency ECCV '18
DEFACTO Splicing, copy move, removal -/229000 https://defactodataset.github.io/ http://www.eurecom.fr/en/publication/5973/download/sec-publi-5973.pdf
PS-battles Splicing, copy move, removal 11142/102028 https://github.com/tophatraptor/psdetector.git . Link: https://github.com/dbisUnibas/PS-Battles The PS-battles dataset—an image collection for image manipulation detection CoRR '18
Wild Web Yes PNG splicing 90/9657 Detecting image splicing in the wild (WEB) ICMEW '15
VIPP Synth yes JPEG splicing 4800/4800 Image Forgery Localization via Block-Grained Analysis of JPEG Artifacts TIFS '12
VIPP Real manual JPEG splicing 69/69 Image Forgery Localization via Block-Grained Analysis of JPEG Artifacts TIFS '12
Reddit Image Provenance Analysis at Scale TIP '18
AbhAS JPEG splicing 45/48 AbhAS: A Novel Realistic Image Splicing Forensics Dataset Journal of Applied Security Research '22
MISD Yes JPEG Yes splicing 618/300 https://zenodo.org/records/5525829 Multiple Image Splicing Dataset (MISD): A Dataset for Multiple Splicing Data '21
DSO-1 Yes JPEG splicing 100/100 https://recodbr.wordpress.com/code-n-data/#dso1_dsi1 Exposing digital image forgeries by illumination color classification TIFS '13
DIS25k Yes JPG deep image harmonization splicing 0/24964 21376/3588 https://github.com/99eren99/DIS25k Dataset: https://www.kaggle.com/datasets/erentahir/dis25k Deep Image Composition Meets Image Forgery 2024
tampCOCO Yes https://www.kaggle.com/datasets/qsii24/tampcoco CAT-Net IJCV '22
compRAISE Yes https://www.kaggle.com/datasets/qsii24/compraise CAT-Net IJCV '22
CocoGlide https://www.grip.unina.it/download/prog/TruFor/CocoGlide.zip TruFor: Leveraging all-round clues for trustworthy image forgery detection and localization CVPR '23
2024

Details

CASIA v2

Fantasatic Reality

IMD2020

Paper: IMD2020: A Large-Scale Annotated Dataset Tailored for Detecting Manipulated Images

Project Page: http://staff.utia.cas.cz/novozada/db/

Download: http://staff.utia.cas.cz/novozada/db/

NC16 Splicing

Appear in: [1]P. Zhuang, H. Li, S. Tan, B. Li, and J. Huang, “Image Tampering Localization Using a Dense Fully Convolutional Network,” Ieee T Inf Foren Sec, vol. 16, pp. 2986–2999, 2021, doi: 10.1109/tifs.2021.3070444.

  1. Artificial PS dataset with post-processing on boundary (PS-boundary)

  2. Artificial PS dataset with arbitrary post-processing (PS-arbitrary)

  3. NIST 2016 dataset (NIST-2016)

Image Splicing

Train

Dresden(16961 images)

Paper: The 'Dresden Image Database' for benchmarking digital image forensics SAC '10

Official: http://forensics.inf.tu-dresden.de/dresden_image_database/ (Cannot be ACCESSED)

DEFACTO database

Vision(34427 images)

Socrates(8742 images)

FODB (23106 images)

Kaggle (2750 images)

Test

Columbia (363 images, 180 spliced)

https://www.ee.columbia.edu/ln/dvmm/newDownloads.htm

Columbia Uncompressed Image Splicing DetectionColumbia Uncompressed Image Splicing Detection: https://www.dropbox.com/sh/786qv3yhvc7s9ki/AACbEEzGPrD3_y38bpWHzgdqa?dl=0

Columbia Image Splicing Detection Evaluation Dataset: https://www.dropbox.com/s/bo10et4p1zg08aj/ImSpliceDataset.rar?dl=0

Carvalho/DSO (200 images, 100 spliced)

Realistic Tampering (RT)/Korus (440 images, 220 spliced)

Description: This dataset contains 220 realistic forgeries created by hand in modern photo-editing software (GIMP and Affinity Photo) and covers various challenging tampering scenarios involving both object insertion and removal...

Citation: Pawel Korus and Jiwu Huang. Multi-scale analysis strategiesin prnu-based tampering localization. Trans. Info. For. Sec.,12(4):809–824, apr 2017.

Page: https://pkorus.pl/downloads/dataset-realistic-tampering

Download: realistic-tampering-dataset.zip via google drive (1.7 GB) you'll need request access

Copy Move

GRIP

CPH

GRIP

CMH

CoMoFoD

Small image category database: 512 x 512, 200 image sets, 40 images per transformation type, total number of images with post-processed images = 10400. Large image category database (please contact us for downloading): 3000 x 2000, 60 image sets, 10/20 images per transformation type, total number of images with postprocessed images = 3120.

Removal

Image Inpainting

DeepFake

Useful Link

  1. Information Forensics and Security
  2. Forensics Dataset
  3. Pattern Recognition Lab, FAU, https://lme.tf.fau.de/category/dataset/
Name Paper Project Download Type Remarks
Columbia
CASIA
MICC
IMD
CoMoFoD
COVERAGE
GRIP
FAU