Datasets
Multimedia Forensics
GAN Inversion
This dataset contains face images from different datasets and their reconstruction through the StyleGAN2 inversion process.
The dataset can be downloaded here.
TrueFace
TrueFace is a dataset containing real and synthetic human faces generated by StyleGAN and StyleGAN2 generative models and shared on three popular social networks (Facebook, Telegram, and Twitter), for a total of 210k images.
The dataset can be downloaded here.
SHADE
SHADE (SHAring DEvice) is a collection of images shared on WhatsApp from different types of devices, operating systems and user interfaces (e.g., mobile, desktop, browser).
The dataset can be downloaded here.
Perception of Synthetic Faces
We release the dataset and collected data of the work:
F. Lago, C. Pasquini, R. Böhme, H. Dumont, V. Goffaux, G. Boato, "More Real than Real: A Study on Human Visual Perception of Synthetic Faces", accepted for publication in IEEE Signal Processing Magazine (2021).
The material can be downloaded here.
FF++ Social
FF++ Social contains videos from the widely known dataset FaceForensics++ (validation and testing splits) that have been shared through Facebook and YouTube. The dataset is intended to support researchers in multimedia forensics in evaluating the generalization ability of forensic detectors when dealing with data circulating on the web through popular sharing platforms.
The FF++ Social dataset has been used in the work:
F. Marcon, C. Pasquini, G. Boato, "Detection of manipulated face videos over social networks:
a large-scale study", submitted to Journal of Imaging (under minor revision).
RAISE
RAISE is a challenging real-world image dataset, primarily designed for the evaluation of digital forgery detection algorithms.
ISIMA
The ISIMA (Images Sharing via Instant Messaging App) dataset contains images that are shared between Android phone and iOS phone via three instant messaging applications Facebook Messenger, Whatsapp, and Telegram. All images are stored in JPEG format
RSMUD & VSMUD
To facilitate research on tracking social network origin of images, we collect two datasets: R-SMUD (RAISE Social Multiple Up-Download), and V-SMUD (VISION Social Multiple Up-Download).
All images are shared at maximum three times through three platforms: Facebook (FB), Flickr (FL), Twitter (TW).
DECEPTION
Computer Vision and Behaviour Analysis
Synthetic Crowds
All the videos in the dataset are retrieved from YouTube; 25 of them are recorded using car mounted Dash-Cams, the remaining ones have been taken by other devices such as mobile phones.
Re-DID
All the videos in the dataset are retrieved from YouTube; 25 of them are recorded using car mounted Dash-Cams, the remaining ones have been taken by other devices such as mobile phones.
UCD
The UNITN Crowd Dataset consists of two video sequences which have been segmented into two different sub-sequences each, and used for both crowd motion segmentation and anomaly detection.
USID
The UNITN Social Interaction(USI) Dataset consists of 4 types of two-person interactions: Talking, Shaking, Hugging and Fighting. Each type of two-person interaction has 16 samples, with the total number of 16x4 = 64 samples.
Semantic Retrieval
ACM EiMM DB
EventMask
The EventMask Dataset consists of two archives of images. Each archive contains three directories, one with the original images used for the game and another one with the event saliency maps already in binary form.