mediatechnologycenter / Aestheval

Code for the paper "Understanding Aesthetics with Language: A Photo Critique Dataset for Aesthetic Assessment"
Creative Commons Attribution 4.0 International
82 stars 5 forks source link
aesthetic-image-captioning image-aesthetic-assessment photo-critique-dataset

Image aesthetic assessment framework

This repository makes easy to access and process different datasets used for image aesthetic assessment methods, as well as the newly introduced Reddit Photo Critique Dataset (RPCD). It contains the framework for running the experiments described in the paper 'Understanding Aesthetics with Language: A Photo Critique Dataset for Aesthetic Assessment'.

RPCD: Reddit Photo Critique Dataset

The Reddit Photo Critique Dataset (RPCD) contains tuples of image and photo critiques. It consists of 74K images and 220K comments and is collected from a Reddit community used by hobbyists and professional photographers to improve their photography skills by leveraging constructive community feedback.

The proposed dataset differs from previous aesthetics datasets mainly in three aspects, namely:

A detailed description of the dataset can be found on the datasheet.

Models

ViT L/16 models trained on AVA, PCCD and RPCD datasets to predict the proposed sentiment score can be found here.

Steps

The files available in DOI correspond to the ids of the posts, divided in the splits used in the experiments of our work.

Steps for running the experiments

1. Create environment and install requirements

conda env create -n aestheval
pip install -e .

2. Download datasets

Follow the data README to download the RPCD dataset.

3. Process RPCD

python aestheval/data/reddit/prepare_dataset.py

Default --reddit_dir is data/.

4. Predict sentiment of comments and compute informativeness score

On RPCD dataset (takes a while)

python main.py --compute_sentiment_score --compute_informativeness_score --dataset_name Reddit

Already processed files can be found under data/. This directory can be changed using the --data_path argument:

Repository structure

The repository is structured as follows:

Citation

If you use this repository, please cite the following paper:

@inproceedings{nieto2022understanding,
    title={Understanding Aesthetics with Language: A Photo Critique Dataset for Aesthetic Assessment},
    author={Daniel Vera Nieto and Luigi Celona and Clara Fernandez Labrador},
    booktitle={Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track},
    year={2022},
    url={https://openreview.net/forum?id=-VyJim9UBxQ}
}