The ArtWhisperer Dataset captures human interactions with an image-generation AI model (variants of the Stable Diffusion model) as they attempt to generate a specified target image. This dataset includes the trajectory of prompts and images generated, the target image, and a score trajectory for each user-AI interaction.
More details on the dataset can be found in our paper: ArtWhisperer: A Dataset for Characterizing Human-AI Interactions in Artistic Creations
Our dataset is available on Hugging Face. Loading from Hugging Face requires installation of the datasets library (pip install datasets
). Example code is below:
from datasets import load_dataset
dataset = load_dataset("kailasv/ArtWhisperer")
The dataset contains two splits train
and validation
. Details on how these are defined are described in our paper. The validation
dataset contains additional information where the humans interacting with the model also gave their own ratings for how close their generated images are to the target image.
Each data instance contains several entries:
user_id
: string identifying the usertarget_id
: string identifying the target image
target_image
: a PIL Image of the target image the user was tasked with generatingtarget_positive_prompt
: Description of the target imagetarget_negative_prompt
: Negative description of the target image (for all target images we used, this is an empty string)target_image_embedding
: CLIP image embedding oftarget_image
target_positive_text_embedding
: CLIP text embedding oftarget_positive_prompt
target_negative_text_embedding
: CLIP text embedding oftarget_positive_prompt
generated_image
: a PIL Image of the user-generated imagegenerated_positive_prompt
: user-submitted prompt for generatinggenerated_image
generated_negative_prompt
: user-submitted negative prompt for generatinggenerated_image
generated_image_embedding
: CLIP image embedding ofgenerated_image
generated_positive_text_embedding
: CLIP text embedding ofgenerated_positive_prompt
generated_negative_text_embedding
: CLIP text embedding ofgenerated_negative_prompt
ai_model_name
: name of the AI model used for this interaction (either 'SDv2.1' or 'SDv1.5')trajectory_index
: ordering for the given interaction (indexing starts from 1 and restarts for eachuser_id
,target_id
pair)score
: automated scoring to assess how similartarget_image
andgenerated_image
are (bewteen 0 and 100)human_rating
: user's rating for similarity bewteentarget_image
andgenerated_image
(bewteen 0 and 100)time_taken
: duration in seconds the user took to write/update their promptsfiltered_image
: whether the user-generated image triggered an NSFW-filter (if it did,generated_image
will be a black image)
If you find this work useful or use this dataset in your research, please cite:
@article{vodrahalli2023artwhisperer,
title={ArtWhisperer: A Dataset for Characterizing Human-AI Interactions in Artistic Creations},
author={Vodrahalli, Kailas and Zou, James},
journal={arXiv preprint arXiv:2306.08141},
year={2023}
}
If you have any questions, please feel free to email the authors.