Prompting madness: exploring the representation of mental illness and experience in AI art

[AI] is not a mirror, and it is not a parrot — Joanna J. Bryson

Text-to-image generators are deep learning models that create images based on natural language prompts. These generators (such as DALL·E 2, Midjourney, and IMAGEN) are now able to construct novel, credible photograph- and painting-like images. As computer vision algorithms, these tools are built by training them on millions or billions of pairs of images and text descriptions.

An ethical concern with text-to-image generators is that they “reflect social stereotypes, oppressive viewpoints, and derogatory, or otherwise harmful, associations to marginalized identity groups”. This project explores similar concerns for the representation of mental health, mental disorders, and neurodivergence in AI-generated imagery. At the same time, it looks into how variations, fluctuations, and disruptions of the human psyche have been depicted through art as captured by the latent space of deep learning embeddings.

Prior art

Researcher @eolasinntinn used craiyon to generate terms from the “D(ALL-E)SM-5”.