THE FINDINGS
BACKGROUND
Text-to-image AI generators have rapidly grown in popularity over recent years. Platforms such as Midjourney utilise powerful, deep-learning technology that is capable of generating original images from simple text descriptions known as prompts. This has sparked controversy over the current landscape of art, potentially threatening the livelihoods of artists due to its effortless usage.
Midjourney is an independent research lab that produces a proprietary artificial intelligence program that creates images from textual descriptions, similar to OpenAI's DALL-E and the open-source Stable Diffusion. The tool is currently in open beta, which it entered on July 12, 2022. It is currently only accessible through a Discord bot on their official Discord, by direct messaging the bot, or by inviting the bot to a third party server. Founder, David Holz shared with The Register that currently artists use Midjourney for rapid prototyping of artistic concepts to show to clients before starting work themselves.
FINDINGS AND INSIGHTS
The flag has a significant impact on how AI interprets a particular country
We all imagine Singapore to be portrayed as a green city, but majority of the images we generated had large portions of red. Singapore’s flag is 50% red and 50% white, making red the more prominent color out of the two.
FINDINGS AND INSIGHTS
The names of places also have a significant impact on how AI presents
a country
In our attempt to test AI’s ability to identity Singapore, we purely used descriptions. We realised that once the word ‘Singapore’ is taken away, the images tend to be generic or almost westernised.
FINDINGS AND INSIGHTS
Inherent stereotypes are evident in the generated images
The most obvious stereotype is gender. AI automatically registers text prompts ‘prime minister’ and ‘president’ as male figures. We first picked this up when we tried to generate an image of Halimah Yacob, current president of Singapore but was faced with an image of a man.
To test our hypothesis about the gender stereotype, we inputted text prompts like ‘New Zealand Prime Minister’ and ‘Bangladesh Prime Minister’. Both Jacinda Ardern and Sheikh Hasina are both women and prime ministers in their respective countries. Putting all three generated images together proves that indeed AI is gender biased.
DESIGN DECISIONS
There were several text-to-image generators for us to choose from - Dall E 2, MidJourney, Dream by WOMBO, Nightcafe AI (Stable Diffusion), Dream Studio AI etc. Looking at the different platforms, we wanted something that was not too realistic and yet not too far off from what Singapore is like. We tested three generators - MidJourney, Dall E 2 and Stable Diffusion by inputting the same prompt ‘Singapore Merlion’. While DALL-E 2 and Stable Diffusion generated a far more realistic image, Mid Journey generated a unique stylised image. After comparing the three softwares, we decided to go with MidJourney because of the stylisation it offers.