In the creative practice of text-to-image generation (TTI), images are generated from text prompts. However, TTI models are trained to always yield an output, even if the prompt contains unknown terms. In this case, the model may generate what we call "default images": images that closely resemble each other across many unrelated prompts. We argue that studying default images is valuable for designing better solutions for TTI and prompt engineering. In this paper, we provide the first investigation into default images on Midjourney, a popular image generator. We describe our systematic approach to creating input prompts that trigger default images, and present the results of our initial experiments and several small-scale ablation studies. We also report on a survey study investigating how default images affect user satisfaction. Our work lays the foundation for understanding default images in TTI and highlights challenges and future research directions.
@article{simonen2025_2505.09166,
  title={An Initial Exploration of Default Images in Text-to-Image Generation},
  author={Hannu Simonen and Atte Kiviniemi and Jonas Oppenlaender},
  journal={arXiv preprint arXiv:2505.09166},
  year={2025}
}