Article: “Midjourney Can’t Count”: Questions of Representation and Meaning for Text-to-Image Generators
dc.creator | Wasielewski, Amanda | |
dc.date.accessioned | 2024-06-14T11:09:37Z | |
dc.date.available | 2024-06-14T11:09:37Z | |
dc.date.issued | 2023 | |
dc.description.abstract | Text-to-image generation tools, such as DALL·E, Midjourney, and Stable Diffusion, were released to the public in 2022. In their wake, communities of artists and amateurs sprang up to share prompts and images created with the help of these tools. This essay investigates two of the common quirks or issues that arise for users of these image generation platforms: the problem of representing human hands and the attendant issue of generating the desired number of any object or appendage. First, I address the issue that image generators have with generating normative human hands and how DALL·E has tried to correct this issue by only providing generations of normative human hands, even when a prompt asks for a different configuration. Secondly, I address how this hand problem is part of a larger issue in these systems where they are unable to count or reproduce the desired number of objects in a particular image, even when explicitly prompted to do so. This essay ultimately argues that these common issues indicate a deeper conundrum for large AI models: the problem of representation and the creation of meaning. | en |
dc.identifier.doi | http://dx.doi.org/10.25969/mediarep/22327 | |
dc.identifier.uri | https://mediarep.org/handle/doc/23757 | |
dc.language | eng | |
dc.publisher | Herbert von Halem | |
dc.publisher.place | Köln | |
dc.relation.isPartOf | issn:1614-0885 | |
dc.relation.ispartofseries | IMAGE. Zeitschrift für interdisziplinäre Bildwissenschaft | |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | |
dc.subject | text-to-image generation | en |
dc.subject | representation | en |
dc.subject | meaning | en |
dc.subject | human | en |
dc.subject.ddc | ddc:700 | |
dc.title | “Midjourney Can’t Count”: Questions of Representation and Meaning for Text-to-Image Generators | en |
dc.type | article | |
dc.type.status | publishedVersion | |
dspace.entity.type | Article | |
local.coverpage | 2024-06-16T02:31:09 | |
local.source.epage | 82 | |
local.source.issue | 1 | |
local.source.issueTitle | Generative Imagery: Towards a ‘New Paradigm’ of Machine Learning-Based Image Production | |
local.source.spage | 71 | |
local.source.volume | 19 |
Files
Original bundle
- Name: IMAGE_37_2023_71-82_Wasielewski_Midjourney_.pdf
- Size: 1.05 MB
- Format: Adobe Portable Document Format
- Description: Original PDF with additional cover page.