top of page
uintent company logo

AI & UXR, CHAT GPT, HUMAN VS AI

When AI Paints Pictures – And Suddenly Knows How to Spell

3

MIN

Sep 18, 2025

A new quality: AI images with meaning

Generating images has been possible for some time – but it was often a gamble. Especially when text was supposed to appear in the image: instead of ‘UX Matters’, it often said ‘UX Mertres’, ‘UsX MaRer’ or a fantasy combination of characters that somehow looked like letters. This was hardly usable for serious applications.

 

Recently, however, something fundamental has changed. Image generation in ChatGPT – specifically, with DALL·E 3 – has reached a level of maturity where text can be generated correctly in the image. And not only that: I can now specifically reference existing images, have parts of them changed or adjust certain details. Those who have only seen this in Midjourney before will be surprised at how much more precise and semantically stable it has become.


Why was it so difficult in the past?

The reason lies in the understanding of AI. Earlier image generators – including Midjourney, DALL·E 2 and Stable Diffusion – treat images purely visually. This means that they recognise that a road sign has a certain shape, but not what is written on it. Text was treated like a texture or a visual pattern – not as readable, meaningful content.

 

This only changed with multimodal models. ChatGPT with DALL·E 3 combines language comprehension with image composition. This enables AI to truly understand what terms such as ‘Title centred at the top: UX saves products’ mean – and then correctly implement this task in the image.


Context is everything – now I can reference images

One of the most important innovations is that I can specifically refer to images that have already been created. This means that I upload an image or use one that has been generated previously, and then formulate an instruction such as ‘like this image, but with a lighter background and without text’ or ‘replace the figure with a woman in business attire’.

 

The AI recognises the overall context – i.e. what is happening in the image, where certain elements are located and what needs to be changed – without reinterpreting the entire image. This is a real difference from Midjourney's classic ‘remix’ logic, where you often unintentionally change several aspects at once.


What you should keep in mind when prompting

For this to work, the prompt needs to be clear – and there are a few little tricks:


Firstly, English is often more stable. The models were predominantly trained with English image descriptions. German prompts usually work well, but sometimes errors or strange interpretations creep in. If it's important, try it in English as well.


Secondly: Spelling really matters. If a word is spelled incorrectly in the prompt, the AI will mercilessly copy this spelling into the image. So it's not a good idea to rely on autocorrect – better to check twice before ‘Ux Esprience’ ends up on the poster.

 

Thirdly: specific titles help. Instead of ‘A title that shows the importance of UX’, it's better to use ‘Title: UX saves products’ or ‘Text in the middle: Design with meaning’. The clearer the instruction, the more likely you are to get the right result.


Fourthly: keep the text short. Long sentences, paragraphs or convoluted phrasing often lead to errors. One to two lines are realistic, three are risky. If you want longer content, you should work with empty spaces and insert the text yourself later.


What can ChatGPT do better than Midjourney – and vice versa?

Midjourney has earned a reputation as the aesthetic queen among image AIs. The results are atmospheric, stylish and creative. But when it comes to controlling specific content, integrating text correctly into the image or editing small details, the system quickly reaches its limits.


ChatGPT with DALL·E 3 scores highly in this area: I can say, ‘Replace the red T-shirt with a blue one’ – and it happens. I can change the text, adjust the background, remove an object without the entire image being reinterpreted. This semantic controllability makes DALL·E more useful in many UX contexts – especially when I need illustrations with a clear message, explanatory graphics or recognisable image series for slides, articles or social media.


Why this is relevant for UX people

Visuals are not an accessory in UX communication. They convey attitude, focus, structure – and help make complex content accessible. When I use ChatGPT to generate an image with text that looks meaningful and is factually correct, I often save myself the detour via graphics tools, stock photos or image manipulation.

 

In addition, the combination of reference image and voice-controlled modification allows for a very iterative way of working. I can experiment, adapt, compare – and thus quickly develop or test visual concepts. This is particularly helpful in early project phases, for UX concepts or in internal communication.


Conclusion: Image AI is becoming more useful – not just more beautiful

ChatGPT is not a designer. And DALL·E won't build you a complete infographic with a grid system and clean type area. But: The new abilities to display text correctly, modify existing images in a targeted manner and implement visual ideas in an understandable way finally make the system practical – not just inspiring.

 

For UX people, this means that those who want to communicate ideas today no longer have to wait for perfectly rendered mock-ups. A good prompt, an image, a few targeted adjustments – and suddenly an idea becomes something visible. Something that is understood. Something that works.


And that's what UX is all about.

  

💌 Not enough? Then read on – in our newsletter. It comes four times a year. Sticks in your mind longer. To subscribe: https://www.uintent.com/newsletter



Illustration of Donald Trump with raised hand in front of an abstract digital background suggesting speech bubbles and data structures.

Donald Trump Prompt: How Provocative AI Prompts Affect UX Budgets

AI & UXR, PROMPTS, STAKEHOLDER MANAGEMENT

Driver's point of view looking at a winding country road surrounded by green vegetation. The steering wheel, dashboard and rear-view mirror are visible in the foreground.

The Final Hurdle: How Unsafe Automation Undermines Trust in Adas

AUTOMATION, AUTOMOTIVE UX, AUTONOMOUS DRIVING, GAMIFICATION, TRENDS

Illustration of a person standing at a fork in the road with two equal paths.

Will AI Replace UX Jobs? What a Study of 200,000 AI Conversations Really Shows

HUMAN VS AI, RESEARCH, AI & UXR

Close-up of a premium tweeter speaker in a car dashboard with perforated metal surface.

The Passenger Who Always Listens: Why We Are Reluctant to Trust Our Cars When They Talk

AUTOMOTIVE UX, VOICE ASSISTANTS

Keyhole in a dark surface revealing an abstract, colorful UX research interface.

Evaluating AI Results in UX Research: How to Navigate the Black Box

AI & UXR, HOW-TO, HUMAN VS AI

A car cockpit manufactured by Audi. It features a digital display and numerous buttons on the steering wheel.

Haptic Certainty vs. Digital Temptation: The Battle for the Best Controls in Cars

AUTOMOTIVE UX, AUTONOMOUS DRIVING, CONNECTIVITY, GAMIFICATION

Digital illustration of a classical building facade with columns, supported by visible scaffolding, symbolising a fragile, purely superficial front.

UX & AI: How "UX Potemkin" Undermines Your Research and Design Decisions

AI & UXR, HUMAN VS AI, LLM, UX

Silhouette of a diver descending into deep blue water – a metaphor for in-depth research.

Deep Research AI | How to use ChatGPT effectively for UX work

CHAT GPT, HOW-TO, RESEARCH, AI & UXR

A referee holds up a scorecard labeled “Yupp.ai” between two stylized AI chatbots in a boxing ring – a symbolic image for fair user-based comparison of AI models.

How Yupp Uses Feedback to Fairly Evaluate AI Models – And What UX Professionals Can Learn From It

AI & UXR, CHAT GPT, HUMAN VS AI, LLM

A brown book entitled ‘Don't Make Me Think’ by Steve Krug lies on a small table. Light shines through the window.

Why UX Research Is Losing Credibility - And How We Can Regain It

UX, UX QUALITY, UX METHODS

3D illustration of a digital marketplace with colorful prompt stalls and a figure selecting a prompt card.

Buying, sharing, selling prompts – what prompt marketplaces offer today (and why this is relevant for UX)

AI & UXR, PROMPTS

Robot holds two signs: “ISO 9241 – 7 principles” and “ISO 9241 – 10 principles”

ChatGPT Hallucinates – Despite Anti-Hallucination Prompt

AI & UXR, HUMAN VS AI, CHAT GPT

Strawberry being sliced by a knife, stylized illustration.

Why AI Sometimes Can’t Count to 3 – And What That Has to Do With Tokens

AI & UXR, TOKEN, LLM

Square motif divided in the middle: on the left, a grey, stylised brain above a seated person working on a laptop in dark grey tones; on the right, a bright blue, networked brain above a standing person in front of a holographic interface on a dark background.

GPT-5 Is Here: Does This UX AI Really Change Everything for Researchers?

AI & UXR, CHAT GPT

Surreal AI image with data streams, crossed-out “User Expirince” and the text “ChatGPT kann jetzt Text in Bild”.

When AI Paints Pictures – And Suddenly Knows How to Spell

AI & UXR, CHAT GPT, HUMAN VS AI

Human and AI co-create a glowing tree on the screen, set against a dark, surreal background.

When the Text Is Too Smooth: How to Make AI Language More Human

AI & UXR, AI WRITING, CHAT GPT, HUMAN VS AI

Futuristic illustration: Human facing a glowing humanoid AI against a digital backdrop.

Not Science Fiction – AI Is Becoming Independent

AI & UXR, CHAT GPT

Illustration of an AI communicating with a human, symbolizing the persuasive power of artificial intelligence.

Between Argument and Influence – How Persuasive Can AI Be?

AI & UXR, CHAT GPT, LLM

A two-dimensional cartoon woman stands in front of a human-sized mobile phone displaying health apps. To her right is a box with a computer on it showing an ECG.

Digital Health Apps & Interfaces: Why Good UX Determines Whether Patients Really Benefit

HEALTHCARE, MHEALTH, TRENDS, UX METHODS

Illustration of a red hand symbolically prioritizing “Censorship” over “User Privacy” in the context of DeepSeek, with the Chinese flag in the background.

Censorship Meets AI: What Deepseek Is Hiding About Human Rights – And Why This Affects UX

AI & UXR, LLM, OPEN AI

 RELATED ARTICLES YOU MIGHT ENJOY 

AUTHOR

Tara Bosenick

Tara has been active as a UX specialist since 1999 and has helped to establish and shape the industry in Germany on the agency side. She specialises in the development of new UX methods, the quantification of UX and the introduction of UX in companies.


At the same time, she has always been interested in developing a corporate culture in her companies that is as ‘cool’ as possible, in which fun, performance, team spirit and customer success are interlinked. She has therefore been supporting managers and companies on the path to more New Work / agility and a better employee experience for several years.


She is one of the leading voices in the UX, CX and Employee Experience industry.

bottom of page