AI & UXR, CHAT GPT, HUMAN VS AI

When AI Paints Pictures – And Suddenly Knows How to Spell

3 min read

Sep 18, 2025

A new quality: AI images with meaning

Generating images has been possible for some time – but it was often a gamble. Especially when text was supposed to appear in the image: instead of ‘UX Matters’, it often said ‘UX Mertres’, ‘UsX MaRer’ or a fantasy combination of characters that somehow looked like letters. This was hardly usable for serious applications.

 

Recently, however, something fundamental has changed. Image generation in ChatGPT – specifically, with DALL·E 3 – has reached a level of maturity where text can be generated correctly in the image. And not only that: I can now specifically reference existing images, have parts of them changed or adjust certain details. Those who have only seen this in Midjourney before will be surprised at how much more precise and semantically stable it has become.


Why was it so difficult in the past?

The reason lies in how earlier models handled images. Earlier image generators – including Midjourney, DALL·E 2 and Stable Diffusion – treated images purely visually. They recognised that a road sign has a certain shape, but not what is written on it. Text was treated like a texture or a visual pattern – not as readable, meaningful content.

 

This only changed with multimodal models. ChatGPT with DALL·E 3 combines language comprehension with image composition. This enables the AI to understand what an instruction such as ‘Title centred at the top: UX saves products’ means – and then to implement it correctly in the image.


Context is everything – now I can reference images

One of the most important innovations is that I can specifically refer to images that have already been created. This means that I upload an image or use one that has been generated previously, and then formulate an instruction such as ‘like this image, but with a lighter background and without text’ or ‘replace the figure with a woman in business attire’.

 

The AI recognises the overall context – i.e. what is happening in the image, where certain elements are located and what needs to be changed – without reinterpreting the entire image. This is a real difference from Midjourney's classic ‘remix’ logic, where you often unintentionally change several aspects at once.


What you should keep in mind when prompting

For this to work, the prompt needs to be clear – and there are a few little tricks:


Firstly: English is often more stable. The models were predominantly trained on English image descriptions. German prompts usually work well, but sometimes errors or strange interpretations creep in. If it's important, try it in English as well.


Secondly: Spelling really matters. If a word is spelled incorrectly in the prompt, the AI will mercilessly copy this spelling into the image. So it's not a good idea to rely on autocorrect – better to check twice before ‘Ux Esprience’ ends up on the poster.

 

Thirdly: specific titles help. Instead of ‘A title that shows the importance of UX’, it's better to use ‘Title: UX saves products’ or ‘Text in the middle: Design with meaning’. The clearer the instruction, the more likely you are to get the right result.


Fourthly: keep the text short. Long sentences, paragraphs or convoluted phrasing often lead to errors. One to two lines are realistic, three are risky. If you want longer content, leave empty space in the image and insert the text yourself later.
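The tips above can be turned into a small pre-flight check before a prompt is sent. The following is only a sketch under stated assumptions: the names `ImageText`, `check_image_text` and `build_prompt` are hypothetical helpers, the 40-character line limit is an assumed value that does not come from any tool, and nothing here calls an image model. It simply builds the prompt string with an explicit position and the exact wording, and warns when the text is longer than the one to two lines mentioned above.

```python
from dataclasses import dataclass

MAX_TEXT_LINES = 2    # "one to two lines are realistic, three are risky"
MAX_LINE_CHARS = 40   # assumed limit for illustration, not a documented value


@dataclass
class ImageText:
    position: str  # where the text goes, e.g. "Title centred at the top"
    text: str      # exact wording to render; it is copied verbatim


def check_image_text(item: ImageText) -> list[str]:
    """Return warnings for text that is likely to render badly."""
    warnings = []
    lines = item.text.splitlines() or [""]
    if not item.text.strip():
        warnings.append("text is empty; state the exact wording")
    if len(lines) > MAX_TEXT_LINES:
        warnings.append(
            f"{len(lines)} lines of text; keep it to {MAX_TEXT_LINES} "
            "or leave space and add the text yourself later"
        )
    if any(len(line) > MAX_LINE_CHARS for line in lines):
        warnings.append(f"a line is longer than {MAX_LINE_CHARS} characters")
    return warnings


def build_prompt(scene: str, item: ImageText) -> str:
    """Combine the scene description with an explicit text instruction."""
    scene = scene.strip().rstrip(".")
    return f"{scene}. {item.position}: '{item.text}'."


if __name__ == "__main__":
    title = ImageText("Title centred at the top", "UX saves products")
    for warning in check_image_text(title):
        print("Warning:", warning)
    print(build_prompt("Flat illustration of a lifebuoy around a smartphone", title))
```

A spell check is deliberately left out: since the model copies the wording as written, the text still needs a human read-through before it goes into the prompt.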


What can ChatGPT do better than Midjourney – and vice versa?

Midjourney has earned a reputation as the aesthetic queen among image AIs. The results are atmospheric, stylish and creative. But when it comes to controlling specific content, integrating text correctly into the image or editing small details, the system quickly reaches its limits.


ChatGPT with DALL·E 3 scores highly in this area: I can say, ‘Replace the red T-shirt with a blue one’ – and it happens. I can change the text, adjust the background, remove an object without the entire image being reinterpreted. This semantic controllability makes DALL·E more useful in many UX contexts – especially when I need illustrations with a clear message, explanatory graphics or recognisable image series for slides, articles or social media.


Why this is relevant for UX people

Visuals are not an accessory in UX communication. They convey attitude, focus, structure – and help make complex content accessible. When I use ChatGPT to generate an image with text that looks meaningful and is factually correct, I often save myself the detour via graphics tools, stock photos or image manipulation.

 

In addition, the combination of reference image and language-based modification allows for a very iterative way of working. I can experiment, adapt, compare – and thus quickly develop or test visual concepts. This is particularly helpful in early project phases, for UX concepts or in internal communication.


Conclusion: Image AI is becoming more useful – not just more beautiful

ChatGPT is not a designer. And DALL·E won't build you a complete infographic with a grid system and clean type area. But the new abilities to display text correctly, modify existing images in a targeted manner and implement visual ideas in an understandable way finally make the system practical – not just inspiring.

 

For UX people, this means that those who want to communicate ideas today no longer have to wait for perfectly rendered mock-ups. A good prompt, an image, a few targeted adjustments – and suddenly an idea becomes something visible. Something that is understood. Something that works.


And that's what UX is all about.

  

💌 Not enough? Then read on – in our newsletter. It comes four times a year. Sticks in your mind longer. To subscribe: https://www.uintent.com/newsletter



AUTHOR

Tara Bosenick

Tara has been active as a UX specialist since 1999 and has helped to establish and shape the industry in Germany on the agency side. She specialises in the development of new UX methods, the quantification of UX and the introduction of UX in companies.


At the same time, she has always been interested in developing a corporate culture in her companies that is as ‘cool’ as possible, in which fun, performance, team spirit and customer success are interlinked. She has therefore been supporting managers and companies on the path to more New Work / agility and a better employee experience for several years.


She is one of the leading voices in the UX, CX and Employee Experience industry.
