top of page
uintent company logo

AI & UXR, LLM, HUMAN VS AI, OPEN AI

How a Transformer Thinks – And Why It Hallucinates

5

MIN

Jul 3, 2025

Why UX people should understand how large language models really work


Large language models such as ChatGPT, Claude and Gemini have long been more than just toys for tech enthusiasts. They write texts, answer questions, analyse feedback, generate ideas – and often do so with remarkable confidence. Many of us already use them every day in our UX practice. And rightly so.


But as exciting as these tools are, it's important to realise that what we're experiencing is not a form of genuine understanding. And the mistakes these systems make are not operational accidents. They are a structural component of the technology.

 

If you know how a transformer works, you understand why large language models hallucinate.

And if you understand that, you can make better UX decisions – for tools, for processes and for users.


What a transformer actually is (and what it isn't) 

The ‘transformer’ is the basic model on which almost all of today's LLMs are based – from GPT-4 to Gemini, from Mistral to LLaMA. It was introduced in 2017 by a Google team (‘Attention is all you need’) and largely replaced older language models such as LSTMs and RNNs.

 

The basic idea is easy to explain:

A transformer is a model that predicts language. Token by token. It calculates which token (this can be a word, part of a word or a punctuation mark) is most likely to follow next, based on the context so far.


An example:

You enter: ‘UX researchers observe how users...’

The model then calculates: What is likely to follow? ‘...interact’, ‘...encounter difficulties’, ‘...use the app’?


The Transformer selects the most likely continuation. And then the next one. And the next one. Until an entire paragraph has been created.


The crucial point here is:

that the Transformer does not ‘know’ what is true. It knows no facts, no meaning, no world. It only ‘knows’ what sounds probable from a linguistic point of view – because it often occurred in its training data. And this is precisely where all the problems originate.


A look inside: How a transformer works

To understand where hallucinations come from, it's worth having a quick look at the technical side of things (we promise: no maths, no equations – just a basic understanding of the structure):


1. Tokenisation

The text entered is broken down into small building blocks called tokens. These can be words, syllables or parts of words (‘UX’ + ‘-researcher’ + ‘:innen’).


2. Embedding

Each of these tokens is translated into a numerical vector – a kind of meaning code. This makes language mathematically calculable.


3. Position coding

Because a transformer processes everything simultaneously (‘parallelises’), it needs additional information: Where in the sentence is each token located? This position data is incorporated so that the model understands sequences.


4. Self-attention

The heart of the transformer: Each token ‘pays attention’ to all other tokens in the context and weighs up which of them are important for its meaning. This creates semantic relationships – e.g. who ‘she’ is in the sentence or what an adjective refers to.


5. Multilayer processing

This does not happen just once, but many times over – with dozens to hundreds of layers. This creates complex patterns of probabilities that map linguistic structures with astonishing precision.


6. Autoregressive generation

The model outputs a token, attaches it to the context, calculates the next token based on the new context, and so on. Step by step, the text grows.


The result often looks like a train of thought – but in reality it is a recursive probability forecast. Linguistically brilliant. Not necessarily correct in terms of content.


Why transformers hallucinate – structurally, not randomly

Now it gets exciting: the way transformers work inevitably introduces certain sources of error. And if you ignore this, you run the risk of using LLMs incorrectly – in interfaces, in research, in product development.


1. Lack of world knowledge (no grounding)

Transformer models have no connection to the real world. They don't know that Paris is in France or that UX doesn't mean ‘user xylophone.’ They only know the probability with which certain word sequences occur.

This means that they can generate completely fabricated statements – as long as they sound linguistically plausible.

 

2. Lack of fact checking

A transformer does not check content. It has no internal control system, no logical validation, no semantic redundancy check.

If the training material often stated, ‘AI was developed by Alan Turing in 1983,’ the model might consider this a valid statement – even if it is historically incorrect.

 

3. Chain reaction through autoregressive writing

 A small error at the beginning (e.g., a made-up name or a false fact) automatically leads to further errors. This is because each new token is based on the previous context.

This is called compounding errors – a slip-up turns into a whole hallucination.


4. Same linguistic feel – regardless of truthfulness

A transformer speaks in the same style – regardless of whether it is guessing, hallucinating or stating a fact.

The model has been trained to sound fluent and coherent – not to make the truth transparent.

This is dangerous: the linguistic quality suggests certainty – where there is none.

  

5. Quality and bias of training data

LLMs learn from huge text corpora – internet forums, websites, books, PDFs. These contain a wealth of knowledge – but also many errors, fiction, opinions, satire and polemics.

The model cannot distinguish whether a sentence comes from Wikipedia or Reddit.

 

What occurs often enough shapes the model, regardless of whether it is true or false.


Why this should matter to UX professionals

As UX professionals, we often build systems in which AI plays a role – whether for content creation, feedback analysis or user support. And the following applies to LLMs in particular:

 

Just because a system sounds good doesn't mean it's good.

 

When integrating LLMs into UX work, we must:

  • Understand where the strengths come from (e.g. language flow, pattern recognition, context sensitivity)

  • Recognise where the weaknesses come from (e.g. hallucinations, fact blindness, bias)

  • Take targeted measures to catch errors (e.g. through RAG, feedback loops, fact checking)

  • Build UX systems that make uncertainty visible (e.g. confidence scores, source references, indications of AI origin)

  • Design prompts and interfaces to guide LLMs in a meaningful way – and not overwhelm them


Conclusion: Understanding is better than amazement

Transformer models are a technological masterpiece. But they are not truth tellers. They generate language spaces, not world models. They make predictions, not statements.


Anyone working with LLMs as a UX professional – whether for tools, processes or content – should understand the inner workings of these models. Not in detail, but in terms of their overall structure.


Because only those who understand how a Transformer thinks can prevent it from hallucinating – especially when it matters most.



Illustration of Donald Trump with raised hand in front of an abstract digital background suggesting speech bubbles and data structures.

Donald Trump Prompt: How Provocative AI Prompts Affect UX Budgets

AI & UXR, PROMPTS, STAKEHOLDER MANAGEMENT

Driver's point of view looking at a winding country road surrounded by green vegetation. The steering wheel, dashboard and rear-view mirror are visible in the foreground.

The Final Hurdle: How Unsafe Automation Undermines Trust in Adas

AUTOMATION, AUTOMOTIVE UX, AUTONOMOUS DRIVING, GAMIFICATION, TRENDS

Illustration of a person standing at a fork in the road with two equal paths.

Will AI Replace UX Jobs? What a Study of 200,000 AI Conversations Really Shows

HUMAN VS AI, RESEARCH, AI & UXR

Close-up of a premium tweeter speaker in a car dashboard with perforated metal surface.

The Passenger Who Always Listens: Why We Are Reluctant to Trust Our Cars When They Talk

AUTOMOTIVE UX, VOICE ASSISTANTS

Keyhole in a dark surface revealing an abstract, colorful UX research interface.

Evaluating AI Results in UX Research: How to Navigate the Black Box

AI & UXR, HOW-TO, HUMAN VS AI

A car cockpit manufactured by Audi. It features a digital display and numerous buttons on the steering wheel.

Haptic Certainty vs. Digital Temptation: The Battle for the Best Controls in Cars

AUTOMOTIVE UX, AUTONOMOUS DRIVING, CONNECTIVITY, GAMIFICATION

Digital illustration of a classical building facade with columns, supported by visible scaffolding, symbolising a fragile, purely superficial front.

UX & AI: How "UX Potemkin" Undermines Your Research and Design Decisions

AI & UXR, HUMAN VS AI, LLM, UX

Silhouette of a diver descending into deep blue water – a metaphor for in-depth research.

Deep Research AI | How to use ChatGPT effectively for UX work

CHAT GPT, HOW-TO, RESEARCH, AI & UXR

A referee holds up a scorecard labeled “Yupp.ai” between two stylized AI chatbots in a boxing ring – a symbolic image for fair user-based comparison of AI models.

How Yupp Uses Feedback to Fairly Evaluate AI Models – And What UX Professionals Can Learn From It

AI & UXR, CHAT GPT, HUMAN VS AI, LLM

A brown book entitled ‘Don't Make Me Think’ by Steve Krug lies on a small table. Light shines through the window.

Why UX Research Is Losing Credibility - And How We Can Regain It

UX, UX QUALITY, UX METHODS

3D illustration of a digital marketplace with colorful prompt stalls and a figure selecting a prompt card.

Buying, sharing, selling prompts – what prompt marketplaces offer today (and why this is relevant for UX)

AI & UXR, PROMPTS

Robot holds two signs: “ISO 9241 – 7 principles” and “ISO 9241 – 10 principles”

ChatGPT Hallucinates – Despite Anti-Hallucination Prompt

AI & UXR, HUMAN VS AI, CHAT GPT

Strawberry being sliced by a knife, stylized illustration.

Why AI Sometimes Can’t Count to 3 – And What That Has to Do With Tokens

AI & UXR, TOKEN, LLM

Square motif divided in the middle: on the left, a grey, stylised brain above a seated person working on a laptop in dark grey tones; on the right, a bright blue, networked brain above a standing person in front of a holographic interface on a dark background.

GPT-5 Is Here: Does This UX AI Really Change Everything for Researchers?

AI & UXR, CHAT GPT

Surreal AI image with data streams, crossed-out “User Expirince” and the text “ChatGPT kann jetzt Text in Bild”.

When AI Paints Pictures – And Suddenly Knows How to Spell

AI & UXR, CHAT GPT, HUMAN VS AI

Human and AI co-create a glowing tree on the screen, set against a dark, surreal background.

When the Text Is Too Smooth: How to Make AI Language More Human

AI & UXR, AI WRITING, CHAT GPT, HUMAN VS AI

Futuristic illustration: Human facing a glowing humanoid AI against a digital backdrop.

Not Science Fiction – AI Is Becoming Independent

AI & UXR, CHAT GPT

Illustration of an AI communicating with a human, symbolizing the persuasive power of artificial intelligence.

Between Argument and Influence – How Persuasive Can AI Be?

AI & UXR, CHAT GPT, LLM

A two-dimensional cartoon woman stands in front of a human-sized mobile phone displaying health apps. To her right is a box with a computer on it showing an ECG.

Digital Health Apps & Interfaces: Why Good UX Determines Whether Patients Really Benefit

HEALTHCARE, MHEALTH, TRENDS, UX METHODS

Illustration of a red hand symbolically prioritizing “Censorship” over “User Privacy” in the context of DeepSeek, with the Chinese flag in the background.

Censorship Meets AI: What Deepseek Is Hiding About Human Rights – And Why This Affects UX

AI & UXR, LLM, OPEN AI

 RELATED ARTICLES YOU MIGHT ENJOY 

AUTHOR

Tara Bosenick

Tara has been active as a UX specialist since 1999 and has helped to establish and shape the industry in Germany on the agency side. She specialises in the development of new UX methods, the quantification of UX and the introduction of UX in companies.


At the same time, she has always been interested in developing a corporate culture in her companies that is as ‘cool’ as possible, in which fun, performance, team spirit and customer success are interlinked. She has therefore been supporting managers and companies on the path to more New Work / agility and a better employee experience for several years.


She is one of the leading voices in the UX, CX and Employee Experience industry.

bottom of page