
AI & UXR, LLM, HUMAN VS AI, OPEN AI
How a Transformer Thinks – And Why It Hallucinates
5
MIN
Jul 3, 2025
Why UX people should understand how large language models really work
Large language models such as ChatGPT, Claude and Gemini have long been more than just toys for tech enthusiasts. They write texts, answer questions, analyse feedback, generate ideas – and often do so with remarkable confidence. Many of us already use them every day in our UX practice. And rightly so.
But as exciting as these tools are, it's important to realise that what we're experiencing is not a form of genuine understanding. And the mistakes these systems make are not operational accidents. They are a structural component of the technology.
If you know how a transformer works, you understand why large language models hallucinate.
And if you understand that, you can make better UX decisions – for tools, for processes and for users.
What a transformer actually is (and what it isn't)
The ‘transformer’ is the basic model on which almost all of today's LLMs are based – from GPT-4 to Gemini, from Mistral to LLaMA. It was introduced in 2017 by a Google team (‘Attention is all you need’) and largely replaced older language models such as LSTMs and RNNs.
The basic idea is easy to explain:
A transformer is a model that predicts language. Token by token. It calculates which token (this can be a word, part of a word or a punctuation mark) is most likely to follow next, based on the context so far.
An example:
You enter: ‘UX researchers observe how users...’
The model then calculates: What is likely to follow? ‘...interact’, ‘...encounter difficulties’, ‘...use the app’?
The Transformer selects the most likely continuation. And then the next one. And the next one. Until an entire paragraph has been created.
The crucial point here is:
that the Transformer does not ‘know’ what is true. It knows no facts, no meaning, no world. It only ‘knows’ what sounds probable from a linguistic point of view – because it often occurred in its training data. And this is precisely where all the problems originate.
A look inside: How a transformer works
To understand where hallucinations come from, it's worth having a quick look at the technical side of things (we promise: no maths, no equations – just a basic understanding of the structure):
1. Tokenisation
The text entered is broken down into small building blocks called tokens. These can be words, syllables or parts of words (‘UX’ + ‘-researcher’ + ‘:innen’).
2. Embedding
Each of these tokens is translated into a numerical vector – a kind of meaning code. This makes language mathematically calculable.
3. Position coding
Because a transformer processes everything simultaneously (‘parallelises’), it needs additional information: Where in the sentence is each token located? This position data is incorporated so that the model understands sequences.
4. Self-attention
The heart of the transformer: Each token ‘pays attention’ to all other tokens in the context and weighs up which of them are important for its meaning. This creates semantic relationships – e.g. who ‘she’ is in the sentence or what an adjective refers to.
5. Multilayer processing
This does not happen just once, but many times over – with dozens to hundreds of layers. This creates complex patterns of probabilities that map linguistic structures with astonishing precision.
6. Autoregressive generation
The model outputs a token, attaches it to the context, calculates the next token based on the new context, and so on. Step by step, the text grows.
The result often looks like a train of thought – but in reality it is a recursive probability forecast. Linguistically brilliant. Not necessarily correct in terms of content.
Why transformers hallucinate – structurally, not randomly
Now it gets exciting: the way transformers work inevitably introduces certain sources of error. And if you ignore this, you run the risk of using LLMs incorrectly – in interfaces, in research, in product development.
1. Lack of world knowledge (no grounding)
Transformer models have no connection to the real world. They don't know that Paris is in France or that UX doesn't mean ‘user xylophone.’ They only know the probability with which certain word sequences occur.
This means that they can generate completely fabricated statements – as long as they sound linguistically plausible.
2. Lack of fact checking
A transformer does not check content. It has no internal control system, no logical validation, no semantic redundancy check.
If the training material often stated, ‘AI was developed by Alan Turing in 1983,’ the model might consider this a valid statement – even if it is historically incorrect.
3. Chain reaction through autoregressive writing
A small error at the beginning (e.g., a made-up name or a false fact) automatically leads to further errors. This is because each new token is based on the previous context.
This is called compounding errors – a slip-up turns into a whole hallucination.
4. Same linguistic feel – regardless of truthfulness
A transformer speaks in the same style – regardless of whether it is guessing, hallucinating or stating a fact.
The model has been trained to sound fluent and coherent – not to make the truth transparent.
This is dangerous: the linguistic quality suggests certainty – where there is none.
5. Quality and bias of training data
LLMs learn from huge text corpora – internet forums, websites, books, PDFs. These contain a wealth of knowledge – but also many errors, fiction, opinions, satire and polemics.
The model cannot distinguish whether a sentence comes from Wikipedia or Reddit.
What occurs often enough shapes the model, regardless of whether it is true or false.
Why this should matter to UX professionals
As UX professionals, we often build systems in which AI plays a role – whether for content creation, feedback analysis or user support. And the following applies to LLMs in particular:
Just because a system sounds good doesn't mean it's good.
When integrating LLMs into UX work, we must:
Understand where the strengths come from (e.g. language flow, pattern recognition, context sensitivity)
Recognise where the weaknesses come from (e.g. hallucinations, fact blindness, bias)
Take targeted measures to catch errors (e.g. through RAG, feedback loops, fact checking)
Build UX systems that make uncertainty visible (e.g. confidence scores, source references, indications of AI origin)
Design prompts and interfaces to guide LLMs in a meaningful way – and not overwhelm them
Conclusion: Understanding is better than amazement
Transformer models are a technological masterpiece. But they are not truth tellers. They generate language spaces, not world models. They make predictions, not statements.
Anyone working with LLMs as a UX professional – whether for tools, processes or content – should understand the inner workings of these models. Not in detail, but in terms of their overall structure.
Because only those who understand how a Transformer thinks can prevent it from hallucinating – especially when it matters most.
RELATED ARTICLES YOU MIGHT ENJOY
AUTHOR
Tara Bosenick
Tara has been active as a UX specialist since 1999 and has helped to establish and shape the industry in Germany on the agency side. She specialises in the development of new UX methods, the quantification of UX and the introduction of UX in companies.
At the same time, she has always been interested in developing a corporate culture in her companies that is as ‘cool’ as possible, in which fun, performance, team spirit and customer success are interlinked. She has therefore been supporting managers and companies on the path to more New Work / agility and a better employee experience for several years.
She is one of the leading voices in the UX, CX and Employee Experience industry.
