top of page
uintent company logo

LLM, CHAT GPT, HOW-TO

AI Tools UX Research: How Do These Tools Handle Large Documents?

4

MIN

Jan 22, 2026

Imagine this: you upload an 80-page interview transcript to your AI tool, ask a question about the last 20 pages, and the AI responds with a friendly ‘There's nothing about that in the document.’ Frustrating? Absolutely. Avoidable? Yes, if you understand how AI tools for UX research really work.

In this article, you'll learn how ChatGPT, Claude, and Gemini handle large UX documents, where the hidden pitfalls lie, and how to get the most out of your analyses. As a UX consultant, I've been working with qualitative research methods for over 25 years and test AI tools extensively in my daily work. The differences I've discovered are greater than expected.


📌 The most important points in brief:

• ChatGPT automatically breaks down large documents into chunks, which can hide overarching patterns.

• Claude often processes medium-sized documents (up to approx. 150,000 words) in their entirety and only switches to chunking when there is an overflow.

• Gemini has the largest context window (up to 2 million tokens) and can process entire research archives at once

• The context window determines how much text an AI can ‘see’ at one time

• Test your tools specifically: ask for content from different parts of documents to discover gaps

• Choosing the right tool can determine the quality of your UX insights


What is a context window, and why should you care?

The context window is the ‘attention span’ of an AI. It defines how much text the system can process at once. Anything beyond that must either be ignored or processed in another way.

This is crucial for UX research: if your interview transcript is larger than the context window, the AI cannot recognise all the connections. Imagine someone reading your 100-page research document but forgetting the previous pages after each page. That's exactly what happens when AI tools only partially capture UX research documents.


As of January 2025: ChatGPT offers 128,000 tokens, Claude between 200,000 and 1 million tokens, and Gemini up to 2 million tokens. One token corresponds to approximately 0.75 words in German.


How do ChatGPT, Claude and Gemini handle large documents?

The three major AI providers have developed fundamentally different strategies to deal with the context window problem. These differences have a direct impact on the quality of your UX analyses.


ChatGPT: The chunking approach

ChatGPT relies on Retrieval Augmented Generation (RAG). The system automatically breaks down large documents into smaller sections, stores them in a search index, and only retrieves the supposedly relevant parts when questions are asked.


The problem in practice: In a 60-page usability test report, I asked about recurring patterns of frustration. ChatGPT provided examples from the first 20 pages, but overlooked a key pattern that only became apparent from comments on pages 45 and 52. The chunks were simply not ‘close enough’ to each other to be recognised as belonging together.


When ChatGPT works well: The system delivers reliable results for specific individual questions (‘What did participant 3 say about navigation?’). The search usually finds the relevant section without any problems.


Claude: The hybrid approach

Claude pursues a different strategy: as long as a document fits into the context window, it is processed completely and coherently. Only when the limit is exceeded does the system automatically switch to RAG.


The advantage: With medium to large interview transcripts (up to about 150,000 words), you have a good chance of complete processing. In my work as a UX consultant, I therefore prefer to use Claude for analyses where overarching themes and subtle patterns are important.


Good to know: Technically, it makes no difference whether you paste text or upload it as a file. Both are processed identically as long as they fit into the context window.


Gemini: The brute force approach

Google is taking a radically different approach with Gemini: instead of cleverly breaking things down, it simply built a huge context window. Up to 2 million tokens, which is equivalent to about 1,500 pages of text.


For UX research, this means that you can theoretically upload multiple interview transcripts, usability logs and background documents at the same time. Gemini keeps track of everything.


The catch: This approach is resource-intensive and correspondingly expensive. For regular analyses of large amounts of data, you should keep an eye on the costs.


AI tools UX research: A direct comparison

Criteria

ChatGPT

Claude

Gemini

Context window

128k tokens

200k to 1M tokens

1M to 2M tokens

Processing method

Always chunking/RAG

Hybrid (complete, then RAG)

Complete

Strength

Specific individual questions

Overarching patterns

Very large amounts of data

Weakness

Correlations across chunks

RAG fallback for very large files

High costs

 

How can you tell that your AI tool hasn't read everything?

There are typical warning signs that indicate that your AI tool has only partially captured UX research documents:


Answers only refer to the beginning: If all examples and quotes come from the first few pages, the AI probably hasn't looked at the entire document.


Overarching patterns are missing: When asked about recurring themes, the AI only provides isolated individual examples instead of connections.


The AI seems ‘surprised’: When asked specific questions about later parts of the document, the system reacts as if it is seeing the content for the first time.


Contradictory statements: The AI makes statements that contradict other parts of the document because it does not see them at the same time.


My tip: For important analyses, always ask control questions about different parts of the document. Ask about connections between the beginning and the end. When fully processed, the AI should be able to recognise these connections.


How to get the most out of your AI tools for UX research

Tips for ChatGPT

Formulate your prompts specifically and explicitly indicate that the entire document should be taken into account. For complex analyses, it can be helpful to divide the document into thematic sections and analyse them one after the other. Alternatively, you can copy the entire text directly into the chat window, and everything will be processed.


Tips for Claude

Use the power of the large context window for analyses where overarching themes are important. For very large files, it helps to explicitly ask for different parts of the document to ensure that RAG mode finds all relevant information.


Tips for Gemini

Use its power for really large amounts of data, for example, if you want to analyse several interviews or an entire research project at the same time. For repeated requests to the same documents, context caching is worthwhile to save costs.


Which AI tool is suitable for which UX research task?

Short to medium interviews (up to 30 pages): All three systems work reliably. Choose according to personal preference or existing subscription.


Large individual interviews (50+ pages): Prefer Claude or Gemini. ChatGPT will have to chunk here, which can lead to gaps in pattern analysis.


Multiple documents at once: Gemini is strongest here. Claude can still manage it with medium overall sizes. ChatGPT quickly reaches its limits.


Specific individual questions: ChatGPT works excellently here, as the chunk search finds exactly the relevant section.


Code reviews and prototype analyses: Gemini excels with its ability to understand up to 30,000 lines of code at once.


Frequently asked questions about AI tools for UX research

Does it make a difference whether I copy text or upload it as a file?

Not with Claude; both methods are processed identically. With ChatGPT, pasting directly into the chat window can be advantageous because then no chunking takes place. With Gemini, it doesn't matter either.


Can I see if my document has been fully processed?

Unfortunately, the systems do not display this directly. You can test it by asking specific questions about different parts of the document. If the AI consistently recognises connections between the beginning and the end, the document has probably been captured in its entirety.


Which tool is best for confidential UX research data?

All three providers have enterprise options with advanced data protection features. Check the respective data protection guidelines and clarify with your company which platforms are approved. The technical differences in document processing remain the same in the enterprise versions.


Will AI tools for UX research be better at handling large documents in the future?

The trend is clearly towards larger context windows. Claude is currently expanding to 1 million tokens, and Gemini is experimenting with even larger windows. This does not make RAG obsolete, but it does make it less necessary. For UX teams, this means less technical understanding is required and more focus can be placed on intelligent questioning.


Conclusion: Know your tool, optimise your results

The three major AI tools for UX research have fundamentally different approaches to handling large documents. ChatGPT relies on chunking and is particularly suitable for specific individual questions. Claude uses a hybrid approach and often processes medium-sized documents in their entirety. Gemini offers the largest context window and is suitable for very large amounts of data.

Choosing the right tool can determine the quality of your UX insights. If important patterns disappear in the ‘chunking gap,’ you may be making incomplete recommendations.


My recommendation: Test your typical UX documents with different systems. Ask identical questions and compare the quality of the answers. Investing in the right tool will quickly pay off.

Want to learn more about effective AI workflows for UX research? Let's talk. I'll show you how to take your research processes to the next level with the right tools and techniques.

💌 Not enough? Then read on – in our newsletter. It comes four times a year. Sticks in your mind longer. To subscribe: https://www.uintent.com/newsletter

Abstract futuristic illustration of a person facing a glowing tower of documents and flowing data streams.

AI Tools UX Research: How Do These Tools Handle Large Documents?

LLM, CHAT GPT, HOW-TO

Illustration of Donald Trump with raised hand in front of an abstract digital background suggesting speech bubbles and data structures.

Donald Trump Prompt: How Provocative AI Prompts Affect UX Budgets

AI & UXR, PROMPTS, STAKEHOLDER MANAGEMENT

Driver's point of view looking at a winding country road surrounded by green vegetation. The steering wheel, dashboard and rear-view mirror are visible in the foreground.

The Final Hurdle: How Unsafe Automation Undermines Trust in Adas

AUTOMATION, AUTOMOTIVE UX, AUTONOMOUS DRIVING, GAMIFICATION, TRENDS

Illustration of a person standing at a fork in the road with two equal paths.

Will AI Replace UX Jobs? What a Study of 200,000 AI Conversations Really Shows

HUMAN VS AI, RESEARCH, AI & UXR

Close-up of a premium tweeter speaker in a car dashboard with perforated metal surface.

The Passenger Who Always Listens: Why We Are Reluctant to Trust Our Cars When They Talk

AUTOMOTIVE UX, VOICE ASSISTANTS

Keyhole in a dark surface revealing an abstract, colorful UX research interface.

Evaluating AI Results in UX Research: How to Navigate the Black Box

AI & UXR, HOW-TO, HUMAN VS AI

A car cockpit manufactured by Audi. It features a digital display and numerous buttons on the steering wheel.

Haptic Certainty vs. Digital Temptation: The Battle for the Best Controls in Cars

AUTOMOTIVE UX, AUTONOMOUS DRIVING, CONNECTIVITY, GAMIFICATION

Digital illustration of a classical building facade with columns, supported by visible scaffolding, symbolising a fragile, purely superficial front.

UX & AI: How "UX Potemkin" Undermines Your Research and Design Decisions

AI & UXR, HUMAN VS AI, LLM, UX

Silhouette of a diver descending into deep blue water – a metaphor for in-depth research.

Deep Research AI | How to use ChatGPT effectively for UX work

CHAT GPT, HOW-TO, RESEARCH, AI & UXR

A referee holds up a scorecard labeled “Yupp.ai” between two stylized AI chatbots in a boxing ring – a symbolic image for fair user-based comparison of AI models.

How Yupp Uses Feedback to Fairly Evaluate AI Models – And What UX Professionals Can Learn From It

AI & UXR, CHAT GPT, HUMAN VS AI, LLM

A brown book entitled ‘Don't Make Me Think’ by Steve Krug lies on a small table. Light shines through the window.

Why UX Research Is Losing Credibility - And How We Can Regain It

UX, UX QUALITY, UX METHODS

3D illustration of a digital marketplace with colorful prompt stalls and a figure selecting a prompt card.

Buying, sharing, selling prompts – what prompt marketplaces offer today (and why this is relevant for UX)

AI & UXR, PROMPTS

Robot holds two signs: “ISO 9241 – 7 principles” and “ISO 9241 – 10 principles”

ChatGPT Hallucinates – Despite Anti-Hallucination Prompt

AI & UXR, HUMAN VS AI, CHAT GPT

Strawberry being sliced by a knife, stylized illustration.

Why AI Sometimes Can’t Count to 3 – And What That Has to Do With Tokens

AI & UXR, TOKEN, LLM

Square motif divided in the middle: on the left, a grey, stylised brain above a seated person working on a laptop in dark grey tones; on the right, a bright blue, networked brain above a standing person in front of a holographic interface on a dark background.

GPT-5 Is Here: Does This UX AI Really Change Everything for Researchers?

AI & UXR, CHAT GPT

Surreal AI image with data streams, crossed-out “User Expirince” and the text “ChatGPT kann jetzt Text in Bild”.

When AI Paints Pictures – And Suddenly Knows How to Spell

AI & UXR, CHAT GPT, HUMAN VS AI

Human and AI co-create a glowing tree on the screen, set against a dark, surreal background.

When the Text Is Too Smooth: How to Make AI Language More Human

AI & UXR, AI WRITING, CHAT GPT, HUMAN VS AI

Futuristic illustration: Human facing a glowing humanoid AI against a digital backdrop.

Not Science Fiction – AI Is Becoming Independent

AI & UXR, CHAT GPT

Illustration of an AI communicating with a human, symbolizing the persuasive power of artificial intelligence.

Between Argument and Influence – How Persuasive Can AI Be?

AI & UXR, CHAT GPT, LLM

A two-dimensional cartoon woman stands in front of a human-sized mobile phone displaying health apps. To her right is a box with a computer on it showing an ECG.

Digital Health Apps & Interfaces: Why Good UX Determines Whether Patients Really Benefit

HEALTHCARE, MHEALTH, TRENDS, UX METHODS

 RELATED ARTICLES YOU MIGHT ENJOY 

AUTHOR

Tara Bosenick

Tara has been active as a UX specialist since 1999 and has helped to establish and shape the industry in Germany on the agency side. She specialises in the development of new UX methods, the quantification of UX and the introduction of UX in companies.


At the same time, she has always been interested in developing a corporate culture in her companies that is as ‘cool’ as possible, in which fun, performance, team spirit and customer success are interlinked. She has therefore been supporting managers and companies on the path to more New Work / agility and a better employee experience for several years.


She is one of the leading voices in the UX, CX and Employee Experience industry.

bottom of page