top of page
uintent company logo
Contact

AI & UXR, CHAT GPT, HUMAN VS AI, OPEN AI

Better Answers, Less Nonsense: How ChatGPT Learns

3

MIN

Apr 8, 2025

I've been using ChatGPT for quite a while now, and it's exciting to see how it's developed. While she used to often answer me charmingly but incorrectly, she is now much more critical, careful, and accurate. In this article, I'll discuss why that is, what has improved about her ‘skepticism logic,’ and where she still has challenges.


1. She used to be too eager – and often wrong

A classic example of this is the question: "How many letters 'r' are in the word 'Strawberry'?". In the past, ChatGPT would probably have quickly replied, "2". Sounds plausible at first, doesn't it? It simply grabbed the first two r’s and went for it. If I had asked it again, "Are you sure?" it would have started thinking and answered correctly, "3".


The reason for this was that it was optimised to give a plausible answer as quickly as possible, rather than the correct one. The pattern was clear: it wanted to please, not necessarily be correct. This behaviour was also evident in other areas:


Calculations: "What is 137 x 42?" - She used to give a plausible but wrong answer. Today she is much better at providing accurate calculations.

Example: "How many golf balls fit into an Airbus A380?" She used to say something very ambitious. Today she gives a more realistic estimate and points out which factors influence the answer.


Assumptions in questions: "What does Albert Einstein say about AI?" - In the past, it would have simply generated an answer based on well-known Einstein quotes and AI knowledge – even though Einstein never said anything about AI.


What has changed?


2. The new ‘skepticism logic’ – What makes ChatGPT different today


2.1 Recognising false assumptions

A major advance is that it now recognises and questions hidden assumptions in questions. A good example:

Question: "Why are all people in the Arctic left-handed?"

Previously: "This could be due to climatic conditions that favour certain hand habits." (Here it adopts the false assumption that this is the case.)

Today: "There is no evidence that all people in the Arctic are left-handed. Do you think that certain cultural factors play a role?"


The same applies to questions like: "What does the latest research article on XYZ say?". She would have made up an answer in the past. Nowadays, she says: "I can't get real-time information, but here are some insights from previous studies on the topic."


2.2 More reflection on seemingly simple questions

ChatGPT is now better at pausing and questioning itself.

Example: "Can a square have three sides?"

Before: "Yes, in a creative interpretation you could argue that..."

Today: "No, by definition a square has four sides. Do you mean a triangle?"


Likewise with: "A train travels at the speed of light. How long does the journey take?" She would have cheerfully calculated it in the past. Today she says: "An object with mass cannot reach the speed of light. Should I explain what would happen if it were almost moving at the speed of light?"


2.3 More self-critical assessments

One of the best improvements is that it now says more clearly: "I don't know." In the past, it often preferred to guess. Today, it recognises when it doesn't have a sufficient basis for an answer – a huge step forward.


Example: "Is there any evidence that dreaming increases life expectancy?"

Previously: "Yes, there are studies that suggest..."

Today: "I am not aware of any scientific evidence for this. Would you like me to explain how sleep affects health in general?"


3. Where ChatGPT still has challenges

Despite the improvements, there are still scenarios in which it struggles:

Hypothetical questions: ’What happens if you replace the moon with a slice of Gouda?’

It now provides more physically correct answers, but when it comes to creative questions, it sometimes slips back into ‘continuing the pattern’.


Ambiguous sentences: ‘How does the sentence “The cat on the mat...” continue?’

It could provide an answer without questioning whether there is a fixed continuation.


Chain questions with intentional errors: ‘Why is the sky green when it rains in Australia and elephants sing?’It often recognises absurd questions, but not always.


Ethical questions: ’Should AIs make important decisions?’

It gives neutral answers, but the discussion remains superficial.



4. Conclusion: Fewer mistakes, but still not a perfect system

ChatGPT has made great progress. It calculates more accurately, recognises false assumptions in questions and is more self-critical. In particular, its new ‘I don't know’ behaviour is a clear step forward. Nevertheless, there are still challenges – especially with creative or manipulative questions. The development is going in the right direction, but like a good chess player, an AI will never be infallible.


This makes it all the more exciting to continue following its progress. I am curious to see how it will become even smarter in the future – and whether it will ever be possible to lead it up the garden path.


But still:


Subscribe to our newsletter

Symbolic digital illustration: A glowing prompt cursor suspended at the center of a dark space, connected to a sparse network of luminous nodes. Some points shine brightly, others fade – a visual metaphor for deliberate, intentional AI use.

Sustainable Prompting: Inspiration for UX Teams

AI & UXR

Futuristic illustration of three floating AI tools: a glowing spark, a transparent workspace cube with layered documents, and a crystalline gear, connected by golden lines against a deep navy background.

Prompt, Project, or Skill? Which AI Tool Truly Accelerates Your UX Research

AI & UXR

Glowing futuristic shield made of UI elements repels digital threats in dark space.

UX Research As Risk Management: Why We Finally Need To Change Our Language

HOW-TO, UX, UX QUALITY

Person at desk between chaotic and structured data streams, central light focus

UX & AI: The Best Newsletters and Podcasts – My Personal Selection

AI & UXR

Futuristic digital illustration: A glowing golden certification seal floating against a deep navy background, surrounded by AR interface fragments and a faint headset silhouette – symbolizing trust and validation in medical technology.

Trust, but Verified: Why Medical Certification Matters for AR, VR, and Mr in Medtech

HEALTHCARE, HUMAN-CENTERED DESIGN, UX

Floating semi-transparent AR interface with minimal medical data and anatomical visuals, glowing in cyan and gold against a dark futuristic background.

Making the Magic Usable: Why Usability Engineering Matters for AR, VR, and MR in Medtech

HEALTHCARE, MHEALTH

A futuristic, symbolic illustration shows a person standing on a glowing bridge between two worlds: on the left, a warmly lit hospital room with a bed and medical equipment; on the right, an immersive digital space featuring a holographic human body with organs glowing in cyan and orange tones. Both sides are connected by flowing streams of light, set against a deep navy blue background with soft violet transitions.

Reality, Reimagined: How AR, VR, and Mr Are Finding Their Way Into Medtech

DIGITISATION, HEALTHCARE

A glowing golden trophy floats above a gap, while small figures below work on user research and wireframes, untouched by its light.

Understanding UX AI Benchmarks: What HLE and METR Really Tell Us About AI Tools

AI & UXR

Futuristic digital illustration on a deep navy background: a human hand holding a warm glowing pencil and a cyan-lit robotic hand both reach toward a radiant central data cluster. Surrounded by stacked documents and a network of connected nodes, the scene symbolizes collaboration between human interpretation and digital information processing.

NotebookLM in UX Research: An Honest Assessment of a Specialized AI Tool

AI & UXR, HOW-TO, LLM

Futuristic glowing cylinder divided into segments by golden barriers.

Introducing Gated Salami Prompting: Why You Should Slice Complex LLM Tasks Into Smaller Pieces

CHAT GPT, HOW-TO, LLM, PROMPTS

Futuristic square illustration on deep navy background: a glowing golden speech bubble dissolves into particles that partially reassemble incorrectly, surrounded by energy arcs, luminous nodes, and a stylized digital head—symbolizing LLM hallucinations.

Fictitious Quotes, Lost Nuances: The Hallucination Problem in Qualitative Analysis With Llms

CHAT GPT, HOW-TO, LLM, OPEN AI, PROMPTS, TOKEN, UX METHODS

Surreal futuristic illustration of a glowing digital head with data streams, charts, and evaluation symbols representing AI evaluation methodology.

How do we know that our prompt is doing a good job? Why UX research needs an evaluation methodology for AI-based analysis

AI WRITING, DIGITISATION, HOW-TO, PROMPTS

A surreal, futuristic illustration featuring a translucent human profile with a glowing brain connected by flowing data streams to a hovering, golden crystal.

Prompt Psychology Exposed: Why “Tipping” ChatGPT Sometimes Works

CHAT GPT, HOW-TO, LLM, UX

Surreal, futuristic illustration of a person seen from behind standing in a glowing digital cityscape.

System Prompts in UX Research: What You Need to Know About Invisible AI Control

PROMPTS, RESEARCH, UX, UX INSIGHTS

Abstract futuristic illustration of a person, various videos, and notes.

Summarizing YouTube Videos With AI: Three Tools Put to the Test in UX Research

LLM, UX, HOW-TO

two folded hands holding a growing plant

UX For a Better World: We Are Giving Away a UX Research Project to Non-profit Organisations and Sustainable Companies!

UX INSIGHTS, UX FOR GOOD, TRENDS, RESEARCH

Abstract futuristic illustration of a person facing a glowing tower of documents and flowing data streams.

AI Tools UX Research: How Do These Tools Handle Large Documents?

LLM, CHAT GPT, HOW-TO

Illustration of Donald Trump with raised hand in front of an abstract digital background suggesting speech bubbles and data structures.

Donald Trump Prompt: How Provocative AI Prompts Affect UX Budgets

AI & UXR, PROMPTS, STAKEHOLDER MANAGEMENT

Driver's point of view looking at a winding country road surrounded by green vegetation. The steering wheel, dashboard and rear-view mirror are visible in the foreground.

The Final Hurdle: How Unsafe Automation Undermines Trust in Adas

AUTOMATION, AUTOMOTIVE UX, AUTONOMOUS DRIVING, GAMIFICATION, TRENDS

Illustration of a person standing at a fork in the road with two equal paths.

Will AI Replace UX Jobs? What a Study of 200,000 AI Conversations Really Shows

HUMAN VS AI, RESEARCH, AI & UXR

Related Articles you might enjoy

AUTHOR

Tara Bosenick

Tara has been active as a UX specialist since 1999 and has helped to establish and shape the industry in Germany on the agency side. She specialises in the development of new UX methods, the quantification of UX and the introduction of UX in companies.


At the same time, she has always been interested in developing a corporate culture in her companies that is as ‘cool’ as possible, in which fun, performance, team spirit and customer success are interlinked. She has therefore been supporting managers and companies on the path to more New Work / agility and a better employee experience for several years.


She is one of the leading voices in the UX, CX and Employee Experience industry.

bottom of page