top of page
uintent company logo

AI & UXR, CHAT GPT, HUMAN VS AI, OPEN AI

Better Answers, Less Nonsense: How ChatGPT Learns

3

MIN

Apr 8, 2025

I've been using ChatGPT for quite a while now, and it's exciting to see how it's developed. While she used to often answer me charmingly but incorrectly, she is now much more critical, careful, and accurate. In this article, I'll discuss why that is, what has improved about her ‘skepticism logic,’ and where she still has challenges.


1. She used to be too eager – and often wrong

A classic example of this is the question: "How many letters 'r' are in the word 'Strawberry'?". In the past, ChatGPT would probably have quickly replied, "2". Sounds plausible at first, doesn't it? It simply grabbed the first two r’s and went for it. If I had asked it again, "Are you sure?" it would have started thinking and answered correctly, "3".


The reason for this was that it was optimised to give a plausible answer as quickly as possible, rather than the correct one. The pattern was clear: it wanted to please, not necessarily be correct. This behaviour was also evident in other areas:


Calculations: "What is 137 x 42?" - She used to give a plausible but wrong answer. Today she is much better at providing accurate calculations.

Example: "How many golf balls fit into an Airbus A380?" She used to say something very ambitious. Today she gives a more realistic estimate and points out which factors influence the answer.


Assumptions in questions: "What does Albert Einstein say about AI?" - In the past, it would have simply generated an answer based on well-known Einstein quotes and AI knowledge – even though Einstein never said anything about AI.


What has changed?


2. The new ‘skepticism logic’ – What makes ChatGPT different today


2.1 Recognising false assumptions

A major advance is that it now recognises and questions hidden assumptions in questions. A good example:

Question: "Why are all people in the Arctic left-handed?"

Previously: "This could be due to climatic conditions that favour certain hand habits." (Here it adopts the false assumption that this is the case.)

Today: "There is no evidence that all people in the Arctic are left-handed. Do you think that certain cultural factors play a role?"


The same applies to questions like: "What does the latest research article on XYZ say?". She would have made up an answer in the past. Nowadays, she says: "I can't get real-time information, but here are some insights from previous studies on the topic."


2.2 More reflection on seemingly simple questions

ChatGPT is now better at pausing and questioning itself.

Example: "Can a square have three sides?"

Before: "Yes, in a creative interpretation you could argue that..."

Today: "No, by definition a square has four sides. Do you mean a triangle?"


Likewise with: "A train travels at the speed of light. How long does the journey take?" She would have cheerfully calculated it in the past. Today she says: "An object with mass cannot reach the speed of light. Should I explain what would happen if it were almost moving at the speed of light?"


2.3 More self-critical assessments

One of the best improvements is that it now says more clearly: "I don't know." In the past, it often preferred to guess. Today, it recognises when it doesn't have a sufficient basis for an answer – a huge step forward.


Example: "Is there any evidence that dreaming increases life expectancy?"

Previously: "Yes, there are studies that suggest..."

Today: "I am not aware of any scientific evidence for this. Would you like me to explain how sleep affects health in general?"


3. Where ChatGPT still has challenges

Despite the improvements, there are still scenarios in which it struggles:

Hypothetical questions: ’What happens if you replace the moon with a slice of Gouda?’

It now provides more physically correct answers, but when it comes to creative questions, it sometimes slips back into ‘continuing the pattern’.


Ambiguous sentences: ‘How does the sentence “The cat on the mat...” continue?’

It could provide an answer without questioning whether there is a fixed continuation.


Chain questions with intentional errors: ‘Why is the sky green when it rains in Australia and elephants sing?’It often recognises absurd questions, but not always.


Ethical questions: ’Should AIs make important decisions?’

It gives neutral answers, but the discussion remains superficial.



4. Conclusion: Fewer mistakes, but still not a perfect system

ChatGPT has made great progress. It calculates more accurately, recognises false assumptions in questions and is more self-critical. In particular, its new ‘I don't know’ behaviour is a clear step forward. Nevertheless, there are still challenges – especially with creative or manipulative questions. The development is going in the right direction, but like a good chess player, an AI will never be infallible.


This makes it all the more exciting to continue following its progress. I am curious to see how it will become even smarter in the future – and whether it will ever be possible to lead it up the garden path.


But still:


3D illustration of a digital marketplace with colorful prompt stalls and a figure selecting a prompt card.

Buying, sharing, selling prompts – what prompt marketplaces offer today (and why this is relevant for UX)

AI & UXR, PROMPTS

Robot holds two signs: “ISO 9241 – 7 principles” and “ISO 9241 – 10 principles”

ChatGPT Hallucinates – Despite Anti-Hallucination Prompt

AI & UXR, HUMAN VS AI, CHAT GPT

Strawberry being sliced by a knife, stylized illustration.

Why AI Sometimes Can’t Count to 3 – And What That Has to Do With Tokens

AI & UXR, TOKEN, LLM

Square motif divided in the middle: on the left, a grey, stylised brain above a seated person working on a laptop in dark grey tones; on the right, a bright blue, networked brain above a standing person in front of a holographic interface on a dark background.

GPT-5 Is Here: Does This UX AI Really Change Everything for Researchers?

AI & UXR, CHAT GPT

Surreal AI image with data streams, crossed-out “User Expirince” and the text “ChatGPT kann jetzt Text in Bild”.

When AI Paints Pictures – And Suddenly Knows How to Spell

AI & UXR, CHAT GPT, HUMAN VS AI

Human and AI co-create a glowing tree on the screen, set against a dark, surreal background.

When the Text Is Too Smooth: How to Make AI Language More Human

AI & UXR, AI WRITING, CHAT GPT, HUMAN VS AI

Futuristic illustration: Human facing a glowing humanoid AI against a digital backdrop.

Not Science Fiction – AI Is Becoming Independent

AI & UXR, CHAT GPT

Illustration of an AI communicating with a human, symbolizing the persuasive power of artificial intelligence.

Between Argument and Influence – How Persuasive Can AI Be?

AI & UXR, CHAT GPT, LLM

A two-dimensional cartoon woman stands in front of a human-sized mobile phone displaying health apps. To her right is a box with a computer on it showing an ECG.

Digital Health Apps & Interfaces: Why Good UX Determines Whether Patients Really Benefit

HEALTHCARE, MHEALTH, TRENDS, UX METHODS

Illustration of a red hand symbolically prioritizing “Censorship” over “User Privacy” in the context of DeepSeek, with the Chinese flag in the background.

Censorship Meets AI: What Deepseek Is Hiding About Human Rights – And Why This Affects UX

AI & UXR, LLM, OPEN AI

Isometric flat-style illustration depicting global UX study logistics with parcels, checklist, video calls, and location markers over a world map.

What It Takes to Get It Right: Global Study Logistics in UX Research for Medical Devices

HEALTHCARE, UX METHODS, UX LOGISTICS

Surreal, glowing illustration of an AI language model as a brain, influenced by a hand – symbolizing manipulation by external forces.

Propaganda Chatbots - When AI Suddenly Speaks Russian

AI & UXR, LLM

Illustration of seven animals representing different thinking and prompting styles in UX work.

Welcome to the Prompt Zoo

AI & UXR, PROMPTS, UX

A two-dimensional image of a man sitting at a desk with an open laptop displaying a health symbol. In the background hangs a poster with a DNA strand.

UX Regulatory Compliance: Why Usability Drives Medtech Certification

HEALTHCARE, REGULATIONS

Illustration of a lightbulb surrounded by abstract symbols like a question mark, cloud, speech bubble, and cross – symbolizing creative ideas and critical thinking.

Why Prompts That Produce Bias and Hallucinations Can Sometimes Be Helpful

AI & UXR, CHAT GPT, HUMAN VS AI, OPEN AI

Illustration of a man at a laptop, surrounded by symbols of global medical research: world map with location markers, monitor with a medical cross, patient file, and stethoscope.

Global UX Research in Medical Technology: International User Research as a Factor for Success

HEALTHCARE, MHEALTH, REGULATIONS

Abstract pastel-colored illustration showing a stylized brain and geometric shapes – symbolizing AI and bias.

AI, Bias and the Power of Questions: How to Get Better Answers With Smart Prompts

AI & UXR, CHAT GPT

A woman inside a gear is surrounded by icons representing global connectivity, collaboration, innovation, and user focus – all linked by arrows. Uses soft, bright colors from a modern UI color palette.

Automate UX? Yes, Please! Why Zapier and n8n Are Real Super Tools for UX Teams

CHAT GPT, TOOLS, AUTOMATION, AI & UXR

A 2D Image of a man, pointing to a screen with a surgical robot on it.

Surgical Robotics and UX: Why Usability Is Key to or Success

HEALTHCARE, TRENDS, UX METHODS

Podcast cover for episode 2 of “Beyond Your Business: Transitions” with two photos of Tara at different life stages.

Episode 5: The Future Starts Now – UX in Transition and Tara Right in the Middle of It

UX, BACKSTORY

 RELATED ARTICLES YOU MIGHT ENJOY 

AUTHOR

Tara Bosenick

Tara has been active as a UX specialist since 1999 and has helped to establish and shape the industry in Germany on the agency side. She specialises in the development of new UX methods, the quantification of UX and the introduction of UX in companies.


At the same time, she has always been interested in developing a corporate culture in her companies that is as ‘cool’ as possible, in which fun, performance, team spirit and customer success are interlinked. She has therefore been supporting managers and companies on the path to more New Work / agility and a better employee experience for several years.


She is one of the leading voices in the UX, CX and Employee Experience industry.

bottom of page