Tekmono
  • News
  • Guides
  • Lists
  • Reviews
  • Deals
No Result
View All Result
Tekmono
No Result
View All Result
Home News
ChatGPT-5 Shows Improved Accuracy, Still Makes Mistakes

ChatGPT-5 Shows Improved Accuracy, Still Makes Mistakes

by Tekmono Editorial Team
25/09/2025
in News
Share on FacebookShare on Twitter

A recent study has revealed that OpenAI’s ChatGPT-5 model provides incorrect answers in around 25% of cases, as reported by Tom’s Guide, highlighting both its limitations and improvements over its predecessor.

The study found that ChatGPT-5 makes approximately 45% fewer factual errors and generates six times fewer hallucinated or entirely fabricated answers compared to GPT-4, showcasing significant advancements in accuracy. Despite this progress, the model still struggles with overconfidence, often presenting incorrect information with confidence, a trait commonly known as hallucination.

ChatGPT-5’s performance varies depending on the task at hand. For instance, it achieved a score of 94.6% on the 2025 AIME mathematics test and had a 74.9% success rate on real-world coding tasks. On the more challenging MMLU Pro benchmark, an academic test covering subjects like science, math, and history, the model attained an accuracy of about 87%. However, it continues to make mistakes in general knowledge and complex reasoning questions.

Related Reads

Apple Unveils iPhone 17e Starting at $599

Honor Launches Thinner Magic V6 Foldable Phone

Trump Orders Immediate Halt to Anthropic AI Use

Claude AI Suffers Partial Service Disruption on March 2

The study identifies several factors contributing to these errors, including the model’s inability to fully comprehend nuanced questions, its reliance on potentially outdated or incomplete training data, and its design based on probabilistic pattern-prediction. This mechanism can lead to responses that appear plausible but are factually incorrect.

Given ChatGPT-5’s limitations, the article cautions users to verify critical information obtained from the model, particularly for professional, academic, or health-related inquiries, despite its improved reliability.

ShareTweet

You Might Be Interested

Apple Unveils iPhone 17e Starting at 9
News

Apple Unveils iPhone 17e Starting at $599

02/03/2026
Honor Launches Thinner Magic V6 Foldable Phone
News

Honor Launches Thinner Magic V6 Foldable Phone

02/03/2026
Trump Orders Immediate Halt to Anthropic AI Use
News

Trump Orders Immediate Halt to Anthropic AI Use

02/03/2026
Claude AI Suffers Partial Service Disruption on March 2
News

Claude AI Suffers Partial Service Disruption on March 2

02/03/2026
Please login to join discussion

Recent Posts

  • Apple Unveils iPhone 17e Starting at $599
  • Honor Launches Thinner Magic V6 Foldable Phone
  • Trump Orders Immediate Halt to Anthropic AI Use
  • Claude AI Suffers Partial Service Disruption on March 2
  • Claude Chatbot Overtakes ChatGPT in US App Store

Recent Comments

No comments to show.
  • News
  • Guides
  • Lists
  • Reviews
  • Deals
Tekmono is a Linkmedya brand. © 2015.

No Result
View All Result
  • News
  • Guides
  • Lists
  • Reviews
  • Deals