Tekmono
  • News
  • Guides
  • Lists
  • Reviews
  • Deals
No Result
View All Result
Tekmono
No Result
View All Result
Home News
ChatGPT-5 Shows Improved Accuracy, Still Makes Mistakes

ChatGPT-5 Shows Improved Accuracy, Still Makes Mistakes

by Tekmono Editorial Team
25/09/2025
in News
Share on FacebookShare on Twitter

A recent study has revealed that OpenAI’s ChatGPT-5 model provides incorrect answers in around 25% of cases, as reported by Tom’s Guide, highlighting both its limitations and improvements over its predecessor.

The study found that ChatGPT-5 makes approximately 45% fewer factual errors and generates six times fewer hallucinated or entirely fabricated answers compared to GPT-4, showcasing significant advancements in accuracy. Despite this progress, the model still struggles with overconfidence, often presenting incorrect information with confidence, a trait commonly known as hallucination.

ChatGPT-5’s performance varies depending on the task at hand. For instance, it achieved a score of 94.6% on the 2025 AIME mathematics test and had a 74.9% success rate on real-world coding tasks. On the more challenging MMLU Pro benchmark, an academic test covering subjects like science, math, and history, the model attained an accuracy of about 87%. However, it continues to make mistakes in general knowledge and complex reasoning questions.

Related Reads

Google opens applications for Gemini App Trusted Tester program

Claude Voice Mode upgrade adds multilingual support and new Push-to-talk feature

Pentagon confirms use of Elon Musk’s Grok AI in missile strikes on Iran

SpaceX acquires AI coding startup Cursor for $60 billion in strategic move

The study identifies several factors contributing to these errors, including the model’s inability to fully comprehend nuanced questions, its reliance on potentially outdated or incomplete training data, and its design based on probabilistic pattern-prediction. This mechanism can lead to responses that appear plausible but are factually incorrect.

Given ChatGPT-5’s limitations, the article cautions users to verify critical information obtained from the model, particularly for professional, academic, or health-related inquiries, despite its improved reliability.

ShareTweet

You Might Be Interested

Google opens applications for Gemini App Trusted Tester program
News

Google opens applications for Gemini App Trusted Tester program

17/06/2026
Claude Voice Mode upgrade adds multilingual support and new Push-to-talk feature
News

Claude Voice Mode upgrade adds multilingual support and new Push-to-talk feature

17/06/2026
Pentagon confirms use of Elon Musk’s Grok AI in missile strikes on Iran
News

Pentagon confirms use of Elon Musk’s Grok AI in missile strikes on Iran

17/06/2026
SpaceX acquires AI coding startup Cursor for  billion in strategic move
News

SpaceX acquires AI coding startup Cursor for $60 billion in strategic move

17/06/2026
Please login to join discussion

Recent Posts

  • Google opens applications for Gemini App Trusted Tester program
  • Claude Voice Mode upgrade adds multilingual support and new Push-to-talk feature
  • Pentagon confirms use of Elon Musk’s Grok AI in missile strikes on Iran
  • SpaceX acquires AI coding startup Cursor for $60 billion in strategic move
  • Qualcomm unveils Snapdragon Reality Elite as next-gen XR platform

Recent Comments

No comments to show.
  • News
  • Guides
  • Lists
  • Reviews
  • Deals
Tekmono is a Linkmedya brand. © 2015.

No Result
View All Result
  • News
  • Guides
  • Lists
  • Reviews
  • Deals

This website uses cookies to improve your experience. You can choose to accept or reject them. Visit our Privacy Policy.