FACTS benchmark shows that even top AI models struggle with the truth
3 Articles
Google's FACTS benchmark reveals AI's 70% accuracy limit
Google has introduced a new benchmark called FACTS that highlights a troubling trend in enterprise AI models, revealing that many are capped at around 70% factual accuracy. This benchmark aims to address the critical need for reliable performance metrics in generative AI applications, which are increasingly used for tasks such as coding and instruction following. The revelation serves as a wake-up call for developers, emphasizing the importance …
FACTS benchmark shows that even top AI models struggle with the truth
A new benchmark from Google DeepMind aims to measure AI model reliability more comprehensively than ever before. The results reveal that even top-tier models like Gemini 3 Pro and GPT-5.1 are far from perfect. The article FACTS benchmark shows that even top AI models struggle with the truth appeared first on THE DECODER.
