Operational Pipeline · v0.2.0-beta · Multi-model · 2026-04-30 20:01 UTC

Automated political fact-checking with multi-model consensus analysis.

Model panel insights

50 distinct claims, each checked by 4 frontier models. All numbers refresh on every report publish.

Per-model summary

Model      Claims  Dissents  Dissent %  Truthy bias  Lone ↑  Lone ↓
Anthropic  50      21        42%        +0.35        4       1
Google     50      16        32%        +0.06        1       1
OpenAI     50      23        46%        -0.45        1       3
xAI        50      15        30%        +0.03        4       2

Truthy bias

Average signed gap between this model’s truthy-axis score and the panel mean, per claim. Positive = leans toward Truthy; negative = stricter than the panel. A computation sketch follows the values below.

Anthropic  +0.35
Google     +0.06
OpenAI     -0.45
xAI        +0.03
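
A minimal sketch of this computation, assuming a hypothetical scores_by_claim mapping of claim id to each model's truthy-axis score; whether the panel mean includes the model itself is also an assumption here, not something this page states.

```python
from statistics import mean

def truthy_bias(scores_by_claim: dict[str, dict[str, int]], model: str) -> float:
    """Average signed gap between `model`'s truthy-axis score and the
    panel mean, over the claims that model checked."""
    gaps = []
    for panel in scores_by_claim.values():
        if model not in panel:
            continue  # skip claims this model did not check
        # Panel mean includes the model itself here; the real pipeline
        # may exclude it (assumption).
        gaps.append(panel[model] - mean(panel.values()))
    return mean(gaps)

# Example: True (+2) against Mostly True (+1), Mostly True (+1),
# Exaggerated/Misleading (-1) -> a gap of +1.25 on this one claim.
scores = {"claim-001": {"Anthropic": 2, "Google": 1, "OpenAI": -1, "xAI": 1}}
print(truthy_bias(scores, "Anthropic"))  # 1.25
```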

Pairwise agreement

Share of co-checked claims where the two models cast identical fine-label verdicts; a computation sketch follows the matrix.

           Anthropic    Google       OpenAI       xAI
Anthropic  -            34% (n=50)   38% (n=50)   42% (n=50)
Google     34% (n=50)   -            38% (n=50)   54% (n=50)
OpenAI     38% (n=50)   38% (n=50)   -            36% (n=50)
xAI        42% (n=50)   54% (n=50)   36% (n=50)   -
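
A sketch of a single matrix cell under the same assumed data shape; fine_labels_by_claim is hypothetical, and equality is tested on the raw 6-bucket fine label, as the definition above specifies.

```python
def pairwise_agreement(fine_labels_by_claim: dict[str, dict[str, str]],
                       a: str, b: str) -> tuple[float, int]:
    """Share of co-checked claims where models `a` and `b` cast
    identical fine-label verdicts, plus the co-check count n.
    Assumes at least one co-checked claim exists for the pair."""
    pairs = [(labels[a], labels[b])
             for labels in fine_labels_by_claim.values()
             if a in labels and b in labels]
    identical = sum(1 for la, lb in pairs if la == lb)
    return identical / len(pairs), len(pairs)
```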

Top extreme splits

Claims where exactly one model was a lone outlier (Δ ≥ 3 points on the truthy axis), sorted by Δ magnitude; a detection sketch follows.
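
A sketch of the lone-outlier test, assuming Δ is the gap between a model's truthy-axis score and the mean of the other models' scores; the page does not pin down the exact Δ definition, so treat this as one plausible reading.

```python
from statistics import mean

def lone_outlier(panel: dict[str, float], delta: float = 3.0):
    """Return (model, gap) when exactly one model sits `delta` or more
    truthy-axis points from the mean of the rest of the panel."""
    hits = []
    for model, score in panel.items():
        rest = [s for m, s in panel.items() if m != model]
        gap = score - mean(rest)
        if abs(gap) >= delta:
            hits.append((model, gap))
    # Zero or multiple outliers means the claim is not an extreme split.
    return hits[0] if len(hits) == 1 else None

# Example: one model at True (+2) against three at False (-2).
print(lone_outlier({"Anthropic": 2, "Google": -2, "OpenAI": -2, "xAI": -2}))
# ('Anthropic', 4.0)
```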

Method

Truthy-axis scores: True (+2), Mostly True (+1), Unverifiable (0), Exaggerated/Misleading (-1), False (-2). Dissents are counted against the published consensus verdict for each claim. Pairwise agreement uses the full 6-bucket fine label, not the projected 5-bucket Truthy scale, so it is a strict measure of label identity.

The Opus-vs-rest scan is the standalone variant that inspired this page; both share their constants via truthbot.publish.insights.LABEL_SCORE.
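
For reference, a local stand-in for what that shared constant could look like; only the dotted path truthbot.publish.insights.LABEL_SCORE comes from this page, the individual fine-label names are inferred from the 6-vs-5 bucket note above, and the dissent helper is an illustrative assumption that compares fine labels directly.

```python
# Illustrative stand-in for truthbot.publish.insights.LABEL_SCORE:
# six fine labels project onto the 5-point truthy axis, with
# Exaggerated and Misleading both mapping to -1 (label names assumed).
LABEL_SCORE = {
    "True": +2,
    "Mostly True": +1,
    "Unverifiable": 0,
    "Exaggerated": -1,
    "Misleading": -1,
    "False": -2,
}

def dissent_count(model_verdicts: dict[str, str],
                  consensus: dict[str, str]) -> int:
    """Hypothetical helper: count claims where a model's fine label
    differs from the published consensus verdict."""
    return sum(1 for claim, label in model_verdicts.items()
               if label != consensus.get(claim))
```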
