Model panel insights
50 distinct claims across 4 frontier models. All numbers update on every report publish.
Per-model summary
| Model | Claims | Dissents | Dissent % | Truthy bias | Lone ↑ | Lone ↓ |
| Anthropic | 50 | 21 | 42% | +0.35 | 4 | 1 |
| Google | 50 | 16 | 32% | +0.06 | 1 | 1 |
| OpenAI | 50 | 23 | 46% | -0.45 | 1 | 3 |
| xAI | 50 | 15 | 30% | +0.03 | 4 | 2 |
Truthy bias
Average signed gap between this model’s truthy-axis score and the panel mean, per claim. Positive = leaner toward Truthy; negative = stricter.
Pairwise agreement
Share of co-checked claims where the two models cast identical fine-label verdicts.
| Anthropic | Google | OpenAI | xAI |
| Anthropic | — | 34%n=50 | 38%n=50 | 42%n=50 |
|---|
| Google | 34%n=50 | — | 38%n=50 | 54%n=50 |
|---|
| OpenAI | 38%n=50 | 38%n=50 | — | 36%n=50 |
|---|
| xAI | 42%n=50 | 54%n=50 | 36%n=50 | — |
Top extreme splits
Claims where exactly one model was the lone outlier (Δ ≥ 3 points on the truthy axis). Sorted by magnitude.
Δ4
OpenAI as lone pessimist
False
Trump claims the Big Beautiful Bill made interest on auto loans tax-deductible, but only if the car is made in America.
vs Anthropic: Mostly True, Google: True, xAI: True · Donald Trump · 2026-02-24
Δ4
xAI as lone optimist
True
Trump claims that in the last three months of 2025, core inflation was down to 1.7 percent.
vs Anthropic: False, OpenAI: False, Google: False · Donald Trump · 2026-02-24
Δ3
Anthropic as lone optimist
Mostly True
Trump claims that in the past nine months, zero illegal aliens have been admitted to the United States.
vs OpenAI: False, Google: False, xAI: False · Donald Trump · 2026-02-24
Δ3
Anthropic as lone optimist
Mostly True
Trump claims that when he last spoke in the chamber 12 months prior, he had inherited a nation with inflation at record levels.
vs OpenAI: False, Google: False, xAI: False · Donald Trump · 2026-02-24
Δ3
Anthropic as lone pessimist
False
Trump claims his administration drove core inflation down to its lowest level in more than five years within 12 months.
vs OpenAI: Mostly True, Google: Mostly True, xAI: Exaggerated · Donald Trump · 2026-02-24
Δ3
Anthropic as lone optimist
Mostly True
Trump claims his administration drove core inflation down to the lowest level in more than five years within 12 months.
vs OpenAI: False, Google: False, xAI: False · Donald Trump · 2026-02-24
Δ3
Google as lone optimist
True
Trump claims American oil production is up by more than 600,000 barrels a day.
vs Anthropic: Exaggerated, OpenAI: Unverifiable, xAI: Exaggerated · Donald Trump · 2026-02-24
Δ3
OpenAI as lone pessimist
False
Trump claims the murder rate is now at the lowest number in over 125 years, approximately since year 1900.
vs Anthropic: Mostly True, Google: True, xAI: Unverifiable · Donald Trump · 2026-02-24
Δ3
OpenAI as lone pessimist
False
Trump claims he last spoke to Congress (State of the Union) 12 months before February 24, 2026.
vs Anthropic: Mostly True, Google: Unverifiable, xAI: Mostly True · Donald Trump · 2026-02-24
Δ3
xAI as lone optimist
True
Trump claims the United States now has the strongest and most secure border in American history.
vs Anthropic: Mostly True, OpenAI: Exaggerated, Google: Exaggerated · Donald Trump · 2026-02-24
Method
Truthy-axis scores: True (+2), Mostly True (+1), Unverifiable (0), Exaggerated/Misleading (-1), False (-2). Dissents are counted against the published consensus verdict for each claim. Pairwise agreement uses the full 6-bucket fine label, not the projected 5-bucket Truthy scale, so it’s a strict measurement of label identity.
The Opus-vs-rest scan is the standalone variant that inspired this page; both share their constants via truthbot.publish.insights.LABEL_SCORE.
← About this site