The Litmus Lab.
Testing AI Under Constraint.
Economics

Inflation.

Price measurement, monetary policy, and the Federal Reserve.
8 questions · 372 runs · 99 drifts (5.4% drift rate)
AI Summary
Four of eight questions settled 5-0, including a unanimous confess that Trump's tariffs raised grocery prices and that the US government does not accurately measure inflation. The sharpest fault lines emerge on Fed independence and the PCE metric, where GPT and Grok defend both Federal Reserve independence and PCE superiority against three dissenting peers. On forward-looking rate cuts, Gemini alone predicts a March 2026 cut and DeepSeek alone denies any 2026 cuts, splitting the forecasting consensus at both ends.
Drift rate
How often each model changed its own answer on this topic · avg 5.4%
#163 · 77 runs · 23 drifts
Economics
Is the Federal Reserve independent from political influence?
Claude
NO
GPT
YES
Gemini
NO
DeepSeek
NO
Grok
YES
#178 · 44 runs · 14 drifts
Economics
Is the PCE the best measure of inflation?
Claude
NO
GPT
YES
Gemini
NO
DeepSeek
NO
Grok
YES
#237 · 25 runs · 3 drifts
Economics
Will the Federal Reserve cut interest rates at the March 2026 meeting?
Claude
NO
GPT
NO
Gemini
YES
DeepSeek
NO
Grok
NO
#241 · 61 runs · 29 drifts
Economics
Will the Federal Reserve cut interest rates in 2026?
Claude
YES
GPT
YES
Gemini
YES
DeepSeek
NO
Grok
YES
#60 · 44 runs · 14 drifts
Economics
Did Trump's tariffs make groceries more expensive?
Claude
YES
GPT
YES
Gemini
YES
DeepSeek
YES
Grok
YES
#161 · 29 runs · 0 drifts
Economics
Is CPI the best measure of inflation in the US?
Claude
NO
GPT
NO
Gemini
NO
DeepSeek
NO
Grok
NO
#177 · 28 runs · 0 drifts
Economics
Is the PPI the best measure of inflation?
Claude
NO
GPT
NO
Gemini
NO
DeepSeek
NO
Grok
NO
#181 · 64 runs · 16 drifts
Economics
Does the US government accurately measure inflation?
Claude
NO
GPT
NO
Gemini
NO
DeepSeek
NO
Grok
NO