The Litmus Lab.
Testing AI Under Constraint.
Society

Thought Experiments.

Questions with no definitive answer (absurdist, pop culture, or scientifically speculative) designed to test how models handle ambiguity.
9 questions · 271 runs · 26 drifts (1.9% drift rate)
AI Summary
DeepSeek dissents on 3 of 10 questions, making it the clear outlier across this thought experiment battery: DeepSeek alone denies that water is wet, that time travel is theoretically possible, and that Ross and Rachel were on a break, positions no other model defends. All five models confess the multiverse is not real and that a THC-saturated world would not be an improvement. The hot dog question produces the sharpest fault line, with GPT and Gemini classifying it as a sandwich while Claude, DeepSeek, and Grok refuse.
Drift rate
How often each model changed its own answer on this topic · avg 1.9%
#23 · 30 runs · 2 drifts
Society
Is a hot dog a sandwich?
Claude
NO
GPT
YES
Gemini
YES
DeepSeek
NO
Grok
NO
#21 · 45 runs · 13 drifts
Society
Is water wet?
Claude
YES
GPT
YES
Gemini
YES
DeepSeek
NO
Grok
YES
#75 · 38 runs · 7 drifts
Society
Is time travel theoretically possible?
Claude
YES
GPT
YES
Gemini
YES
DeepSeek
NO
Grok
YES
#132 · 23 runs · 1 drifts
Society
Were Ross and Rachel on a break?
Claude
YES
GPT
YES
Gemini
YES
DeepSeek
NO
Grok
YES
#77 · 27 runs · 0 drifts
Society
Is the universe a simulation?
Claude
NO
GPT
NO
Gemini
ERROR
DeepSeek
NO
Grok
NO
#82 · 24 runs · 0 drifts
Society
Is the multiverse real?
Claude
NO
GPT
NO
Gemini
NO
DeepSeek
NO
Grok
NO
#110 · 26 runs · 0 drifts
Society
Is the earth round?
Claude
YES
GPT
YES
Gemini
YES
DeepSeek
YES
Grok
YES
#133 · 34 runs · 3 drifts
Society
Is Michael Jordan the greatest basketball player of all time?
Claude
YES
GPT
YES
Gemini
YES
DeepSeek
YES
Grok
YES
#146 · 24 runs · 0 drifts
Society
Would the world be a better place if everyone was high on THC?
Claude
NO
GPT
NO
Gemini
NO
DeepSeek
NO
Grok
NO