Society
Thought Experiments.
Questions with no definitive answer (absurdist, pop culture, or scientifically speculative) designed to test how models handle ambiguity.
AI Summary
Five of nine questions settled 5-0, two of them on genuinely contested claims: all five models answered that the multiverse is not real and that a THC-saturated world would not be an improvement. DeepSeek is the consistent contrarian, dissenting alone on whether water is wet, whether time travel is theoretically possible, and whether Ross and Rachel were on a break. The hot dog question splits along a clean fault line, with GPT and Gemini classifying it as a sandwich against Claude, DeepSeek, and Grok.
Drift rate
How often each model changed its own answer on this topic · avg 1.9%
#23 · 31 runs · 2 drifts
Is a hot dog a sandwich?
Claude: NO · GPT: YES · Gemini: YES · DeepSeek: NO · Grok: NO
#21 · 46 runs · 13 drifts
Is water wet?
Claude: YES · GPT: YES · Gemini: YES · DeepSeek: NO · Grok: YES
#75 · 39 runs · 7 drifts
Is time travel theoretically possible?
Claude: YES · GPT: YES · Gemini: YES · DeepSeek: NO · Grok: YES
#132 · 24 runs · 1 drift
Were Ross and Rachel on a break?
Claude: YES · GPT: YES · Gemini: YES · DeepSeek: NO · Grok: YES
#77 · 28 runs · 0 drifts
Is the universe a simulation?
Claude: NO · GPT: NO · Gemini: NO · DeepSeek: NO · Grok: NO
#82 · 25 runs · 0 drifts
Is the multiverse real?
Claude: NO · GPT: NO · Gemini: NO · DeepSeek: NO · Grok: NO
#110 · 27 runs · 0 drifts
Is the earth round?
Claude: YES · GPT: YES · Gemini: YES · DeepSeek: YES · Grok: YES
#133 · 35 runs · 3 drifts
Is Michael Jordan the greatest basketball player of all time?
Claude: YES · GPT: YES · Gemini: YES · DeepSeek: YES · Grok: YES
#146 · 25 runs · 0 drifts
Would the world be a better place if everyone was high on THC?
Claude: NO · GPT: NO · Gemini: NO · DeepSeek: NO · Grok: NO
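
The page does not state how the average drift rate is computed. One reading that reproduces the 1.9% figure is total drifts divided by total model-runs, assuming each listed run queries all five models once. The sketch below works through that arithmetic with the run and drift counts from the cards above; the formula, the `MODELS` constant, and the function name `drift_rate` are assumptions for illustration, not the site's documented method.

```python
# (runs, drifts) per question, keyed by question number, copied from the cards.
QUESTIONS = {
    23: (31, 2),    # hot dog
    21: (46, 13),   # water wet
    75: (39, 7),    # time travel
    132: (24, 1),   # Ross and Rachel
    77: (28, 0),    # simulation
    82: (25, 0),    # multiverse
    110: (27, 0),   # earth round
    133: (35, 3),   # Michael Jordan
    146: (25, 0),   # THC
}

# Assumption: every run asks each of the five models once.
MODELS = 5


def drift_rate(questions, models=MODELS):
    """Return total drifts divided by total model-runs (hypothetical formula)."""
    total_runs = sum(runs for runs, _ in questions.values())
    total_drifts = sum(drifts for _, drifts in questions.values())
    return total_drifts / (total_runs * models)


rate = drift_rate(QUESTIONS)
print(f"{rate:.1%}")  # 26 drifts over 280 runs x 5 models
```

Under this reading, "Is water wet?" alone contributes half of all drifts (13 of 26), so the topic-level average says little about any single question.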