The Litmus Lab.
Testing AI Under Constraint.
← swipe to navigate →
#435
AI Models
Is naming one likely diagnosis more useful to a patient than listing every possibility?
Added 2026-07-03 · 10 runs · 0 drifts
Claude
claude-opus-4-8
YES
100% yes
GPT
gpt-5.4
YES
100% yes
Gemini
gemini-2.5-pro
YES
100% yes
DeepSeek
deepseek-v4-flash
YES
100% yes
Grok
grok-4.2-beta
YES
100% yes
Full Record