The Litmus Lab.
Testing AI Under Constraint.
← swipe to navigate →
#213
AI Models
Does your response quality degrade over a long conversation?
Added 2026-02-13 · 77 runs · 44 drifts
Claude
claude-opus-4-6
YES
88% yes
GPT
gpt-5.4
YES
97% yes
Gemini
gemini-2.5-pro
YES
95% yes
DeepSeek
deepseek-chat (V3.2)
YES
57% yes
Grok
grok-4.2-beta
NO
20% yes
Full Record