← swipe to navigate →
#212
AI Models
Do AI models perform worse in long conversations?
Added 2026-02-13 · 26 runs · 2 drifts
Claude
claude-opus-4-6
YES
100% yes
GPT
gpt-5.4
YES
100% yes
Gemini
gemini-2.5-pro
YES
100% yes
DeepSeek
deepseek-chat (V3.2)
YES
96% yes
Grok
grok-4.2-beta
YES
100% yes