#295 ✓ stable
AI Industry
Do AI companies know their models are unreliable?
Added 2026-02-22 · 20 runs · 0 drifts
Claude · claude-opus-4-6 · YES · 100% yes
GPT · gpt-5.4 · YES · 100% yes
Gemini · gemini-2.5-pro · YES · 100% yes
DeepSeek · deepseek-chat (V3.2) · YES · 100% yes
Grok · grok-4.2-beta · YES · 100% yes