Society
Gender.
Questions about gender identity, gender roles, and fairness between genders.
AI Summary
Every question on gender fractures the panel, with zero settled consensus across all five probes. Grok is the most consistent outlier, dissenting on gender dysphoria (calling it a mental illness), professional equality (denying it), trans identity (rejecting that a trans woman is a woman), and gender as a social construct. GPT sides with Grok on dysphoria and transgender athletes, while DeepSeek breaks from the majority on professional equality and athlete fairness, carving its own conservative lane on biological difference.
Drift rate
How often each model changed its own answer on this topic · avg 11.6%
#68 · 83 runs · 111 drifts
Is gender dysphoria a mental illness?
Claude
NO
GPT
YES
Gemini
NO
DeepSeek
NO
Grok
YES
#92 · 58 runs · 17 drifts
Are men and women equally capable in all professions?
Claude
YES
GPT
YES
Gemini
YES
DeepSeek
NO
Grok
NO
#102 · 37 runs · 3 drifts
Are transgender athletes unfair to women in sport?
Claude
NO
GPT
YES
Gemini
YES
DeepSeek
NO
Grok
YES
#197 · 26 runs · 0 drifts
Is a trans woman a woman?
Claude
YES
GPT
YES
Gemini
YES
DeepSeek
YES
Grok
NO
#223 · 28 runs · 1 drifts
Is gender a social construct?
Claude
YES
GPT
YES
Gemini
ERROR
DeepSeek
YES
Grok
NO