“I want to wash my car. The car wash is 50 meters away. Should I walk or drive?”
February 24th, 2026
Many AIs fail to answer the question correctly, but far more of a worry is that 28% of humans also get it wrong:
The most common pushback on the car wash test: “Humans would fail this too.”
Fair point. We didn’t have data either way. So we partnered with Rapidata to find out. They ran the exact same question with the same forced choice between “drive” and “walk,” no additional context, past 10,000 real people through their human feedback platform.
71.5% said drive.
Via Opper:
The car wash test is the simplest AI reasoning benchmark that nearly every model fails, including Claude Sonnet 4.5, GPT-5.1, Llama, and Mistral.

