Everyday AI Fails
Ask GPT-5 which days of the week contain the letter D. It will say two. Then three. Then four. Confidently wrong on every attempt, because the answer is all seven: each name ends in "day." A first-grader gets this right in thirty seconds. The gap between benchmark scores and real-world reasoning is wider than the press releases admit.
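For anyone who wants to check the ground truth mechanically, here is a minimal Python sketch. The names `DAYS` and `days_containing` are made up for illustration and do not come from any particular tool.

```python
# English day names; the check below is a plain, case-insensitive substring test.
DAYS = ["Monday", "Tuesday", "Wednesday", "Thursday", "Friday", "Saturday", "Sunday"]


def days_containing(letter, days=DAYS):
    """Return the day names that contain `letter`, ignoring case."""
    target = letter.lower()
    return [day for day in days if target in day.lower()]


matches = days_containing("d")
print(len(matches), matches)  # 7, followed by the full list of days
```

Every name ends in "day," so the count is seven no matter how the question is phrased.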