Microsoft's Phi 4 reasoning model generates 56 sentences of reasoning before responding to a simple "Hi", developer Simon Willison found. Microsoft's Dimitris Papailiopoulos confirmed this behavior, known as "overthinking". He says it is a problem for simple tasks but intentional for complex ones, and he plans to address the issue. Microsoft released the open Phi 4 reasoning models in early May.

Phi 4's reasoning process, shown in a screenshot by Simon Willison.
Matthias is the co-founder and publisher of THE DECODER, exploring how AI is fundamentally changing the relationship between humans and computers.