The model learns that hedging is a signal of lower-quality output. This creates a systematic bias toward sounding certain.
When automating tasks with LLMs, the most exhausting part is not getting the AI to generate something. The painful part comes afterward, when the output has to be consumed by code: extra text gets mixed into the returned JSON.
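One way to cope with stray text around the JSON is to scan the reply for the first position where a valid JSON value starts and ignore everything else. The sketch below is only an illustration under that assumption; the function name `extract_first_json` and the sample reply are hypothetical and not taken from any particular library or model.

```python
import json


def extract_first_json(text: str):
    """Return the first JSON object or array embedded in `text`.

    Hypothetical helper: tries to decode a JSON value at every '{' or '['
    and returns the first one that parses. Raises ValueError if none does.
    """
    decoder = json.JSONDecoder()
    for i, ch in enumerate(text):
        if ch not in "{[":
            continue
        try:
            # raw_decode parses one JSON value starting at index i and
            # tolerates trailing text after it.
            value, _end = decoder.raw_decode(text, i)
            return value
        except json.JSONDecodeError:
            continue  # not valid JSON here, keep scanning
    raise ValueError("no JSON object or array found in text")


# Made-up example of a reply with chatter before and after the payload.
reply = (
    "Sure! Here is the JSON you asked for:\n"
    '{"status": "ok", "items": [1, 2]}\n'
    "Let me know if you need anything else."
)
data = extract_first_json(reply)
```

This approach has limits: bracketed text in the surrounding prose that happens to be valid JSON (for example `[1]`) would be returned instead of the intended payload, so validating the result against the expected keys is still advisable.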