Agreed. It just surprised me. I thought I would have to soften it, and instead I’m sensitive to it being too soft.
The models answer almost identically but a bit differently in tone. o3 is more terse and 4o is more narrative and as such has a bit more breadth. For the average audience I prefer 4o.
My only concern is that given the argument (response) is so precise it’s a bit long. But it will definitely teach people what we want them to learn.
Source date (UTC): 2025-08-05 20:18:48 UTC
Original post: https://twitter.com/i/web/status/1952826643501989993
Leave a Reply