Oh dear (vibey AI)
When we’re now measuring AI success against “vibes,” then we’ve gone off several rails. From Beni Swenson in Ars Technica:
Upon 4.5’s release, OpenAI CEO Sam Altman did some expectation tempering on X, writing that the model is strong on vibes but low on analytical strength. “It is the first model that feels like talking to a thoughtful person to me,” he wrote. He then added further down in his post, “a heads up: this isn’t a reasoning model and won’t crush benchmarks. it’s a different kind of intelligence and there’s a magic to it i haven’t felt before.”