Developers Say GPT-5 Is a Mixed Bag

Some developers say they’ve had largely positive experiences with GPT-5 so far. Jenny Wang, an engineer, investor, and creator of the personal styling agent Alta, told WIRED the model appears to be better at completing complex coding tasks in one shot than other models. She compared it to OpenAI’s o3 and 4o, which she uses…

Read More

GPT-5 Doesn’t Dislike You—It Might Just Need a Benchmark for Emotional Intelligence

Since the all-new ChatGPT launched on Thursday, some users have mourned the disappearance of a peppy and encouraging personality in favor of a colder, more businesslike one (a move seemingly designed to reduce unhealthy user behavior.) The backlash shows the challenge of building artificial intelligence systems that exhibit anything like real emotional intelligence. Researchers at…

Read More

Why I Founded Cognix: Creating Reliable AI Tools

Originally published on Hashnode The Trust Problem in AI Code GenerationAs developers, we’ve all been there. You ask an AI tool to generate some code, it spits out something that looks reasonable, and then… it breaks in production. Or worse, it fails silently, introducing subtle bugs that take hours to track down. I’ve watched countless…

Read More

Meet the early-adopter judges using AI

In this, Goddard appears to be caught in the same predicament the AI boom has created for many of us. Three years in, companies have built tools that sound so fluent and humanlike they obscure the intractable problems lurking underneath—answers that read well but are wrong, models that are trained to be decent at everything…

Read More