New study shows AI isn’t ready for office work

Found 66 days ago at Digital Trends

Mercor released a new benchmark called APEX Agents, and it is brutal. Unlike the usual tests that ask AI to write a poem or solve a math problem, this one uses actual queries from lawyers, consultants, and bankers. It asks the models to do complete, multistep tasks that require jumping between different types of information. The results? Even the absolute best models on the market, Gemini 3 Flash and GPT 5.2, couldn’t crack a 25% accuracy rate. Gemini led

Read the full article at Digital Trends
