The use of generative AI and large language models to automate and simplify tasks for people who work with PCs continues to grow. However, there is also a need to measure how well AI can actually accomplish those tasks. This week, Microsoft Research announced it has developed a benchmark specifically to test AI agents on Windows PCs. The benchmark, as revealed on Microsoft's GitHub page, is called Windows Agent Arena. This framework is designed to test how well and how quickly AI agents can interact wit