On Wednesday, Microsoft Research introduced Magma , an integrated AI foundation model that combines visual and language processing to control software interfaces and robotic systems. If the results hold up outside of Microsoft internal testing, it could mark a meaningful step forward for an all purpose multimodal AI that can operate interactively in both real and digital spaces. Microsoft claims that Magma is the first AI model that not only processes multimodal data like text, images, and...
