Microsoft built a fake marketplace to test AI agents — they failed in surprising ways

Researchers at Microsoft have developed a new simulation environment for testing AI agents, revealing surprising weaknesses in the current state of the art.