Microsoft’s AI strategy has, for the most part, been about using third-party large language models (LLMs). First this was mostly about using OpenAI’s GPT models, but more recently, this also included Anthropic’s Claude — and now Microsoft is using both of them in tandem to improve Copilot’s Researcher agent. The Researcher agent, which Microsoft recommends for problems where deeper reasoning or problem solving across multiple sources is necessary, now includes an optional ‘critique’ feature. With this, GPT will write the draft, which Claude then reviews. As Microsoft notes in its announcement, this review will include checks for “accuracy, completeness, and citation integrity.” In the future, Microsoft says, it may also give users the option to switch this flow around and have Claude write and GPT check.Claude and GPT: Better together?This workflow may feel a bit hacky at first, but it’s also not all that different from how developers sometimes use one model to write the code and another — from a different model family — to do the code review. At least in Microsoft’s benchmark, this approach also shows some clear advantages. Using Perplexity’s deep research DRACO benchmark, Anthropic’s Claude Opus 4.6 scores 42.7 by itself and 50.4 within Perplexity’s Deep Research mode. Copliot’s Researcher with Critique turned on scores 57.4, higher than any of the individual models. Credit: Microsoft.Sadly, we don’t have benchmarks for OpenAI’s GPT-5.4 yet, but chances are its score would be in the same range as Opus 4.6’s.Another new feature for research with Copilot is the so-called ‘council.’ This allows users to compare how different models handle a query side-by-side. Cowork is now in the M365 Frontier Program Recently, Microsoft also announced that it would bring Anthropic’s Claude Cowork tool — essentially Claude Code for knowledge workers who need long-running agents who can complete multi-step workflows — to Copilot. Imaginatively named Copilot Cowork, this feature is now available in the early-access Microsoft 365 Frontier program. Credit: Microsoft.Microsoft’s advantage here is that many of its customers would be worried about using Cowork if they had to upload their data to Anthropic. But since these companies already use Microsoft 365 and the Copilot Cowork data stays within their control (Cowork runs in a sandboxed cloud environment), this now enables them to take advantage of these new tools.“This isn’t about generating content or answers. It’s about taking real action – connecting steps, coordinating tasks, and following through across everyday workflows,” says Barton Warner, SVP ofEnterprise Technology at Capital Group. “Because Cowork operates on our enterprise data and within our security and risk boundaries, we can experiment, learn, and scale with confidence. That allows us to move faster and focus AI in places where it actually delivers value.” Why is Microsoft doing this?Having to bring in Anthropic to ship features like Cowork and Critique does say something about the position Microsoft finds itself in now: it is diversifying away from its early reliance on OpenAI, but in doing so, it is also deepening its relationship with yet another model provider. For customers paying premium prices for Copilot, one question on their minds is surely whether the value in using Microsoft’s services lies in the models it orchestrates or in the enterprise data and trust layer that makes those models useful in the first place. Microsoft is clearly betting it’s the latter, while for Anthropic, this partnership is yet another step in its play to become the AI vendor for the enterprise.When Microsoft first announced Cowork, its president of business applications and agents Charles Lamanna noted that “it is this multi-model advantage that makes Copilot different.” If Microsoft had its own frontier models, it would likely take a different approach, but as things stand, this is the best approach it can take.The post Microsoft’s Copilot makes Anthropic’s Claude and OpenAI’s GPT team up appeared first on The New Stack.