BlogNews
Launch App

News / AI Tools & Platforms

Microsoft Copilot Adds Multi-Model AI Comparison

Daniel Mercer

Written by Daniel Mercer

Mon Apr 06 2026

Compare GPT and Claude side by side inside Chatly.

Microsoft Copilot Introduces Multi-Model Intelligence

Microsoft has introduced two new multi-model AI capabilities, Critique and Council, to its Microsoft 365 Copilot Researcher agent, announced on March 30, 2026.

What Is the Critique Feature?

The headline feature, "Critique," equips Copilot's Researcher agent to draw on outputs from both OpenAI's GPT and Anthropic's Claude models for each response.

  • GPT is responsible for generating the initial answer
  • Claude evaluates that response for accuracy and overall quality

Then the final output gets delivered to the user.

Microsoft said it plans to further evolve this system into a bi-directional workflow, where both models can review and refine each other's outputs. The Critique feature is built around a rubric-based evaluation. It does not take up the rewriting task, rather It provides feedback for the researcher to improve the work.

It will be the default experience in Researcher when "Auto" is selected in the model picker. The review focuses on three key dimensions:

  • Source reliability: Verifying where information comes from
  • Report completeness: Ensuring nothing critical is missing
  • Citation accuracy: Making sure all key claims are backed by verifiable sources

The results show that researchers now score 13.8% higher on the DRACO (Deep Research Accuracy, Completeness, and Objectivity) benchmark, the industry standard for deep research quality.

What Is the Council Feature?

Council takes a different approach.

  • Instead of one model reviewing another, it runs an Anthropic and OpenAI model simultaneously, with each producing a complete, standalone report.
  • Once both reports are generated, a dedicated judge model evaluates them and creates a distilled summary highlighting where the models meaningfully agree or diverge.

CEO Satya Nadella announced the features on X, writing: "New in M365 Copilot Council. You can run multiple models on the same prompt at the same time, so you can see where they align and diverge."

If you want to run multiple AI models side by side, Chatly also lets you access GPT, Claude, and other leading models in one workspace.

Copilot Cowork Now in Early Access

Alongside the Researcher updates, Microsoft announced that Copilot Cowork is now available through its Frontier early access program.

Cowork handles long-running, multi-step tasks autonomously, covering functions like:

  • Calendar management
  • Budget reviews
  • Meeting preparation

Nicole Herskowitz, corporate vice president of Microsoft 365 and Copilot, said: "Having various different models from different vendors in Copilot is highly attractive, but we're taking this to the next level, where customers actually get the benefits of the models working together."

Adoption Still a Challenge

As of January 2026, Microsoft reported 15 million paid Copilot seats, roughly 3.3% of its 450 million commercial Microsoft 365 users. Features like Critique and Council appear designed to accelerate adoption by addressing the most common objection to AI-assisted research: trustworthiness.

Suggested Reads

  • OpenAI Closes Record $122 Billion Funding Round at $852 Billion Valuation
  • Anthropic Launched Claude Opus 4.5 — New Flagship Model for Coding and Complex AI Workflows
  • OpenAI Prepares to Make a Historic Public Market Debut, Eyeing Up to $1 Trillion Valuation
  • ChatGPT Voice Mode Update for Seamless Integration of Voice, Text & Live Maps
Copilot Goes Multi-Model

Copilot Goes Multi-Model

Microsoft upgraded Copilot's Researcher agent with Critique and Council, combining GPT and Claude to deliver more accurate, verified, and comparable AI research results.

Ask Chatly AI Chat to Learn More

Frequently Asked Questions

Learn more about this new feature by reading what other people have asking.

More topics you may like

OpenAI’s IndQA Benchmark Puts Indian Languages & Culture in Focus for AI

OpenAI’s IndQA Benchmark Puts Indian Languages & Culture in Focus for AI

Faisal Saeed

Faisal Saeed

11 Best ChatGPT Alternatives in 2026 (Tested, Compared & Priced)

11 Best ChatGPT Alternatives in 2026 (Tested, Compared & Priced)

Muhammad Bin Habib

Muhammad Bin Habib

28 Best AI Tools for Students in 2025 – The Complete AI-Powered Academic Success Guide

28 Best AI Tools for Students in 2025 – The Complete AI-Powered Academic Success Guide

Muhammad Bin Habib

Muhammad Bin Habib

21 Journaling Techniques That Actually Work in 2025

21 Journaling Techniques That Actually Work in 2025

Muhammad Bin Habib

Muhammad Bin Habib

Footer Background Gradient

A product by

Vyro

Trusted by thousands of professionals worldwide.

Get Started for Free

Features

AI ChatAI Search EngineAI Image GeneratorAI Document GeneratorAI Presentation Maker

AI Models

GPT-5.4Claude Opus 4.7Gemini 3.1 ProGemini 3 ProGemini 3 FlashGPT-5.2 ProGPT-5.2GPT-5GPT-5.1Claude Opus 4.6Claude Sonnet 4.6Gemini 3.1 Flash LiteSeedream 5.0 LiteIdeogram 3.0Nano BananaNano Banana 2Seedream 4.030+ AI Models

AI Translation Apps

Translate English to ChineseTranslate English to SpanishTranslate English to JapaneseTranslate English to UrduTranslate English to HindiTranslate Chinese to English

AI Apps

AI CoderCitation GeneratorGPT ChatAI Story GeneratorAsk AIAI Math SolverPhysics SolverChemistry SolverChat PDFSummary GeneratorParaphrasing ToolAI Humanizer

Blogs

ChatGPT AlternativesGPT-5.2 OverviewGemini 2.5 Pro vs Gemini 3 Pro: Cost AnalysisJSON Prompting GuideBest System PromptsWhat is Vibe Coding?Create Presentations Using AIClaude Sonnet 4.6 OverviewFrom Prompt to Deck in 30 MInutes9 Best AI Image Generation Models

Company

Help & SupportPlans & PricingChatly Help CenterBlogNews

Legal

Privacy PolicyTerms & Conditions
ChatlyTry NowChatly