Xiaomi and MiniMax both unleash their ultimate moves, signaling the start of the Agent Pricing War.

By: blockbeats|2026/03/20 13:00:01

On March 18 and 19, two Chinese companies successively released their respective Agent-oriented large models. The domestic AI startup MiniMax launched M2.7, and Xiaomi's large model team MiMo introduced V2-Pro. Both models entered the global top tier in the Agent benchmark, but their API output pricing is 1/21 and 1/8 of Claude Opus 4.6, respectively.

Both companies played their cards in the same week, but with completely different hands. They represent two completely different technical paths, betting on two futures of the Agent era.

Same Exam, 1/17 Tuition Fee

First, let's look at the most intuitive comparison.

Xiaomi and MiniMax both unleash their ultimate moves, signaling the start of the Agent Pricing War.

According to OpenRouter and various company official pricing pages, based on API output price (per million tokens), MiniMax M2.7 is $1.2, and MiMo-V2-Pro is $3. As a reference, the output price for Claude Opus 4.6 is $25, GPT-5.2 is $14, and Claude Sonnet 4.6 is $15.

The price difference is an order of magnitude, but the performance difference is not. In SWE-bench Verified (the current mainstream benchmark for measuring code engineering capability), MiMo-V2-Pro scored 78%, Sonnet 4.6 was 79.6%, a difference of less than two percentage points. M2.7's SWE-Pro score is 56.22%, on par with GPT-5.3-Codex. In VIBE-Pro (end-to-end project delivery capability), M2.7 scored 55.6%, approaching the level of Opus 4.6.

The focus of this chart is not on who is higher or lower—the benchmark systems of various companies are not entirely aligned, so direct comparisons should be cautious. The focus is on the "price-performance scissor difference": domestic Agent models have squeezed into the same performance band, but are in completely different price ranges.

Trillion Parameters vs. Self-evolution

Price is just the surface. The two companies have presented two completely different sets of trump cards.

MiMo-V2-Pro follows the "go big or go home" route. According to Xiaomi's official announcement, V2-Pro has over 1 trillion total parameters, 42B activation parameters, and supports an ultra-long context of 1 million tokens. Its core innovation is the Hybrid Attention mixed attention mechanism, adjusting the ratio of Sliding Window Attention (SWA) to Global Attention (GA) to 7:1—its predecessor V2-Flash was 5:1. This architecture makes the model more stable in scenarios where long documents are processed and multiple tool parallel calls in the Agent scene. In PinchBench (Agent tool invocation capability assessment), MiMo-V2-Pro scored 84%.

M2.7 took a completely different path. According to MiniMax's official tech blog post on March 18, M2.7's parameter count was not disclosed, but it demonstrated a "self-iterative evolution" mechanism: the model autonomously ran over 100 optimization loops, including analyzing failure trajectories, planning modifications, modifying its own code architecture, running evaluations, and looping again, ultimately achieving a 30% performance improvement on an internal evaluation set. In the MLE Bench Lite (Machine Learning Contest Difficulty Assessment), out of 22 challenging problems, M2.7 secured 9 gold, 5 silver, and 1 bronze, with an average medal rate of 66.6%.

From five dimensions, the two paths are aimed in completely different directions: MiMo-V2-Pro clearly dominates in context length and code engineering dimensions, while M2.7 widens the gap in office automation and self-iterative capability. According to MiniMax's same tech blog post, M2.7 scored ELO 1495 on GDPval-AA (Office Document Processing Evaluation), ranking first among open-source models, and maintained a 97% skill compliance rate in the MM-Claw test covering over 40 complex skills.

Four Versions in Five Months

Not only are the technical paths of the two companies different, but their iteration rhythms are also completely different.

According to public release records, from the release of M2 in October 2025 to the release of M2.7 in March 2026, MiniMax iterated four versions within five months, averaging a major version every 49 days. The gap between M2.5 and M2.7 was only about 30 days.

The rhythm of Xiaomi's MiMo is different: MiMo-7B was released in April 2025 (an open-source inference model with 7B parameters), V2-Flash was released in December of the same year (with 309B total parameters), and V2-Pro was released in March 2026 (with 1T total parameters). The parameter scale between each generation is much larger, but the intervals between versions are also longer.

MiniMax chose small, frequent steps, with each iteration not making big leaps but at a very high frequency. M2.7's self-iterative mechanism itself is designed for "continuous evolution." Xiaomi opted for a more impactful approach, with each version featuring significant changes in parameter scale and architecture.

-- Price

Anonymous 8 Days, Summit OpenRouter

In addition to the technical roadmap, Xiaomi's release strategy has also broken industry conventions.

According to Reuters, on March 11, an anonymous model named Hunter Alpha appeared on the world's largest API aggregation platform, OpenRouter. No brand endorsement, no product launch event, no technical blog. Its API pricing was extremely low, yet its performance was surprisingly strong.

The community began to speculate about its origins. According to Republic World and several tech media reports, the most mainstream speculation was DeepSeek V4, as MiMo team leader Luo Fuli had previously worked on research at DeepSeek. The number of API calls quickly skyrocketed, with the total number of calls during the anonymous period exceeding 1 trillion tokens, reaching the top of the OpenRouter weekly rankings.

Early on March 19, Xiaomi revealed: Hunter Alpha is indeed MiMo-V2-Pro. According to the same Reuters report, Xiaomi's Hong Kong stock once surged by 5.8% after the revelation.

This is the first time a domestic large-scale model has proven itself on a global platform through purely blind testing. Not relying on the brand, not relying on publicity, it took 8 days to let developers vote with their feet.

The various iterations of Uniswap are one of the sources of vitality in the DeFi market, but since 2023, Uniswap has not proposed any substantial innovations, instead adhering to traditional business explorations in application chains, Launchpads, etc., leading to a slump in token prices and market ...

What is the key to competition in crypto banking?

Digital banks, crypto cards, wallets, super apps, and DeFi protocols are all converging towards the same goal: to become the primary gateway for your savings, spending, earning, and transferring in the new era.

The flow of stablecoins and the spillover effects in the foreign exchange market

Research has found that an exogenous increase in net inflows of stablecoins significantly widens the price deviation between stablecoins and traditional foreign exchange, leads to depreciation of the local currency, and worsens the financing conditions for synthetic dollars (i.e., increases the doll...

After two years, Hong Kong's first batch of stablecoin licenses finally issued: HSBC, Standard Chartered make the cut

The regulated entity is set to launch a stablecoin in the first half of this year.

The person who helped TAO rise by 90% has now single-handedly crashed the price again today

As long as people are around, the story continues. But once they're gone, you may not even find a worthy opponent to play against.

3-Minute Guide to Participating in the SpaceX IPO on Bitget

Bitget IPO Prime brings a rare opportunity for global users to participate in world-class unicorn IPOs, allowing ordinary users to equally access the potential economic benefits of top-tier IPOs.

Top 5 Cryptos to Buy in 2026 Q1: A ChatGPT Deep Dive Analysis

Explore the top 5 cryptos to buy in Q1 2026 including BTC, ETH, SOL, TAO, and ONDO. See price outlooks, key narratives, and institutional catalysts shaping the next market move.

How to Earn $15,000 with Idle USDT Before Altcoin Season 2026

Wondering if altcoin season is coming in 2026? Get the latest market update, and learn how to turn your idle stablecoins waiting for entry into extra rewards up to 15,000 USDT.

Can You Win Joker Returns Without Large Trading Volume? 5 Mistakes New Players Make In WEEX Joker Returns Season 2

Can small traders win WEEX Joker Returns 2026 without huge volume? Yes—if you avoid these 5 costly mistakes. Learn how to maximize card draws, use Jokers wisely, and turn small deposits into 15,000 USDT rewards.

Altcoin Season 2026: 4 Stages to Profit (Before the Crowd FOMO In)

Altcoin Season 2026 is starting — discover the 4 key stages of capital rotation (from ETH to PEPE) and how to position before the peak. Learn which tokens will lead each phase and avoid missing the rally.

Will Alt season come in 2026? 5 Tips to Spot the Next 100x Crypto Opportunities

Will altcoin season arrive in 2026? Discover 5 rotation stages, early signals smart traders watch, and the key crypto sectors where the next 100x altcoin opportunities may emerge.

The bear market has arrived, and cryptocurrency ETF issuers are also getting involved

Today's listing of MSBT is the latest landmark in this restructuring, with the influx of institutions accelerating the embrace of cryptocurrencies by traditional finance, but also diluting the liquidity of the native market.

The richest man had a quarrel with his former boss

It has become a huge uproar, as several top figures in the Chinese cryptocurrency circle have engaged in intense verbal battles and confrontations in the past 24 hours.

BTC Firm Above 70K! Saylor’s "Institutional Logic" vs. Moon’s "Retail Faith": Who is Really Harvesting the Market?

Bitcoin is holding firm above the $70,000 support level following a massive short squeeze that liquidated $427 million. As the "Four-Year Cycle" narrative shifts, the market is split: Michael Saylor’s cold, institutional "indiscriminate stacking" vs. Carl Moon’s high-energy retail "hopium." This article decodes these two polar-opposite strategies for the 2026 bull run and reveals how WEEX’s institutional-grade liquidity and AI trading tools empower every type of investor to convert market volatility into profit.

The Girl Who Created the SBTI Test: A Story of a Doomed Cyber Love, an E-Widow Ratfolk

The usefulness of the useless is the highest usefulness.

B.AI Officially Launched: Building AI Agent Financial Bedrock Platform, Driving AGI Era Business Underlying Logic

B.AI has built a complete ecosystem from the AI Service Gateway to the AI Agent Financial Base: The LLM permissionless gateway integrates top global models and a unified API in one stop; The AI Agent infrastructure, through protocols such as x402 and 8004, empowers the AI Agent with an independent wallet and autonomous transactions.

B.AI Officially Launched: Breaking Down A2A Collaboration Barriers to Unlock the Smart Body Economy's Full Potential

With its Multi-Model Intelligent Routing breaking the compute bottleneck on one hand, and the integration of x402, 8004, Skills, and BAIClaw on the other hand, B.AI has seamlessly connected the full-stack business loop of AI Agents from large-scale intelligent scheduling to financial operational capability, accelerating the arrival of the AGI era.

We helped Xu Mingxing write a book called "<OK Life>".

That was a small-town youth who had lost three times, lost 2 million yuan selling a Beijing apartment, always felt like he was about to be spit out by Beijing, and on the screen, encountered something that was said to be unclaimable by anyone.