Pricing audit24 min readReviewed Apr 20, 2026

Chinese AI Coding Plan Pricing in 2026: All 7 Providers, Domestic vs Overseas, Benchmarks, and Honest Buying Advice

An engineer who burns through over a billion tokens a month went page-by-page through every Chinese AI coding plan and its overseas counterpart. This guide covers real pricing tables for all seven providers, domestic-vs-international quota gaps, benchmark scores that matter, and opinionated buying advice for every budget tier from ¥29 to ¥870 per month.

Published Apr 19, 2026Updated Apr 20, 2026
  • Cheapest mainland entry: MiniMax Starter at ¥29/month. Cheapest “AI work style” bundle: Kimi Andante at ¥39/month. Cheapest no-5-hour-window plan: MiMo Lite at ¥39/month. These are different products — pick the one that matches your workflow, not just the lowest number.
  • Price hikes are accelerating: Zhipu GLM raised prices 30%+ in February 2026 alone, Alibaba Bailian killed its ¥40 Lite tier entirely, and Volcengine replaced first-purchase discounts with daily 10:30 AM flash sales. The cheap plans are disappearing one by one.
  • Counter-intuitive finding: MiniMax gives overseas users 2.5x the quota of domestic users at the same tier. StepFun Flash Plus is cheaper overseas ($9.99) than in mainland China (¥99). Sometimes the grass really is greener on the international side.
  • Model-first, plan-second: GLM-5.1 tops SWE-bench Pro at 58.4 (beating Claude Opus 4.6 and GPT-5.4). Kimi K2.5 hits 76.8-80.0% on SWE-bench Verified. Pick the model that fits your tasks, then find the plan that unlocks it at the right price.
Quick note: This guide is based on public docs and release pages, but you should still verify current pricing, limits, supported tools, and region-specific billing on the official source before you pay, subscribe, or integrate.

Token is the new data plan

Remember how mobile data plans evolved? In 2009, you paid 100 yuan for 1 GB and thought it would last forever. By 2013, the same money barely bought a few gigabytes. Then came “unlimited” plans — except they throttled you after a threshold. By 2020, truly unlimited data was gone, and everyone was counting gigabytes again.

Token pricing is following the exact same trajectory. From late 2024 through mid-2025, providers slashed per-million-token prices to fractions of a yuan — cheap enough that you did not bother checking usage. From late 2025 onward, the wind shifted: purchase limits, price hikes, discontinued low tiers, and first-purchase discounts that turned from “buy anytime” to “10:30 AM daily flash sale.”

You thought tokens were air. They are actually data plans — the free stuff is disappearing, and the cheap stuff is getting more expensive.

I am an engineer who consumes over a billion tokens per month. From Claude Code to Cursor, from agent swarms to automated pipelines, tokens are oxygen — you do not notice them until they run out. What follows is my honest, page-by-page audit of every major Chinese AI coding plan and its overseas counterpart, written for fellow engineers who need to make smart buying decisions right now.

The 30-second summary

Look at that table carefully. Some providers charge more overseas but give you more quota. Others are simply more expensive internationally with no upside. The right answer depends on your payment method, your network latency tolerance, and how much quota you actually burn per session.

All seven providers at a glance — domestic floor, overseas floor, and the surprising price gap
ProviderCheapest domestic tierCheapest overseas tierIs overseas worth it?
MiniMax¥29/month$10/monthOverseas gives 2.5x the quota per tier
Kimi (Moonshot)¥39/month$19/monthDomestic is cheaper
Xiaomi MiMo¥39/month~$5/monthRoughly at parity
Volcengine (ByteDance)¥40/month$10/monthDomestic is cheaper
Zhipu GLM¥49/month$18/monthOverseas has no purchase limits
StepFun¥49/month$6.99/monthPlus tier is cheaper overseas
Alibaba Bailian¥200/month$50/monthDomestic is cheaper
BuyGLM shows package prices in USD. When a source page is published in CNY, the displayed value uses a fixed 1 USD = 8 CNY conversion and should still be checked against the live vendor page before payment.

Zhipu GLM: two rounds of price hikes, and domestic users still get a bargain

At an 8:1 exchange rate, the overseas Max tier costs roughly ¥1,280/month while the domestic Max is only ¥469. Same model, same quotas — mainland users pay less than half what overseas users pay.

But the overseas version has one crucial advantage: no purchase limits. The domestic Max tier frequently sells out, with only about 20% of quota released daily at 10:00 AM Beijing time. The overseas DevPack is almost always in stock. And with annual billing plus discount codes, the effective overseas price can drop significantly.

Zhipu GLM domestic coding plan pricing page
Official domestic plan tiers on bigmodel.cn Source: Zhipu GLM domestic plans.
Zhipu GLM international Z.ai coding plan pricing page
Overseas DevPack tiers on z.ai Source: Z.AI DevPack pricing.
Zhipu GLM Coding Plan pricing — domestic (bigmodel.cn) vs overseas Z.ai (z.ai)
TierDomestic (CNY/month)Overseas Z.ai (USD/month)Overseas in CNY (approx.)
Lite¥49$18~¥144
Pro¥149$72~¥576
Max¥469$160~¥1,280
BuyGLM shows package prices in USD. When a source page is published in CNY, the displayed value uses a fixed 1 USD = 8 CNY conversion and should still be checked against the live vendor page before payment.

GLM gotchas: peak-hour multipliers and the Lite model trap

Three things catch people off guard with Zhipu GLM:

  • GLM-5 and GLM-5.1 are restricted to Pro tier and above. Lite subscribers only get GLM-4.7. If you bought Lite expecting the flagship model, you will be disappointed.
  • Peak-hour quota multiplier: from 14:00 to 18:00 UTC+8, GLM-5 consumes 3x quota per request. Off-peak is 2x. You think you have 400 prompts, but during peak hours you actually get about 133. A temporary off-peak promotion (1x multiplier) was running through late April 2026, but this is not guaranteed to last.
  • Domestic Max tier has severe availability limits. Only about 20% of daily quota opens at 10:00 AM Beijing time, and it frequently sells out within minutes. Set your alarm or buy overseas.
GLM coding plan models and quota details
Model access and quota consumption rules for each GLM tier Source: Zhipu GLM quota docs.
GLM domestic purchase limit notice
Max tier purchase restrictions on the domestic platform Source: Zhipu GLM purchase limits.

Kimi (Moonshot AI): not just a coding tool — an AI work style

Like GLM, domestic pricing is far below overseas. Domestic Moderato at ¥79 is roughly half the price of the overseas equivalent at $19 (about ¥152). The overseas-only Vivace tier ($199/month) unlocks 4 concurrent agents plus 8 sub-agent swarms — there is no domestic equivalent at that scale.

But here is what makes Kimi different from every other provider on this list: the ¥39 Andante tier includes 10 Agent sessions per month, 10 deep research sessions, 10 PPT generations, plus Kimi Code programming quota. For people who are not full-time programmers but occasionally need to write code, do research, or create presentations, this “everything bundle” is more practical than any pure coding plan.

Kimi is not selling a coding tool. It is selling an AI work style. That positioning separates it from every other provider. My team has product managers who use Kimi all day — and they compete with the engineers for Agent quota.

Kimi domestic membership tiers
Official domestic membership pricing on kimi.com Source: Kimi membership.
Kimi international membership tiers
Overseas membership pricing on kimi.com/en Source: Kimi international membership.
Kimi domestic membership quota details
Quota breakdown for each domestic Kimi tier Source: Kimi quota details.
Kimi international membership quota details
Quota breakdown for each overseas Kimi tier Source: Kimi international quotas.
Kimi credits sharing across features
How Kimi shares credits between Agent, research, PPT, and Code Source: Kimi credits sharing.
Official Kimi K2.5 tech blog screenshot

Official screenshot

Kimi K2.5 is best introduced from the technical blog, not from social summaries

The public K2.5 tech blog already combines the multimodal upgrade story, benchmark tables, and Agent Swarm explanation in one official page.

  • Best official visual for the K2 to K2.5 family transition.
  • Useful for readers who need proof that Agent Swarm and MoonViT are part of the public story.

Source: Official Kimi K2.5 tech blog.

Kimi membership pricing — domestic (CNY) vs overseas (USD)
TierDomestic (CNY/month)Overseas (USD/month)Overseas in CNY (approx.)
Andante¥39Domestic only
Moderato¥79$19~¥152
Allegretto¥159$39~¥312
Allegro¥559$99~¥792
Vivace$199Overseas only (~¥1,592)
BuyGLM shows package prices in USD. When a source page is published in CNY, the displayed value uses a fixed 1 USD = 8 CNY conversion and should still be checked against the live vendor page before payment.

MiniMax: the absolute cheapest entry — but the domestic version is quota-capped

On the model side, M2.7 scored 56.22% on SWE-Pro and 57.0% on Terminal-Bench 2.0. The older M2.5 was even more impressive: SWE-bench Verified 75.8%-80.2% (official leaderboard 75.80%, self-reported maximum 80.2%), placing it among the very best open-source models.

What makes MiniMax unique is full-modal bundling — images, voice, music, and video are all included in the same plan at no extra cost. Nobody else offers that. ¥29/month does not just buy you a coding tool; it buys you an AI multimedia studio.

MiniMax domestic Token Plan pricing page
Official domestic plan tiers on platform.minimaxi.com Source: MiniMax domestic plans.
MiniMax domestic Token Plan quota details
Quota breakdown for domestic MiniMax tiers Source: MiniMax domestic quotas.
MiniMax international Token Plan pricing page
Official overseas plan tiers on platform.minimax.io Source: MiniMax international plans.
MiniMax international Token Plan quota details
Quota breakdown for overseas MiniMax tiers — note the 2.5x vs domestic Source: MiniMax international quotas.
Official MiniMax Token Plan overview screenshot

Official screenshot

MiniMax clearly positions Token Plan as the current subscription route

The official Token Plan overview is the best first stop for public-facing articles because it explains the route before readers ever hit a pricing table.

  • Useful for queries around MiniMax Token Plan, MiniMax Coding Plan, and MiniMax subscription.
  • Helps clarify that Token Plan is the current public route readers should treat as primary.

Source: MiniMax Token Plan overview.

Official MiniMax Token Plan pricing tables screenshot

Official screenshot

MiniMax publishes a strong public pricing table for monthly and Highspeed tiers

This pricing-table view is one of the best official screenshots in the category because readers can verify standard and Highspeed tiers directly from the source page.

  • Shows the monthly standard tiers and the Highspeed plan table in one view.
  • A good visual checkpoint before repeating plan prices or 5-hour request limits in an article.

Source: MiniMax Token Plan pricing.

MiniMax Highspeed Token Plan pricing
TierDomestic (CNY/month)Overseas (USD/month)
Plus Highspeed¥98$40
Max Highspeed¥199$80
Ultra Highspeed¥899$150
BuyGLM shows package prices in USD. When a source page is published in CNY, the displayed value uses a fixed 1 USD = 8 CNY conversion and should still be checked against the live vendor page before payment.

Xiaomi MiMo: finally, a plan without the 5-hour window trap

The overseas starting price of approximately $5/month may be the cheapest coding plan entry point anywhere in the world. Note that Xiaomi has said “global launch” but has not formally published overseas pricing — treat the USD numbers as estimates.

MiMo-V2-Pro is the flagship model: over 1 trillion total parameters (MoE architecture, 42B activated), with a 1M context window. Xiaomi claims coding performance surpasses Claude 4.6 Sonnet and approaches Opus 4.6, though independent third-party benchmark verification is still pending. PAYG API costs are reportedly about 20% of competing models.

I know several engineers who do agent debugging for a living, and they love the no-window model. Their work pattern is “sit still all day, then suddenly get three hours of intense inspiration.” MiMo Credits match that rhythm perfectly.

Good product design is not about giving you more — it is about not setting limits. MiMo understood this.

Xiaomi MiMo domestic Token Plan pricing page
Official domestic Token Plan tiers on the Xiaomi AI platform Source: MiMo Token Plan.
MiMo Artificial Analysis benchmark ranking
MiMo position on the Artificial Analysis leaderboard Source: Artificial Analysis.
MiMo benchmark scores overview
Detailed benchmark results for MiMo-V2-Pro Source: Xiaomi MiMo benchmarks.
MiMo usage data on OpenRouter
Anonymous usage statistics for MiMo on OpenRouter Source: OpenRouter.
Official MiMo-V2-Pro release page screenshot

Official screenshot

MiMo-V2-Pro now has a strong official release page with buyer-relevant claims

The Xiaomi release note is the best page to anchor the 1T / 42B / 1M context story before you move into the pricing and integration docs.

  • Useful for replacing older beta-era summaries with official product language.
  • Pairs naturally with the pricing and tools overview pages for route clarity.

Source: Official MiMo-V2-Pro release note.

Official MiMo-V2-Pro Artificial Analysis Intelligence Index image

Official image

Xiaomi publishes the Artificial Analysis ranking image directly on the official MiMo-V2-Pro page

The buyer-facing MiMo page does not only describe the model in prose. It also exposes the ranking visual that Xiaomi uses to support the “8th worldwide, 2nd among Chinese LLMs” positioning.

  • Useful when readers want a traceable official image instead of a copied leaderboard screenshot from social posts.
  • Works well alongside the pricing page because it keeps performance proof and route proof on official Xiaomi surfaces.

Source: Official MiMo-V2-Pro page.

Xiaomi MiMo Token Plan pricing — domestic and estimated overseas
TierDomestic (CNY/month)Overseas (USD, estimated)First-purchase 12% off (CNY)Credits
Lite¥39~$5¥34.3260 million
Standard¥99~$14¥87.12200 million
Pro¥329~$44¥289.52700 million
Max¥659~$88¥579.921.6 billion
BuyGLM shows package prices in USD. When a source page is published in CNY, the displayed value uses a fixed 1 USD = 8 CNY conversion and should still be checked against the live vendor page before payment.

StepFun (Jieyu Xingchen): the one where overseas is actually cheaper

”Promotional period” is the countdown timer on every discount. You think you are gaming the system. They are building your habits.

  • All prices are currently labeled “promotional period pricing” — StepFun has not promised these rates are permanent.
  • Billing is prompt-based, where 1 prompt equals roughly 15-20 model calls. The prompt count looks low, but each prompt does more work.
  • Step Plan is tightly coupled to the OpenClaw ecosystem. If you are a heavy OpenClaw user, the value proposition is strong. What happens after the promotional period ends is an open question.
StepFun Step Plan domestic pricing page
Official domestic Step Plan tiers on platform.stepfun.com Source: StepFun domestic plans.
StepFun Step Plan domestic quota details
Quota breakdown for domestic StepFun tiers Source: StepFun domestic quotas.
StepFun Step Plan international pricing page
Overseas Step Plan tiers on platform.stepfun.ai Source: StepFun international plans.
StepFun Step Plan international quota details
Quota breakdown for overseas StepFun tiers Source: StepFun international quotas.
Official Step 3.5 Flash GitHub page screenshot

Official screenshot

Step 3.5 Flash exposes architecture and benchmark rows directly on the official repo page

For public writing, the GitHub page is one of the best official assets because it makes the MoE architecture, benchmark table, and open-weight route visible without guesswork.

  • Strong image for explaining why Step 3.5 Flash is framed around speed and decoding efficiency.
  • Useful when readers want to verify that the model is an Apache 2.0 open release.

Source: Official Step 3.5 Flash GitHub repository.

StepFun Step Plan pricing — domestic (CNY) vs overseas (USD), with quotas
TierDomestic (CNY/month)Overseas (USD/month)Overseas in CNY (approx.)Prompts per 5h
Flash Mini¥49$6.99~¥56~100
Flash Plus¥99$9.99~¥80~400
Flash Pro¥199$29~¥232~1,500
Flash Max¥699$99~¥792~5,000
BuyGLM shows package prices in USD. When a source page is published in CNY, the displayed value uses a fixed 1 USD = 8 CNY conversion and should still be checked against the live vendor page before payment.

Alibaba Bailian: the ¥40 entry tier vanished overnight

Overseas Pro at $50 (about ¥400) is double the domestic ¥200 price. Like GLM and Kimi, domestic users get a substantial price advantage.

But Pro has something nobody else offers: multi-model aggregation. One subscription gives you access to Qwen3-Coder-Next, Kimi-K2.5, GLM-5, and MiniMax-M2.5. It is like having one SIM card that works on every carrier network simultaneously. Plus it supports both OpenAI-compatible and Anthropic-compatible endpoints.

When every other provider is selling their own model, Bailian chose a different path: selling a model supermarket. Whether that works long-term depends on whether the shelves stay stocked with top-tier models.

  • Critical technical gotcha: Bailian requires a dedicated Coding Plan API key (prefixed sk-sp-...). Using a regular API key will trigger pay-as-you-go billing, which can cost 5x more than the plan. I have seen someone burn through over ¥1,000 in a single month because they used the wrong key.
  • The entry barrier jumping from ¥40 to ¥200 is genuinely hostile to students and newcomers.
  • Multi-model access in one plan is excellent for anyone doing comparative model evaluation — no more juggling five different accounts and API keys.
Alibaba Bailian domestic coding plan pricing page
Official domestic plan tiers (note: Lite is discontinued) Source: Alibaba Bailian domestic plans.
Alibaba Bailian domestic token plan details
Detailed quota and billing mechanism for Bailian Coding Plan Source: Bailian token plan.
Alibaba Bailian token plan consumption mechanism
How Bailian counts and deducts tokens from your plan quota Source: Bailian consumption mechanism.
Alibaba Bailian international coding plan pricing page
Overseas plan tiers on alibabacloud.com Source: Bailian international plans.
Official Alibaba Model Studio pricing page screenshot for Qwen3.6-Plus

Official screenshot

Qwen3.6-Plus already has route-specific public pricing on the Model Studio side

The Alibaba pricing page is the safest public surface for route-aware Qwen3.6-Plus billing because it distinguishes mainland and international rows directly on the official page.

  • Useful when articles need one source-backed image for regional pricing differences.
  • Pairs well with benchmark claims from the Qwen 3.6 release page.

Source: Alibaba Cloud Model Studio pricing.

Alibaba Bailian Coding Plan pricing — including the discontinued Lite tier
TierDomestic (CNY/month)Overseas (USD/month)Requests per 5hRequests per month
~~Lite~~ (discontinued)~~¥40~~~~1,200~~~~18,000~~
Pro¥200$506,00090,000
BuyGLM shows package prices in USD. When a source page is published in CNY, the displayed value uses a fixed 1 USD = 8 CNY conversion and should still be checked against the live vendor page before payment.

Volcengine Ark (ByteDance): the design-to-code frontend specialist

Doubao Seed 2.0 Code deserves special attention. It swept all five gold medals in ICPC programming competitions. The community has positioned it as “the visual-driven frontend weapon” — if your workflow is “design mockup to working code,” this is currently the best value option.

Volcengine also fell into the first-purchase discount trap: Lite was ¥9.9 and Pro was ¥49.9 for your first purchase, but from March 13 onward, those became daily 10:30 AM limited flash sales. Yes, you read that right — you now set an alarm to抢 (snatch) token plans. Concert tickets, Moutai liquor, and now AI subscriptions. This is what 2026 looks like for programmers.

Pro subscriptions also include ArkClaw Lite. Users already in the ByteDance ecosystem (TRAE, Doubao App) will find the integration natural. As with Bailian, make sure you use the dedicated Code Plan URL — traffic through the generic API endpoint is not counted toward your plan quota.

Volcengine Ark domestic coding plan pricing page
Official domestic Code Plan tiers on volcengine.com Source: Volcengine domestic plans.
Volcengine Ark domestic supported models
Available models for the domestic Ark Code Plan Source: Volcengine supported models.
Volcengine Ark international BytePlus coding plan pricing page
Overseas Code Plan tiers on byteplus.com Source: BytePlus international plans.
Volcengine Ark international BytePlus supported models
Available models for the international BytePlus Code Plan Source: BytePlus supported models.
Official Seed2.0 model page screenshot

Official screenshot

The official Seed2.0 page is the cleanest public source for the model-family story

ByteDance's Seed site makes the Pro, Lite, and Mini lineup, benchmark breadth, and Ark access links visible in one place. That is stronger than building the article around unofficial pricing screenshots.

  • Best official visual for the Seed2.0 family overview.
  • Useful for replacing reseller-style descriptions with Seed's own model framing.

Source: Official Seed2.0 model page.

Volcengine Ark Code Plan pricing — domestic and overseas (BytePlus)
TierDomestic (CNY/month)Overseas BytePlus (USD/month)Requests per 5hRequests per month
Lite¥40$10 (~¥80)1,20018,000
Pro¥200$50 (~¥400)6,00090,000
BuyGLM shows package prices in USD. When a source page is published in CNY, the displayed value uses a fixed 1 USD = 8 CNY conversion and should still be checked against the live vendor page before payment.

Benchmark comparison: the models behind the plans

Choose your model before you choose your plan. Choose your model based on what you actually do. Comprehensive bug-fixing? Kimi K2.5 or MiniMax M2.5. The hardest engineering challenges? GLM-5.1. Frontend work from designs? Doubao Seed 2.0 Code.

GLM-5.1 SWE-bench Pro benchmark result
GLM-5.1 achieving #1 on SWE-bench Pro with 58.4 Source: Zhipu GLM benchmarks.
Kimi K2.6 benchmark results
K2.6 coding benchmark improvements over K2.5 Source: Kimi K2.6 benchmarks.
Kimi Code Bench results
K2.6 performance on the Kimi Code Bench Source: Kimi Code Bench.
Qwen 3.6 Artificial Analysis ranking
Qwen3.6 performance on the Artificial Analysis leaderboard Source: Artificial Analysis.
Qwen 3.6 benchmark scores
Detailed benchmark results for Qwen3.6 models Source: Qwen 3.6 benchmarks.
Flagship model benchmark comparison across all seven providers
ModelSWE-bench VerifiedSWE-bench ProOne-line summary
Kimi K2.576.8%-80.0%50.7-55.6The comprehensive code-fixing king
MiniMax M2.575.8%-80.2%Open-source value ceiling
Step 3.5 Flash74.4%Small model, big capability
Qwen3-Coder-Next~70%+Only 3B activated — lightweight champ
DeepSeek V3.2~67.8%Open-source veteran, reliably solid
GLM-5.158.4 (#1)King of the hardest engineering tasks
MiniMax M2.756.22Self-evolving agent specialist
Doubao Seed 2.0ICPC 5 gold medalsDesign mockup to frontend code
MiMo-V2-ProClaims exceed Sonnet 4.6Trillion parameters, 1M context
SWE-bench Verified scores — the model quality signals that matter for plan buyers

Higher is better. These scores reflect bug-fixing ability on real-world GitHub issues. Choose your plan based on which model fits your tasks, not which plan has the lowest monthly price.

Kimi K2.576.8%

SWE-bench Verified (custom), up to 80.0% in high-reasoning mode

MiniMax M2.575.8%

SWE-bench Verified (official leaderboard), up to 80.2% self-reported

Step 3.5 Flash74.4%

SWE-bench Verified — impressive for a 196B/11B MoE model

DeepSeek V3.2~67.8%

SWE-bench Verified — stable open-source veteran

MiniMax M2.756.22

SWE-bench Pro (different test set — not directly comparable to Verified)

GLM-5.158.4

SWE-bench Pro #1 — beats Claude Opus 4.6 and GPT-5.4 on hardest tasks

SWE-bench Verified and SWE-bench Pro are different benchmarks. Verified tests bug-fixing on real issues; Pro tests complex multi-file engineering tasks. Do not cross-compare the scores. Source: SWE-bench leaderboard.

Domestic vs overseas: why mainland users get better deals (and when they do not)

Why is domestic so much cheaper for most providers? Because the Chinese market is a seven-way knife fight for users. Seven providers in one pool, all slashing prices. The overseas market faces Claude, GPT, and Gemini — a completely different pricing logic where you do not need to win on price, you need to prove “I am not worse than Claude.”

The cheap prices you get domestically exist not because the service is truly cheap, but because someone is fighting a price war on your behalf. That war will not last forever. Bailian Lite is already gone. Volcengine first-purchase discounts are now flash sales. GLM has raised prices twice.

Those cheap tiers are disappearing one by one.

Domestic vs overseas price gap — who wins where?
ProviderDomestic top tierOverseas equivalentDomestic savings
Zhipu GLMMax ¥469/monthMax $160/month (~¥1,280)63% cheaper domestically
KimiModerato ¥79/monthModerato $19/month (~¥152)~48% cheaper domestically
Alibaba BailianPro ¥200/monthPro $50/month (~¥400)50% cheaper domestically
VolcengineLite ¥40/monthBytePlus Lite $10/month (~¥80)50% cheaper domestically
MiniMaxStarter ¥29/monthStarter $10/month (~¥80)Cheaper domestically, but overseas gives 2.5x quota
StepFunFlash Plus ¥99/monthFlash Plus $9.99/month (~¥80)Plus tier is actually cheaper overseas
Xiaomi MiMoLite ¥39/monthLite ~$5/month (~¥40)Roughly at parity
BuyGLM shows package prices in USD. When a source page is published in CNY, the displayed value uses a fixed 1 USD = 8 CNY conversion and should still be checked against the live vendor page before payment.

Will tokens keep getting more expensive?

Yes. And this process has barely started.

Late 2024 through mid-2025 was the “subsidize to acquire users” phase. Per-million-token prices dropped from tens of yuan to fractions. Cheap enough that you did not bother checking usage. From late 2025 onward, the screws tightened: first-purchase discounts eliminated, low tiers discontinued, flash sales replacing open stock. In 2026, we entered the “fine-grained segmentation” phase: MiniMax launched Highspeed tiers from ¥98 to ¥899, Kimi introduced the ¥559 Allegro tier.

Tokens are being tiered exactly like mobile data plans were. Daily passes, monthly bundles, unlimited-with-throttling — every playbook from the telecom era is being replayed in AI with remarkable precision.

The root cause is compute cost. Training a hundred-billion-parameter model runs into the hundreds of millions of dollars, and inference is not cheap either. The low prices were always venture-funded customer acquisition, and investor patience is finite.

Will the market consolidate down to a few suppliers? Very likely. Look at cloud computing today — AWS, Azure, and Alibaba Cloud in a three-way stand. The AI token supply chain will probably follow the same pattern. Aggregation platforms like Bailian and Volcengine are becoming the “virtual operators” of the AI era — they do not build base stations, they package several providers' signals and sell them to you.

  • Budget at list price, not promo price. First-purchase discounts only apply once. Flash sales are not guaranteed. Promotional periods end without warning.
  • The number you can sustainably pay every month for the foreseeable future — that is your real cost.
  • Consolidation is coming. The seven providers today may become three or four within a year or two.
BuyGLM shows package prices in USD. When a source page is published in CNY, the displayed value uses a fixed 1 USD = 8 CNY conversion and should still be checked against the live vendor page before payment.

If I were buying today: budget-tier recommendations

¥29/month will not get you the strongest model. ¥200/month will. Which is the better deal? The answer does not depend on the price — it depends on your use case. If you only code for 30 minutes a day, ¥29 is enough. If your livelihood depends on AI-assisted programming, ¥200 is the frugal choice.

Budget-tier buying recommendations from an engineer who uses these plans daily
BudgetPrimary pickWhyCombo option (if you need more)
¥30-50/monthMiniMax Starter (¥29) for pure coding, or Kimi Andante (¥39) for AI work-style bundleMiniMax is the cheapest real coding plan on the market, with full-modal support including images, voice, music, and video. Kimi Andante adds Agent, PPT, and research for people who do more than just code.If you have an international credit card, Xiaomi MiMo overseas at ~$5/month (~¥40) is an ultra-low-cost alternative.
¥100-200/monthZhipu GLM Pro (¥149)SWE-bench Pro #1 model. About 400 prompts per 5 hours. Enough for serious daily development. The best single-plan value for working engineers.Add Bailian Pro (¥200) or Volcengine Pro (¥200) if you need multi-model access for comparative testing. Total: ¥349-400/month.
¥400+/monthZhipu GLM Max (¥469) as the primary planApproximately 1,600 prompts per 5 hours, 8,000 per week. Full GLM-5.1 access. Made for全天候 (round-the-clock) high-intensity use.Combo: GLM Max + MiniMax Max Highspeed (¥199) for fast inference backup + Volcengine Pro (¥200) for multi-model + ArkClaw. Total: about ¥870/month. Covers every scenario.
BuyGLM shows package prices in USD. When a source page is published in CNY, the displayed value uses a fixed 1 USD = 8 CNY conversion and should still be checked against the live vendor page before payment.

Six buying rules to remember

  • Pick the model first, then find the plan. GLM-5.1 for hard engineering, Kimi K2.5 for comprehensive code repair, Doubao Seed 2.0 for frontend from designs, MiniMax for the cheapest entry with full-modal support.
  • Check which route you are actually buying. A membership is not a token plan is not a coding plan is not a PAYG API. These are fundamentally different products that happen to live on the same provider website.
  • Use the correct API key. Bailian requires sk-sp-... prefixed keys. GLM has separate DevPack and API routes. Using the wrong key or endpoint can cost you 5x more than the plan price.
  • Watch for peak-hour multipliers. GLM-5 uses 3x quota during peak hours (14:00-18:00 UTC+8). Your 400-prompt allowance drops to 133 effective prompts when you need them most.
  • Budget at regular price, not promo price. First-purchase discounts expire. Flash sales are unreliable. Promotional periods end. The sustainable monthly cost is what matters for planning.
  • If you are overseas, do not just convert CNY prices at face value. Check whether the overseas tier gives different quotas (MiniMax gives 2.5x more), has different purchase limits (GLM overseas has no stock caps), or is actually cheaper (StepFun Flash Plus).

The right coding plan is not the cheapest — it is the one that matches your workflow

Pick your model, pick your route, then pick your plan. This guide gives you the real pricing tables, the benchmark scores, and the gotchas for all seven Chinese AI coding plan providers. The compare tool on BuyGLM lets you put them side by side with live data.

Sources and official links

Frequently asked questions

Which Chinese AI coding plan is the cheapest?

MiniMax Domestic Starter at ¥29/month (about $3.63) is the cheapest real coding plan. If you want more than just coding — Agent, PPT, research — Kimi Andante at ¥39/month (about $4.88) is a broader bundle. If you have an international credit card, Xiaomi MiMo overseas at about $5/month is also competitive.

Why are Chinese AI coding plans cheaper domestically than internationally?

Intense domestic competition. Seven providers fighting for the same user base drives prices down. Internationally, they compete against Claude, GPT, and Gemini — where the strategy shifts from price competition to capability proof. The domestic price war benefits Chinese users, but those prices are not sustainable forever. Already, cheap tiers are disappearing: Bailian Lite is gone, Volcengine first-purchase discounts became flash sales, and GLM has raised prices twice.

Is it ever cheaper to buy the overseas version of a Chinese AI coding plan?

Surprisingly, yes. MiniMax overseas plans give 2.5x the quota at each tier. StepFun Flash Plus overseas ($9.99) is cheaper than domestic (¥99). GLM overseas has no purchase limits on Max tier while domestic frequently sells out. For some providers, the international version is the better deal — if you can pay with Stripe.

What happened to the Alibaba Bailian ¥40 Lite tier?

It is gone. New purchases stopped on March 20, 2026. Existing renewals and upgrades stopped on April 13, 2026. The cheapest Bailian tier is now Pro at ¥200/month — a 5x increase in the minimum entry price. This is part of the broader trend of low-cost tiers being eliminated across all providers.

Which model is the best for coding tasks?

It depends on the task. For bug-fixing and general code repair, Kimi K2.5 (SWE-bench Verified 76.8-80.0%) and MiniMax M2.5 (75.8-80.2%) are top tier. For the most complex engineering challenges, GLM-5.1 leads SWE-bench Pro at 58.4 — beating Claude Opus 4.6 and GPT-5.4. For design-to-code frontend work, Doubao Seed 2.0 Code (5 ICPC gold medals) is the specialist.

What is the “wrong API key” gotcha?

Alibaba Bailian requires a dedicated Coding Plan API key (prefixed sk-sp-...). If you use a regular API key, traffic is billed at pay-as-you-go rates — which can be 5x more expensive than the plan. This mistake can cost over ¥1,000 in a single month. Similarly, Volcengine requires using the dedicated Code Plan URL; traffic through the generic API endpoint does not count toward your plan quota.

What does “peak-hour multiplier” mean?

Some providers charge more quota per request during busy hours. Zhipu GLM is the most aggressive: from 14:00 to 18:00 UTC+8, GLM-5 consumes 3x quota per request. Off-peak is 2x. If your plan allows 400 prompts, peak-hour usage effectively limits you to about 133 prompts. A temporary off-peak promotion (1x multiplier) was running through late April 2026, but this is not guaranteed to persist.

Should I wait for prices to come back down?

Every signal points in the opposite direction. Cheap tiers are being discontinued, first-purchase discounts are being eliminated or converted to flash sales, and multiple providers have raised prices in 2026. The current pricing environment is likely the most competitive it will ever be. Budget at today's regular prices, not at promotional rates.