Chinese AI Coding Plan Pricing in 2026: All 7 Providers, Domestic vs Overseas, Benchmarks, and Honest Buying Advice
An engineer who burns through over a billion tokens a month went page-by-page through every Chinese AI coding plan and its overseas counterpart. This guide covers real pricing tables for all seven providers, domestic-vs-international quota gaps, benchmark scores that matter, and opinionated buying advice for every budget tier from ¥29 to ¥870 per month.
- Cheapest mainland entry: MiniMax Starter at ¥29/month. Cheapest “AI work style” bundle: Kimi Andante at ¥39/month. Cheapest no-5-hour-window plan: MiMo Lite at ¥39/month. These are different products — pick the one that matches your workflow, not just the lowest number.
- Price hikes are accelerating: Zhipu GLM raised prices 30%+ in February 2026 alone, Alibaba Bailian killed its ¥40 Lite tier entirely, and Volcengine replaced first-purchase discounts with daily 10:30 AM flash sales. The cheap plans are disappearing one by one.
- Counter-intuitive finding: MiniMax gives overseas users 2.5x the quota of domestic users at the same tier. StepFun Flash Plus is cheaper overseas ($9.99) than in mainland China (¥99). Sometimes the grass really is greener on the international side.
- Model-first, plan-second: GLM-5.1 tops SWE-bench Pro at 58.4 (beating Claude Opus 4.6 and GPT-5.4). Kimi K2.5 hits 76.8-80.0% on SWE-bench Verified. Pick the model that fits your tasks, then find the plan that unlocks it at the right price.
Token is the new data plan
Remember how mobile data plans evolved? In 2009, you paid 100 yuan for 1 GB and thought it would last forever. By 2013, the same money barely bought a few gigabytes. Then came “unlimited” plans — except they throttled you after a threshold. By 2020, truly unlimited data was gone, and everyone was counting gigabytes again.
Token pricing is following the exact same trajectory. From late 2024 through mid-2025, providers slashed per-million-token prices to fractions of a yuan — cheap enough that you did not bother checking usage. From late 2025 onward, the wind shifted: purchase limits, price hikes, discontinued low tiers, and first-purchase discounts that turned from “buy anytime” to “10:30 AM daily flash sale.”
You thought tokens were air. They are actually data plans — the free stuff is disappearing, and the cheap stuff is getting more expensive.
I am an engineer who consumes over a billion tokens per month. From Claude Code to Cursor, from agent swarms to automated pipelines, tokens are oxygen — you do not notice them until they run out. What follows is my honest, page-by-page audit of every major Chinese AI coding plan and its overseas counterpart, written for fellow engineers who need to make smart buying decisions right now.
The 30-second summary
Look at that table carefully. Some providers charge more overseas but give you more quota. Others are simply more expensive internationally with no upside. The right answer depends on your payment method, your network latency tolerance, and how much quota you actually burn per session.
| Provider | Cheapest domestic tier | Cheapest overseas tier | Is overseas worth it? |
|---|---|---|---|
| MiniMax | ¥29/month | $10/month | Overseas gives 2.5x the quota per tier |
| Kimi (Moonshot) | ¥39/month | $19/month | Domestic is cheaper |
| Xiaomi MiMo | ¥39/month | ~$5/month | Roughly at parity |
| Volcengine (ByteDance) | ¥40/month | $10/month | Domestic is cheaper |
| Zhipu GLM | ¥49/month | $18/month | Overseas has no purchase limits |
| StepFun | ¥49/month | $6.99/month | Plus tier is cheaper overseas |
| Alibaba Bailian | ¥200/month | $50/month | Domestic is cheaper |
Zhipu GLM: two rounds of price hikes, and domestic users still get a bargain
At an 8:1 exchange rate, the overseas Max tier costs roughly ¥1,280/month while the domestic Max is only ¥469. Same model, same quotas — mainland users pay less than half what overseas users pay.
But the overseas version has one crucial advantage: no purchase limits. The domestic Max tier frequently sells out, with only about 20% of quota released daily at 10:00 AM Beijing time. The overseas DevPack is almost always in stock. And with annual billing plus discount codes, the effective overseas price can drop significantly.


| Tier | Domestic (CNY/month) | Overseas Z.ai (USD/month) | Overseas in CNY (approx.) |
|---|---|---|---|
| Lite | ¥49 | $18 | ~¥144 |
| Pro | ¥149 | $72 | ~¥576 |
| Max | ¥469 | $160 | ~¥1,280 |
GLM gotchas: peak-hour multipliers and the Lite model trap
Three things catch people off guard with Zhipu GLM:
- GLM-5 and GLM-5.1 are restricted to Pro tier and above. Lite subscribers only get GLM-4.7. If you bought Lite expecting the flagship model, you will be disappointed.
- Peak-hour quota multiplier: from 14:00 to 18:00 UTC+8, GLM-5 consumes 3x quota per request. Off-peak is 2x. You think you have 400 prompts, but during peak hours you actually get about 133. A temporary off-peak promotion (1x multiplier) was running through late April 2026, but this is not guaranteed to last.
- Domestic Max tier has severe availability limits. Only about 20% of daily quota opens at 10:00 AM Beijing time, and it frequently sells out within minutes. Set your alarm or buy overseas.


Kimi (Moonshot AI): not just a coding tool — an AI work style
Like GLM, domestic pricing is far below overseas. Domestic Moderato at ¥79 is roughly half the price of the overseas equivalent at $19 (about ¥152). The overseas-only Vivace tier ($199/month) unlocks 4 concurrent agents plus 8 sub-agent swarms — there is no domestic equivalent at that scale.
But here is what makes Kimi different from every other provider on this list: the ¥39 Andante tier includes 10 Agent sessions per month, 10 deep research sessions, 10 PPT generations, plus Kimi Code programming quota. For people who are not full-time programmers but occasionally need to write code, do research, or create presentations, this “everything bundle” is more practical than any pure coding plan.
Kimi is not selling a coding tool. It is selling an AI work style. That positioning separates it from every other provider. My team has product managers who use Kimi all day — and they compete with the engineers for Agent quota.






Official screenshot
Kimi K2.5 is best introduced from the technical blog, not from social summaries
The public K2.5 tech blog already combines the multimodal upgrade story, benchmark tables, and Agent Swarm explanation in one official page.
- Best official visual for the K2 to K2.5 family transition.
- Useful for readers who need proof that Agent Swarm and MoonViT are part of the public story.
Source: Official Kimi K2.5 tech blog.
| Tier | Domestic (CNY/month) | Overseas (USD/month) | Overseas in CNY (approx.) |
|---|---|---|---|
| Andante | ¥39 | — | Domestic only |
| Moderato | ¥79 | $19 | ~¥152 |
| Allegretto | ¥159 | $39 | ~¥312 |
| Allegro | ¥559 | $99 | ~¥792 |
| Vivace | — | $199 | Overseas only (~¥1,592) |
MiniMax: the absolute cheapest entry — but the domestic version is quota-capped
On the model side, M2.7 scored 56.22% on SWE-Pro and 57.0% on Terminal-Bench 2.0. The older M2.5 was even more impressive: SWE-bench Verified 75.8%-80.2% (official leaderboard 75.80%, self-reported maximum 80.2%), placing it among the very best open-source models.
What makes MiniMax unique is full-modal bundling — images, voice, music, and video are all included in the same plan at no extra cost. Nobody else offers that. ¥29/month does not just buy you a coding tool; it buys you an AI multimedia studio.





Official screenshot
MiniMax clearly positions Token Plan as the current subscription route
The official Token Plan overview is the best first stop for public-facing articles because it explains the route before readers ever hit a pricing table.
- Useful for queries around MiniMax Token Plan, MiniMax Coding Plan, and MiniMax subscription.
- Helps clarify that Token Plan is the current public route readers should treat as primary.
Source: MiniMax Token Plan overview.

Official screenshot
MiniMax publishes a strong public pricing table for monthly and Highspeed tiers
This pricing-table view is one of the best official screenshots in the category because readers can verify standard and Highspeed tiers directly from the source page.
- Shows the monthly standard tiers and the Highspeed plan table in one view.
- A good visual checkpoint before repeating plan prices or 5-hour request limits in an article.
Source: MiniMax Token Plan pricing.
| Tier | Domestic (CNY/month) | Overseas (USD/month) |
|---|---|---|
| Plus Highspeed | ¥98 | $40 |
| Max Highspeed | ¥199 | $80 |
| Ultra Highspeed | ¥899 | $150 |
Xiaomi MiMo: finally, a plan without the 5-hour window trap
The overseas starting price of approximately $5/month may be the cheapest coding plan entry point anywhere in the world. Note that Xiaomi has said “global launch” but has not formally published overseas pricing — treat the USD numbers as estimates.
MiMo-V2-Pro is the flagship model: over 1 trillion total parameters (MoE architecture, 42B activated), with a 1M context window. Xiaomi claims coding performance surpasses Claude 4.6 Sonnet and approaches Opus 4.6, though independent third-party benchmark verification is still pending. PAYG API costs are reportedly about 20% of competing models.
I know several engineers who do agent debugging for a living, and they love the no-window model. Their work pattern is “sit still all day, then suddenly get three hours of intense inspiration.” MiMo Credits match that rhythm perfectly.
Good product design is not about giving you more — it is about not setting limits. MiMo understood this.





Official screenshot
MiMo-V2-Pro now has a strong official release page with buyer-relevant claims
The Xiaomi release note is the best page to anchor the 1T / 42B / 1M context story before you move into the pricing and integration docs.
- Useful for replacing older beta-era summaries with official product language.
- Pairs naturally with the pricing and tools overview pages for route clarity.
Source: Official MiMo-V2-Pro release note.

Official image
Xiaomi publishes the Artificial Analysis ranking image directly on the official MiMo-V2-Pro page
The buyer-facing MiMo page does not only describe the model in prose. It also exposes the ranking visual that Xiaomi uses to support the “8th worldwide, 2nd among Chinese LLMs” positioning.
- Useful when readers want a traceable official image instead of a copied leaderboard screenshot from social posts.
- Works well alongside the pricing page because it keeps performance proof and route proof on official Xiaomi surfaces.
Source: Official MiMo-V2-Pro page.
| Tier | Domestic (CNY/month) | Overseas (USD, estimated) | First-purchase 12% off (CNY) | Credits |
|---|---|---|---|---|
| Lite | ¥39 | ~$5 | ¥34.32 | 60 million |
| Standard | ¥99 | ~$14 | ¥87.12 | 200 million |
| Pro | ¥329 | ~$44 | ¥289.52 | 700 million |
| Max | ¥659 | ~$88 | ¥579.92 | 1.6 billion |
StepFun (Jieyu Xingchen): the one where overseas is actually cheaper
”Promotional period” is the countdown timer on every discount. You think you are gaming the system. They are building your habits.
- All prices are currently labeled “promotional period pricing” — StepFun has not promised these rates are permanent.
- Billing is prompt-based, where 1 prompt equals roughly 15-20 model calls. The prompt count looks low, but each prompt does more work.
- Step Plan is tightly coupled to the OpenClaw ecosystem. If you are a heavy OpenClaw user, the value proposition is strong. What happens after the promotional period ends is an open question.





Official screenshot
Step 3.5 Flash exposes architecture and benchmark rows directly on the official repo page
For public writing, the GitHub page is one of the best official assets because it makes the MoE architecture, benchmark table, and open-weight route visible without guesswork.
- Strong image for explaining why Step 3.5 Flash is framed around speed and decoding efficiency.
- Useful when readers want to verify that the model is an Apache 2.0 open release.
| Tier | Domestic (CNY/month) | Overseas (USD/month) | Overseas in CNY (approx.) | Prompts per 5h |
|---|---|---|---|---|
| Flash Mini | ¥49 | $6.99 | ~¥56 | ~100 |
| Flash Plus | ¥99 | $9.99 | ~¥80 | ~400 |
| Flash Pro | ¥199 | $29 | ~¥232 | ~1,500 |
| Flash Max | ¥699 | $99 | ~¥792 | ~5,000 |
Alibaba Bailian: the ¥40 entry tier vanished overnight
Overseas Pro at $50 (about ¥400) is double the domestic ¥200 price. Like GLM and Kimi, domestic users get a substantial price advantage.
But Pro has something nobody else offers: multi-model aggregation. One subscription gives you access to Qwen3-Coder-Next, Kimi-K2.5, GLM-5, and MiniMax-M2.5. It is like having one SIM card that works on every carrier network simultaneously. Plus it supports both OpenAI-compatible and Anthropic-compatible endpoints.
When every other provider is selling their own model, Bailian chose a different path: selling a model supermarket. Whether that works long-term depends on whether the shelves stay stocked with top-tier models.
- Critical technical gotcha: Bailian requires a dedicated Coding Plan API key (prefixed sk-sp-...). Using a regular API key will trigger pay-as-you-go billing, which can cost 5x more than the plan. I have seen someone burn through over ¥1,000 in a single month because they used the wrong key.
- The entry barrier jumping from ¥40 to ¥200 is genuinely hostile to students and newcomers.
- Multi-model access in one plan is excellent for anyone doing comparative model evaluation — no more juggling five different accounts and API keys.





Official screenshot
Qwen3.6-Plus already has route-specific public pricing on the Model Studio side
The Alibaba pricing page is the safest public surface for route-aware Qwen3.6-Plus billing because it distinguishes mainland and international rows directly on the official page.
- Useful when articles need one source-backed image for regional pricing differences.
- Pairs well with benchmark claims from the Qwen 3.6 release page.
Source: Alibaba Cloud Model Studio pricing.
| Tier | Domestic (CNY/month) | Overseas (USD/month) | Requests per 5h | Requests per month |
|---|---|---|---|---|
| ~~Lite~~ (discontinued) | ~~¥40~~ | — | ~~1,200~~ | ~~18,000~~ |
| Pro | ¥200 | $50 | 6,000 | 90,000 |
Volcengine Ark (ByteDance): the design-to-code frontend specialist
Doubao Seed 2.0 Code deserves special attention. It swept all five gold medals in ICPC programming competitions. The community has positioned it as “the visual-driven frontend weapon” — if your workflow is “design mockup to working code,” this is currently the best value option.
Volcengine also fell into the first-purchase discount trap: Lite was ¥9.9 and Pro was ¥49.9 for your first purchase, but from March 13 onward, those became daily 10:30 AM limited flash sales. Yes, you read that right — you now set an alarm to抢 (snatch) token plans. Concert tickets, Moutai liquor, and now AI subscriptions. This is what 2026 looks like for programmers.
Pro subscriptions also include ArkClaw Lite. Users already in the ByteDance ecosystem (TRAE, Doubao App) will find the integration natural. As with Bailian, make sure you use the dedicated Code Plan URL — traffic through the generic API endpoint is not counted toward your plan quota.





Official screenshot
The official Seed2.0 page is the cleanest public source for the model-family story
ByteDance's Seed site makes the Pro, Lite, and Mini lineup, benchmark breadth, and Ark access links visible in one place. That is stronger than building the article around unofficial pricing screenshots.
- Best official visual for the Seed2.0 family overview.
- Useful for replacing reseller-style descriptions with Seed's own model framing.
Source: Official Seed2.0 model page.
| Tier | Domestic (CNY/month) | Overseas BytePlus (USD/month) | Requests per 5h | Requests per month |
|---|---|---|---|---|
| Lite | ¥40 | $10 (~¥80) | 1,200 | 18,000 |
| Pro | ¥200 | $50 (~¥400) | 6,000 | 90,000 |
Benchmark comparison: the models behind the plans
Choose your model before you choose your plan. Choose your model based on what you actually do. Comprehensive bug-fixing? Kimi K2.5 or MiniMax M2.5. The hardest engineering challenges? GLM-5.1. Frontend work from designs? Doubao Seed 2.0 Code.





| Model | SWE-bench Verified | SWE-bench Pro | One-line summary |
|---|---|---|---|
| Kimi K2.5 | 76.8%-80.0% | 50.7-55.6 | The comprehensive code-fixing king |
| MiniMax M2.5 | 75.8%-80.2% | — | Open-source value ceiling |
| Step 3.5 Flash | 74.4% | — | Small model, big capability |
| Qwen3-Coder-Next | ~70%+ | — | Only 3B activated — lightweight champ |
| DeepSeek V3.2 | ~67.8% | — | Open-source veteran, reliably solid |
| GLM-5.1 | — | 58.4 (#1) | King of the hardest engineering tasks |
| MiniMax M2.7 | — | 56.22 | Self-evolving agent specialist |
| Doubao Seed 2.0 | ICPC 5 gold medals | — | Design mockup to frontend code |
| MiMo-V2-Pro | Claims exceed Sonnet 4.6 | — | Trillion parameters, 1M context |
Higher is better. These scores reflect bug-fixing ability on real-world GitHub issues. Choose your plan based on which model fits your tasks, not which plan has the lowest monthly price.
SWE-bench Verified (custom), up to 80.0% in high-reasoning mode
SWE-bench Verified (official leaderboard), up to 80.2% self-reported
SWE-bench Verified — impressive for a 196B/11B MoE model
SWE-bench Verified — stable open-source veteran
SWE-bench Pro (different test set — not directly comparable to Verified)
SWE-bench Pro #1 — beats Claude Opus 4.6 and GPT-5.4 on hardest tasks
SWE-bench Verified and SWE-bench Pro are different benchmarks. Verified tests bug-fixing on real issues; Pro tests complex multi-file engineering tasks. Do not cross-compare the scores. Source: SWE-bench leaderboard.
Domestic vs overseas: why mainland users get better deals (and when they do not)
Why is domestic so much cheaper for most providers? Because the Chinese market is a seven-way knife fight for users. Seven providers in one pool, all slashing prices. The overseas market faces Claude, GPT, and Gemini — a completely different pricing logic where you do not need to win on price, you need to prove “I am not worse than Claude.”
The cheap prices you get domestically exist not because the service is truly cheap, but because someone is fighting a price war on your behalf. That war will not last forever. Bailian Lite is already gone. Volcengine first-purchase discounts are now flash sales. GLM has raised prices twice.
Those cheap tiers are disappearing one by one.
| Provider | Domestic top tier | Overseas equivalent | Domestic savings |
|---|---|---|---|
| Zhipu GLM | Max ¥469/month | Max $160/month (~¥1,280) | 63% cheaper domestically |
| Kimi | Moderato ¥79/month | Moderato $19/month (~¥152) | ~48% cheaper domestically |
| Alibaba Bailian | Pro ¥200/month | Pro $50/month (~¥400) | 50% cheaper domestically |
| Volcengine | Lite ¥40/month | BytePlus Lite $10/month (~¥80) | 50% cheaper domestically |
| MiniMax | Starter ¥29/month | Starter $10/month (~¥80) | Cheaper domestically, but overseas gives 2.5x quota |
| StepFun | Flash Plus ¥99/month | Flash Plus $9.99/month (~¥80) | Plus tier is actually cheaper overseas |
| Xiaomi MiMo | Lite ¥39/month | Lite ~$5/month (~¥40) | Roughly at parity |
Will tokens keep getting more expensive?
Yes. And this process has barely started.
Late 2024 through mid-2025 was the “subsidize to acquire users” phase. Per-million-token prices dropped from tens of yuan to fractions. Cheap enough that you did not bother checking usage. From late 2025 onward, the screws tightened: first-purchase discounts eliminated, low tiers discontinued, flash sales replacing open stock. In 2026, we entered the “fine-grained segmentation” phase: MiniMax launched Highspeed tiers from ¥98 to ¥899, Kimi introduced the ¥559 Allegro tier.
Tokens are being tiered exactly like mobile data plans were. Daily passes, monthly bundles, unlimited-with-throttling — every playbook from the telecom era is being replayed in AI with remarkable precision.
The root cause is compute cost. Training a hundred-billion-parameter model runs into the hundreds of millions of dollars, and inference is not cheap either. The low prices were always venture-funded customer acquisition, and investor patience is finite.
Will the market consolidate down to a few suppliers? Very likely. Look at cloud computing today — AWS, Azure, and Alibaba Cloud in a three-way stand. The AI token supply chain will probably follow the same pattern. Aggregation platforms like Bailian and Volcengine are becoming the “virtual operators” of the AI era — they do not build base stations, they package several providers' signals and sell them to you.
- Budget at list price, not promo price. First-purchase discounts only apply once. Flash sales are not guaranteed. Promotional periods end without warning.
- The number you can sustainably pay every month for the foreseeable future — that is your real cost.
- Consolidation is coming. The seven providers today may become three or four within a year or two.
If I were buying today: budget-tier recommendations
¥29/month will not get you the strongest model. ¥200/month will. Which is the better deal? The answer does not depend on the price — it depends on your use case. If you only code for 30 minutes a day, ¥29 is enough. If your livelihood depends on AI-assisted programming, ¥200 is the frugal choice.
| Budget | Primary pick | Why | Combo option (if you need more) |
|---|---|---|---|
| ¥30-50/month | MiniMax Starter (¥29) for pure coding, or Kimi Andante (¥39) for AI work-style bundle | MiniMax is the cheapest real coding plan on the market, with full-modal support including images, voice, music, and video. Kimi Andante adds Agent, PPT, and research for people who do more than just code. | If you have an international credit card, Xiaomi MiMo overseas at ~$5/month (~¥40) is an ultra-low-cost alternative. |
| ¥100-200/month | Zhipu GLM Pro (¥149) | SWE-bench Pro #1 model. About 400 prompts per 5 hours. Enough for serious daily development. The best single-plan value for working engineers. | Add Bailian Pro (¥200) or Volcengine Pro (¥200) if you need multi-model access for comparative testing. Total: ¥349-400/month. |
| ¥400+/month | Zhipu GLM Max (¥469) as the primary plan | Approximately 1,600 prompts per 5 hours, 8,000 per week. Full GLM-5.1 access. Made for全天候 (round-the-clock) high-intensity use. | Combo: GLM Max + MiniMax Max Highspeed (¥199) for fast inference backup + Volcengine Pro (¥200) for multi-model + ArkClaw. Total: about ¥870/month. Covers every scenario. |
Six buying rules to remember
- Pick the model first, then find the plan. GLM-5.1 for hard engineering, Kimi K2.5 for comprehensive code repair, Doubao Seed 2.0 for frontend from designs, MiniMax for the cheapest entry with full-modal support.
- Check which route you are actually buying. A membership is not a token plan is not a coding plan is not a PAYG API. These are fundamentally different products that happen to live on the same provider website.
- Use the correct API key. Bailian requires sk-sp-... prefixed keys. GLM has separate DevPack and API routes. Using the wrong key or endpoint can cost you 5x more than the plan price.
- Watch for peak-hour multipliers. GLM-5 uses 3x quota during peak hours (14:00-18:00 UTC+8). Your 400-prompt allowance drops to 133 effective prompts when you need them most.
- Budget at regular price, not promo price. First-purchase discounts expire. Flash sales are unreliable. Promotional periods end. The sustainable monthly cost is what matters for planning.
- If you are overseas, do not just convert CNY prices at face value. Check whether the overseas tier gives different quotas (MiniMax gives 2.5x more), has different purchase limits (GLM overseas has no stock caps), or is actually cheaper (StepFun Flash Plus).
The right coding plan is not the cheapest — it is the one that matches your workflow
Pick your model, pick your route, then pick your plan. This guide gives you the real pricing tables, the benchmark scores, and the gotchas for all seven Chinese AI coding plan providers. The compare tool on BuyGLM lets you put them side by side with live data.
Sources and official links
- Zhipu GLM domestic plans
- Z.AI DevPack subscription
- GLM-5.1 official docs
- Kimi membership pricing
- Kimi K2.5 tech blog
- Kimi K2.6 tech blog
- Kimi K2.5 API pricing
- MiniMax Token Plan pricing
- MiniMax M2.5 release
- MiniMax M2.7 release
- Xiaomi MiMo Token Plan
- MiMo-V2-Pro release
- StepFun StepPlan overview
- Step 3.5 Flash GitHub
- Alibaba Bailian Coding Plan
- Alibaba Model Studio pricing
- Volcengine ARK Code Plan overview
- Volcengine ARK pricing notice
- BytePlus ModelArk pricing
- Seed2.0 model page
- SWE-bench leaderboard
- Qwen 3.6 release blog
Frequently asked questions
Which Chinese AI coding plan is the cheapest?
MiniMax Domestic Starter at ¥29/month (about $3.63) is the cheapest real coding plan. If you want more than just coding — Agent, PPT, research — Kimi Andante at ¥39/month (about $4.88) is a broader bundle. If you have an international credit card, Xiaomi MiMo overseas at about $5/month is also competitive.
Why are Chinese AI coding plans cheaper domestically than internationally?
Intense domestic competition. Seven providers fighting for the same user base drives prices down. Internationally, they compete against Claude, GPT, and Gemini — where the strategy shifts from price competition to capability proof. The domestic price war benefits Chinese users, but those prices are not sustainable forever. Already, cheap tiers are disappearing: Bailian Lite is gone, Volcengine first-purchase discounts became flash sales, and GLM has raised prices twice.
Is it ever cheaper to buy the overseas version of a Chinese AI coding plan?
Surprisingly, yes. MiniMax overseas plans give 2.5x the quota at each tier. StepFun Flash Plus overseas ($9.99) is cheaper than domestic (¥99). GLM overseas has no purchase limits on Max tier while domestic frequently sells out. For some providers, the international version is the better deal — if you can pay with Stripe.
What happened to the Alibaba Bailian ¥40 Lite tier?
It is gone. New purchases stopped on March 20, 2026. Existing renewals and upgrades stopped on April 13, 2026. The cheapest Bailian tier is now Pro at ¥200/month — a 5x increase in the minimum entry price. This is part of the broader trend of low-cost tiers being eliminated across all providers.
Which model is the best for coding tasks?
It depends on the task. For bug-fixing and general code repair, Kimi K2.5 (SWE-bench Verified 76.8-80.0%) and MiniMax M2.5 (75.8-80.2%) are top tier. For the most complex engineering challenges, GLM-5.1 leads SWE-bench Pro at 58.4 — beating Claude Opus 4.6 and GPT-5.4. For design-to-code frontend work, Doubao Seed 2.0 Code (5 ICPC gold medals) is the specialist.
What is the “wrong API key” gotcha?
Alibaba Bailian requires a dedicated Coding Plan API key (prefixed sk-sp-...). If you use a regular API key, traffic is billed at pay-as-you-go rates — which can be 5x more expensive than the plan. This mistake can cost over ¥1,000 in a single month. Similarly, Volcengine requires using the dedicated Code Plan URL; traffic through the generic API endpoint does not count toward your plan quota.
What does “peak-hour multiplier” mean?
Some providers charge more quota per request during busy hours. Zhipu GLM is the most aggressive: from 14:00 to 18:00 UTC+8, GLM-5 consumes 3x quota per request. Off-peak is 2x. If your plan allows 400 prompts, peak-hour usage effectively limits you to about 133 prompts. A temporary off-peak promotion (1x multiplier) was running through late April 2026, but this is not guaranteed to persist.
Should I wait for prices to come back down?
Every signal points in the opposite direction. Cheap tiers are being discontinued, first-purchase discounts are being eliminated or converted to flash sales, and multiple providers have raised prices in 2026. The current pricing environment is likely the most competitive it will ever be. Budget at today's regular prices, not at promotional rates.