When did GLM-5.2 launch and to whom?

GLM-5.2 rolled out on June 13, 2026 to all GLM Coding Plan tiers (Lite, Pro, Max, Team), according to a TestingCatalog post citing Z.ai's announcement.

What context window does GLM-5.2 support?

Z.ai advertises a usable 1 million token context window for GLM-5.2, framed as supporting full-repository loading and long-horizon agent workflows.

When will the API and open weights be available?

Z.ai has stated that API access and an MIT-licensed open-weight release are planned for the week following the June 13 rollout, but no exact date has been published.

What reasoning modes does GLM-5.2 expose?

Per AI Weekly's summary of the launch, GLM-5.2 ships with two thinking modes — High Mode for everyday coding latency and Max Mode for harder reasoning, debugging, and multi-file generation.

What positioning does Zhipu use for GLM-5.2?

Zhipu calls GLM-5.2 its new flagship model and emphasizes coding capability and long-horizon task performance over chatbot benchmarks, per its founder's announcement reposted on Hacker News.

Has Zhipu published official benchmark numbers for GLM-5.2?

No. AI Weekly explicitly notes that no independent or first-party benchmark results were published at launch and that third-party verification is pending.

How does GLM-5.2 compare to Claude Opus 4.7 or Kimi K2.7?

There are no measured head-to-head numbers in the public signal yet. The 'Opus 4.7 class' framing comes from a single tweet by Bindu Reddy and should be treated as a community impression, not a measured comparison.

What is GLM-5.2 priced at via API?

The API has not launched, so per-token pricing is not yet public. The only pricing visible is the GLM Coding Plan tiers, which Reddit users report at roughly $65 to $160 per month depending on tier.

Is the open-weight release the same model served on the GLM Coding Plan?

Zhipu's announcement implies the same model, but no model card has been published yet to confirm parameter count, training data scope, or whether the served and open-weight checkpoints are identical.

What evaluation harness or eval suite was used by Zhipu internally?

Not disclosed. Zhipu's announcement asserts continued strength on long-horizon coding but does not name SWE-bench, Aider, LiveCodeBench, or any specific harness with numbers.

TLDRZhipu shipped GLM-5.2 to GLM Coding Plan users with a 1M-token window and an MIT open-weight release promised for next week. Here's what's actually known.

GLM-5.2 Release: What the First 24 Hours of Signal Actually Show

Zhipu shipped GLM-5.2 on the evening of June 13, 2026, with no benchmark deck, no system card, and no API. The rollout went to paying users of the GLM Coding Plan first, with an MIT-licensed open-weight release and API access both promised for the following week. About 24 hours in, most of the chatter is impressions, screenshots, and one-shot test artifacts — the kind of window where the public narrative outpaces the public data.

What Was Actually Shipped

The verifiable facts so far are narrow. GLM-5.2 became available to all four GLM Coding Plan tiers — Lite, Pro, Max, and Team — on June 13, per a TestingCatalog post that quoted Zhipu's own framing of the model as its "new flagship" with "powerful coding capabilities, usable 1M-context support, and continued strengths in long-horizon tasks." The Hacker News front-page submission reposts the founder's launch note in full, including the commitment that "the API will also go live next week" and that GLM-5.2 will ship under an open weight release.

That note is also where the political framing of this launch lives. Zhipu's founder positions GLM-5.2 against what the post calls "the sudden restriction of certain frontier models," with the line "frontier intelligence must remain open-source, accessible, and buildable." Several community summaries explicitly link the timing of the drop to the U.S. Fable 5 ban as reported by BridgeMind's GLM-5.2 walkthrough, which characterizes the same-day launch as a marketing response.

The two named capability claims to track are Long-Horizon Coding and the "usable 1M context" framing — both Zhipu's own language. An AI Weekly summary adds two more terms worth fixing in vocabulary: High Mode and Max Mode, the two reasoning modes the company surfaces in the Coding Plan UI, with Max Mode positioned for complex coding tasks.

Why This Launch Matters

The shape of this release is more interesting than any individual claim inside it. Zhipu chose to ship the served version to paying coding-plan users first, hold the API for a week, and only then drop open weights under MIT. That ordering inverts the usual open-source playbook of weights-first, hosted-later, and it concentrates early evaluation inside a paying user base whose feedback loops back to Zhipu before anyone else can replicate results.

The competitive backdrop matters too. Bindu Reddy's reaction — calling it "a very useful Opus 4.7 class model" — is a vibe check from one named voice, not a measurement, but it captures where Chinese open-weight coding models are being positioned in the conversation. Mark Kretschmann's post frames the same pattern as a sustained release cadence: "The pace from China is just brutal now. GLM, Qwen, Kimi, DeepSeek."

For builders, the interesting bet is the MIT license. If the open-weight checkpoint matches the served model, GLM-5.2 becomes a permissively licensed 1M-context coding model that any team can host themselves. That is the genuinely new variable in this release.

What Early Testing Suggests

Hands-on coverage so far is community-curated, not measured. A LocalLLaMA user ran a one-shot Pac-Man generation test using a structured intent/scope/constraints prompt and reported the result as the best of any model the author had tried on the same harness, with one functional bug (ghosts stuck near the ghost house) that resolved in a single follow-up prompt. The same writeup mentions a throughput observation of "70 tok/s, slower than GLM 5.1" and notes the model "seems to spend more time reasoning."

A separate Medium walkthrough from Mehul Gupta enumerates the High Mode / Max Mode split and characterizes GLM-5.2 as targeting "agentic software engineering" — tool calling, multi-step planning, repository analysis. Bijan Bowen's 32-minute YouTube review is the longest hands-on so far at roughly 28,000 views in the first day. None of these constitute a benchmark.

What this tells us: the model exists, ships with two reasoning modes, and is producing usable one-shot output on at least one structured prompt. What it doesn't tell us: how GLM-5.2 scores on SWE-bench Verified, Aider Polyglot, LiveCodeBench, or any standardized harness — because Zhipu has published none, and no independent eval has landed in the 24-hour window.

GLM-5.2 vs Kimi K2.7 Code: What the Signal Says

Kimi K2.7 Code is the closest analog in the signal set — another Chinese open-weight coding model in active release cadence, and the explicit comparison point in the Medium post's headline. The comparison is suggestive, not measured.

On context window, GLM-5.2 advertises 1M tokens; no comparable number for K2.7 Code appears in this signal set, so the dimension is unverified for K2.7. On license, Zhipu has committed to MIT weights within roughly a week; K2.7's license status is unverified — no public number from this signal set. On reasoning modes, GLM-5.2 exposes High Mode and Max Mode per the AI Weekly summary; K2.7's mode structure is unverified — no public number from this signal set. On published benchmarks at launch, both are effectively at zero in this window: AI Weekly notes GLM-5.2 shipped without any. On community impression, the only named comparison is Bindu Reddy's "Opus 4.7 class" framing for GLM-5.2, which is a single tweet from one named voice, not a measured head-to-head against K2.7.

The honest read: anyone publishing a GLM-5.2 vs K2.7 Code comparison table right now is filling in numbers that don't exist yet. Wait for the open-weight drop and at least one third-party eval before relying on a ranking.

What We Know vs. What We Don't

Confirmed by the signal so far:

GLM-5.2 became available to all GLM Coding Plan tiers on June 13, 2026, per TestingCatalog.
Z.ai advertises a "usable" 1M-token context window.
An MIT-licensed open-weight release is planned for the week of June 15, 2026, per the founder's announcement on Hacker News.
GLM-5.2 ships with two reasoning modes — High Mode and Max Mode — per AI Weekly.
Zhipu positions GLM-5.2 as a flagship for coding and long-horizon agent workflows, not as a general chatbot.

Not confirmed and worth treating with hedge:

No first-party or third-party benchmark numbers have been published at launch.
Parameter count, training data scope, and architecture details are undisclosed.
API pricing has not been announced — only the GLM Coding Plan subscription tiers exist publicly, which Reddit users on r/ZaiGLM report at roughly $65 to $160 per month depending on tier.
Whether the served Coding Plan model and the upcoming open-weight checkpoint are identical is not stated.
The "Opus 4.7 class" framing comes from one tweet and should be treated as a community impression rather than a measured claim.
Throughput numbers, with one community report of roughly 70 tokens per second, come from a single user and one prompt.

What Builders Should Do Today

Three concrete moves while the signal is still thin. First, pin a private coding eval before the open weights drop — even a small SWE-bench subset, Aider Polyglot run, or your team's internal regression set — so that when GLM-5.2 weights land, the comparison against your current stack is measured rather than vibe-based. Second, treat the 1M-context claim as a hypothesis to test rather than a feature to design around: run the actual long-horizon task you care about, with full repository context, and measure recall at the tail. Third, hold off on production migration until at least the API ships and one independent eval lands.

The Week Ahead

Three signals to watch. Watch for the MIT-licensed weight release on HuggingFace and check whether Zhipu publishes a model card with parameter count, training scope, and an eval table. Run a private long-horizon coding eval against your current production model the day the API opens, rather than relying on community one-shot demos. Pin the Coding Plan pricing page and the eventual API pricing page side by side — the served-vs-API price delta will tell you how Zhipu values its hosted inference relative to letting teams self-host the MIT weights.

Want to call GLM-5.2 via API? kie.ai has it.

GLM-5.2 Release: What the First 24 Hours Show