GPT-5.6 release at a glance: Sol, Terra, Luna positioning and limited preview status
OpenAI dropped its biggest model family of 2026 on June 26: GPT-5.6 Sol, Terra, and Luna, named after celestial bodies for the first time — Sol (flagship), Terra (balanced), and Luna (lightweight). Sources: OpenAI official announcement, OpenAI Deployment Safety System Card, VentureBeat, SiliconAngle, TechTimes.
| Model | Tier | Input price | Output price | Highlight |
|---|---|---|---|---|
| GPT-5.6 Sol | Flagship | $5 / 1M tokens | $30 / 1M tokens | TerminalBench 2.1 #1 (91.9%) |
| GPT-5.6 Terra | Balanced | $2.50 / 1M tokens | $15 / 1M tokens | Near GPT-5.5 performance, 50% lower cost |
| GPT-5.6 Luna | Lightweight | $1 / 1M tokens | $6 / 1M tokens | High-volume tasks, 80% cheaper than Sol |
Current status: At U.S. government request, access is limited to roughly 20 vetted partner organizations. Broad availability expected within weeks. Polymarket assigns an 87% probability that GPT-5.6 will be broadly released by July 31, 2026.
Release was not smooth: A June 2 executive order required OpenAI to undergo government security review before broad release — the first time the U.S. government formally required an AI company to limit a frontier model launch.
Altman pushed back publicly: OpenAI complied with the review while stating this government approval process should not become the long-term industry default.
June "super launch month" stalled: OpenAI GPT-5.6, Anthropic Claude Fable 5 / Mythos 5 (forced offline June 12), and Google Gemini 3.5 Pro (delayed to July) all hit release gates at once.
All three hit High cybersecurity rating: GPT-5.6 is the first OpenAI product line where every tier — including entry-level Luna — triggered the "High" cybersecurity risk classification.
Context window: All three models report approximately 1.5M tokens, up roughly 50% from GPT-5.5's 1M.
"We don't believe this kind of government access process should become the long-term default. It keeps the best tools from users, developers, enterprises, cyber defenders, and global partners who need them." — Sam Altman, OpenAI CEO, June 2026
GPT-5.6 Sol, Terra, Luna explained: pricing, reasoning modes, and use cases
GPT-5.6 Sol — flagship model: OpenAI's most capable model to date, built for hard coding, long-horizon cybersecurity research, and multi-step agentic workflows. Two new reasoning modes:
| Reasoning mode | Mechanism | Best for |
|---|---|---|
| Max mode | Extra reasoning time before responding; trades latency for accuracy | Scenarios where correctness matters more than speed |
| Ultra mode | Multi-agent collaboration — splits tasks, runs parallel subagents, merges output | Complex coding and agent tasks (core reason for TerminalBench 91.9%) |
GPT-5.6 Terra — balanced model: The daily workhorse for high-volume business tasks — customer support, internal tools, document analysis. Performance near GPT-5.5 at 50% lower cost. Best value for production deployments at scale.
GPT-5.6 Luna — lightweight model: Optimized for high-frequency, low-latency tasks like summarization, drafting, and routine automation. Luna is the first non-flagship OpenAI model to receive a High rating in both cybersecurity and biology.
| Your need | Recommended model |
|---|---|
| Complex code generation, debugging, multi-step agent tasks | Sol |
| Enterprise document analysis, support, high-volume API calls | Terra |
| Summarization, drafting, routine automation | Luna |
| GPT-5.5-level capability on a tighter budget | Terra (same tier performance, 50% lower cost) |
| Latency-critical real-time apps (from July) | Sol on Cerebras |
GPT-5.6 benchmark results: TerminalBench, CTF, ExploitBench, and life sciences
Coding: TerminalBench 2.1 — 89 complex command-line planning challenges testing multi-step tool use, iterative repair, and task coordination.
| Model | Score | Mode |
|---|---|---|
| GPT-5.6 Sol | 91.9% New #1 | Ultra (multi-agent) |
| GPT-5.6 Sol | 88.8% | Standard |
| Claude Mythos 5 | 88.0% | Standard |
| GPT-5.5 | 83.4% | Standard |
| Gemini 3.1 Pro Preview | 70.7% | Standard |
Sol dethroned Claude Mythos 5 after just 17 days at the top — Mythos 5 had held #1 since June 9.
Long-horizon agents: Agent's Last Exam
| Model | Task completion rate (code mode) |
|---|---|
| GPT-5.6 Sol | 50.9% — only model to cross 50% |
| GPT-5.6 Luna | Slightly above GPT-5.5 |
Cybersecurity: CTF & ExploitBench
| Model | CTF hit rate |
|---|---|
| Sol | 96.7% |
| Terra | 91.84% |
| Luna | 85.19% |
ExploitBench: Sol matches Anthropic Mythos Preview on vulnerability research while using only about one-third of the output tokens — the same security research capability at dramatically lower cost.
Safety note: OpenAI red-teaming confirmed Sol can identify vulnerabilities and exploit primitives in Chromium and Firefox codebases, but cannot autonomously construct complete functional exploit chains — staying below the "Cyber Critical" threshold. All three models include real-time misuse classifiers, account-level review, 700,000 A100-equivalent GPU hours of automated red-teaming, and universal jailbreak testing.
Life sciences: On GeneBench v1 (genomics and quantitative biology), Sol matches or exceeds GPT-5.5 using fewer tokens. On HealthBench Professional, Sol scores 60.5 — +8.7 points above GPT-5.5.
Six-step guide to GPT-5.6 access and model selection: from limited preview to July GA
Know the current stage (June 2026): Only roughly 20 government-approved trusted partners can access via API and Codex. General ChatGPT users cannot use GPT-5.6 yet.
Watch the July window: ChatGPT general availability (Plus/Pro first), public API access, and Cerebras-accelerated Sol for enterprise customers at up to 750 token/s.
Coding agents: pick Sol: Enable Ultra mode for complex multi-step tasks. Token consumption is significantly higher than standard mode.
High-volume business: pick Terra: GPT-5.5-level performance at 50% lower cost — ideal for support, document analysis, and production API workloads.
Lightweight automation: pick Luna: $1/$6 pricing with a High cybersecurity rating — competitive for millions of daily API calls.
Maintain multi-vendor fallback: Per the Fable 5 outage, manage model IDs via environment variables — never hardcode a single slug. For local agent testing, deploy an OpenClaw cloud Mac node.
import os
PRIMARY = os.environ.get("AI_MODEL", "gpt-5.6-terra")
FALLBACKS = ["gpt-5.6-sol", "gpt-5.5", "claude-opus-4-8"]
ULTRA_TASKS = os.environ.get("ENABLE_ULTRA", "false") == "true"Cerebras 750 token/s, government review fallout, and GPT-5.6 vs Claude Mythos 5
Speed breakthrough: Starting July 2026, GPT-5.6 Sol deploys on Cerebras hardware at up to 750 tokens per second. Most frontier models today output 50–150 tokens/s — response times could drop to one-fifth or one-fifteenth of current latency.
| Dimension | GPT-5.6 Sol | Claude Mythos 5 |
|---|---|---|
| TerminalBench 2.1 (coding) | 91.9% (Ultra) / 88.8% | 88.0% |
| ExploitBench (cybersecurity) | Near-identical, ~1/3 the tokens | Data not public |
| Availability | Limited preview, GA within weeks | Offline (export control) |
| Input price | $5 / 1M | Was $10 / 1M (currently offline) |
| Context window | ~1.5M tokens | 200K tokens |
Capability breakthrough: Sol Ultra multi-agent mode tops the global coding leaderboard, dethroning Claude Mythos 5 in 17 days.
Efficiency breakthrough: Comparable security research capability at one-third the token cost of competitors.
Speed breakthrough: Cerebras 750 token/s in July will reshape real-time AI application boundaries.
Bottom line: Sol beats Mythos 5 on TerminalBench and offers comparable security research at half the input price. Fable 5 still leads on SWE-Bench Pro and other dimensions — full System Card data pending.
Alternative hosting paths: running Codex / agent workflows on a personal MacBook drops when the lid closes; a pure Linux VPS cannot run Xcode or native macOS toolchains; co-locating multiple agents on underpowered hardware causes swap thrash. Regardless of when GPT-5.6 goes GA, a 7×24 online macOS host for OpenClaw, Hermes Agent, and iOS CI pipelines — for production, a KVMNODE dedicated Mac Mini M4 / M4 Pro is usually the better choice. See the pricing page, order entry, and help center.