OpenMythos: The Open-Source Reverse-Engineering of Claude's Recurrent-Depth Architecture

By Prahlad Menon

Last week Anthropic shipped Claude Mythos. No architecture paper. No technical report. Just a new model that feels qualitatively different — especially on complex multi-step reasoning — with no public explanation of why.

OpenMythos is the community’s attempt to figure it out.

Built by Kye Gomez, it’s a theoretical open-source reconstruction of the suspected Claude Mythos architecture: a Recurrent-Depth Transformer with looped computation, switchable MLA/GQA attention, and sparse Mixture-of-Experts feed-forward layers. Not affiliated with Anthropic. Not actual weights. But a serious implementation of the architecture that the available research suggests.

pip install open-mythos

The Core Hypothesis: Looped Transformers

Standard transformers run each layer exactly once, sequentially. If you have 96 layers, every token goes through all 96, once, and that’s your forward pass.

A Recurrent-Depth Transformer (RDT) — also called a Looped Transformer — does something different. It divides its computation into three stages:

Input

[Prelude] — standard transformer layers, run once

[Recurrent Block] — looped T times (same weights, T iterations)
  ↑___________↓  (hidden state h updated each loop)

[Coda] — standard transformer layers, run once

Output

The Recurrent Block runs the same weights T times, updating the hidden state each loop according to:

h_{t+1} = A·h_t + B·e + Transformer(h_t, e)

Where h_t is the current hidden state, e is the encoded input injected at every step, and A and B are learned injection parameters. The continuous injection of e is crucial — it keeps the original signal alive and prevents the model from drifting during deep recurrence.

The key insight: running the same parameters for more loops buys more reasoning depth at zero parameter cost. And the loop count can be varied at inference time: dial it up for hard problems, dial it down for simple ones.
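The update rule can be sketched in PyTorch. Everything below is illustrative, not the actual OpenMythos internals: A and B are modeled as learned diagonal gates (they could equally be full matrices or scalars), the hidden state starts at zero, and a small MLP stands in for the shared-weight transformer block.

```python
import torch
import torch.nn as nn

class RecurrentBlock(nn.Module):
    """Illustrative sketch of the looped update h_{t+1} = A·h_t + B·e + f(h_t, e)."""
    def __init__(self, dim: int):
        super().__init__()
        # A and B as learned diagonal gates; A starts inside the unit circle
        # so the recurrence is stable (one plausible choice, not the repo's)
        self.A = nn.Parameter(torch.full((dim,), 0.9))
        self.B = nn.Parameter(torch.full((dim,), 0.1))
        # stand-in for the shared-weight transformer block f(h, e)
        self.f = nn.Sequential(nn.Linear(2 * dim, dim), nn.GELU(), nn.Linear(dim, dim))

    def forward(self, e: torch.Tensor, n_loops: int) -> torch.Tensor:
        h = torch.zeros_like(e)              # initial hidden state
        for _ in range(n_loops):             # same weights reused every iteration
            h = self.A * h + self.B * e + self.f(torch.cat([h, e], dim=-1))
        return h

e = torch.randn(2, 16, 256)                  # (batch, seq, dim) from the prelude
h = RecurrentBlock(256)(e, n_loops=4)
print(h.shape)  # torch.Size([2, 16, 256])
```

Note that `n_loops` is just a Python loop bound here, which is exactly why it can be changed freely at inference time: no weights depend on it.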

Why This Architecture Produces Different Behavior

OpenMythos documents something called the three-stage grokking process that emerges from looped transformers:

  1. Memorization — model fits the training distribution
  2. In-distribution generalization — model handles known compositions
  3. Systematic generalization — model abruptly handles novel compositions OOD

That third stage, systematic generalization, is what vanilla transformers typically fail at. Train on 5-hop reasoning chains and test on 10-hop, and a standard transformer breaks down; a looped transformer, given enough iterations, can still succeed. The capability doesn't emerge gradually; it phase-transitions in.

This would explain why Mythos feels qualitatively different on novel, complex questions — not just quantitatively better on benchmarks, but structurally more capable of combining ideas it hasn’t seen combined before.

Importantly: this is not chain-of-thought. There’s no intermediate token output. All reasoning happens silently inside a single forward pass, in continuous latent space.

The Architecture Components

Prelude — Standard transformer layers that encode the input once. Output becomes e, the injection signal.

Recurrent Block — The looped computation core. Attention is switchable:

  • mla — Multi-head Latent Attention (DeepSeek-style, low-rank KV decomposition, smaller KV cache)
  • gqa — Grouped-Query Attention (standard multi-group sharing)
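The KV-sharing idea behind GQA is easy to see in a shapes-only sketch (no projections, caching, or masking; assumes PyTorch ≥ 2.0 for `scaled_dot_product_attention`): eight query heads share two KV heads, so the KV cache is 4× smaller.

```python
import torch
import torch.nn.functional as F

# Grouped-Query Attention sketch: 8 query heads share 2 KV heads (group size 4)
B, T, n_heads, n_kv_heads, head_dim = 2, 16, 8, 2, 32

q = torch.randn(B, n_heads, T, head_dim)
k = torch.randn(B, n_kv_heads, T, head_dim)   # only 2 KV heads are cached...
v = torch.randn(B, n_kv_heads, T, head_dim)

# ...then each KV head is expanded to serve its group of query heads
group = n_heads // n_kv_heads
k = k.repeat_interleave(group, dim=1)          # (B, 8, T, head_dim)
v = v.repeat_interleave(group, dim=1)

out = F.scaled_dot_product_attention(q, k, v)
print(out.shape)  # torch.Size([2, 8, 16, 32])
```

MLA goes further by caching a single low-rank latent per token and reconstructing K and V from it, which is harder to show in a few lines but shrinks the cache even more.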

Feed-forward uses sparse MoE:

  • n_experts — total expert pool
  • n_shared_experts — always-active experts (DeepSeek MoE pattern)
  • n_experts_per_tok — experts activated per token by the router

Coda — Standard transformer layers that project from the final hidden state to logits.
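The shared-plus-routed expert pattern can be sketched as follows. All names here are illustrative, not OpenMythos internals, and for clarity this version computes every expert densely before selecting; a real implementation only runs the experts each token is routed to.

```python
import torch
import torch.nn as nn

class SparseMoE(nn.Module):
    """Sketch of shared + routed experts (DeepSeek MoE pattern); illustrative only."""
    def __init__(self, dim=256, expert_dim=64, n_experts=8,
                 n_shared_experts=1, n_experts_per_tok=2):
        super().__init__()
        mlp = lambda: nn.Sequential(nn.Linear(dim, expert_dim), nn.GELU(),
                                    nn.Linear(expert_dim, dim))
        self.shared = nn.ModuleList(mlp() for _ in range(n_shared_experts))   # always active
        self.experts = nn.ModuleList(mlp() for _ in range(n_experts))         # routed pool
        self.router = nn.Linear(dim, n_experts)
        self.top_k = n_experts_per_tok

    def forward(self, x):                                  # x: (B, T, dim)
        out = sum(e(x) for e in self.shared)               # shared experts, every token
        weights = self.router(x).softmax(dim=-1)           # (B, T, n_experts)
        topw, topi = weights.topk(self.top_k, dim=-1)      # per-token top-k choice
        all_out = torch.stack([e(x) for e in self.experts], dim=-2)   # (B, T, E, dim)
        idx = topi.unsqueeze(-1).expand(*topi.shape, x.size(-1))      # (B, T, k, dim)
        routed = torch.gather(all_out, -2, idx)            # the k selected experts
        return out + (topw.unsqueeze(-1) * routed).sum(dim=-2)

y = SparseMoE()(torch.randn(2, 16, 256))
print(y.shape)  # torch.Size([2, 16, 256])
```

With the article's config (n_experts=8, n_shared_experts=1, n_experts_per_tok=2), each token activates 3 of 9 expert MLPs per layer.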

Running It

import torch
from open_mythos.main import OpenMythos, MythosConfig

cfg = MythosConfig(
    vocab_size=1000,
    dim=256,
    n_heads=8,
    max_seq_len=128,
    max_loop_iters=4,
    prelude_layers=1,
    coda_layers=1,
    n_experts=8,
    n_shared_experts=1,
    n_experts_per_tok=2,
    expert_dim=64,
    lora_rank=8,
    attn_type="mla",   # or "gqa"
    n_kv_heads=8,
    kv_lora_rank=32,
    q_lora_rank=64,
    qk_rope_head_dim=16,
    qk_nope_head_dim=16,
    v_head_dim=16,
)

model = OpenMythos(cfg)

ids = torch.randint(0, cfg.vocab_size, (2, 16))

# Vary loops at inference time — more loops = more compute = deeper reasoning
logits = model(ids, n_loops=4)
out = model.generate(ids, max_new_tokens=8, n_loops=8)

# Verify stability: the spectral radius of A (largest eigenvalue magnitude,
# not the largest entry) must be < 1 or the recurrence blows up over many loops
A = model.recurrent.injection.get_A()
rho = torch.linalg.eigvals(A).abs().max().item() if A.dim() == 2 else A.abs().max().item()
print(f"Spectral radius ρ(A): {rho:.4f}")  # must be < 1 for stability

The n_loops parameter at inference time is the compute knob. Routine queries: low loops. Hard math: high loops. Same model weights, variable depth.

What This Is (and Isn’t)

This is a research implementation — a codebase for studying Recurrent-Depth Transformer architectures. It’s not actual Claude Mythos weights. Anthropic hasn’t released those. OpenMythos is informed speculation based on:

  • The Recurrent-Depth Transformer literature (Geiping et al., 2025)
  • DeepSeek MLA attention papers
  • MoE architecture research
  • Observed behavioral characteristics of Claude Mythos

The value is in the implementation itself — a clean, runnable, inspectable codebase that lets researchers experiment with looped computation, compute-adaptive depth, and sparse MoE at any scale.

If you’re interested in what frontier architectures might look like under the hood, or want to experiment with depth-variable reasoning without training a 100B parameter model, this is worth pulling.

Resources