AgentWASP: The Self-Hosted Agent Runtime Built for Operators (vs OpenClaw & Hermes)

By Prahlad Menon 5 min read

A new entrant just raised the bar for self-hosted autonomous agents. AgentWASP is a Docker-based runtime built around a single premise: agents that can actually be trusted to run unattended.

The project ships with 10+ persistent memory tiers, a truth-binding response layer, capability tiers per skill, and 41 background jobs running 24/7. It’s Apache 2.0 licensed and installs with one line.

What Makes WASP Different

Three architectural commitments separate WASP from other self-hosted agent harnesses:

1. Truth-Binding Response Layer

The biggest problem with autonomous agents? They confidently report actions they never took. WASP addresses this with five deterministic guards:

  • URL substitution — ensures links match actual fetched content
  • Action announcer — logs every action before execution
  • Response grounder — validates outputs against actions actually taken
  • Schedule honesty — prevents false claims about scheduled tasks
  • Prompt-leak redaction — blocks system prompt exfiltration

The key design choice: no LLM in the policy path. These are deterministic guards, not AI-powered validators that can be fooled.

2. Layered Persistent Memory (28 PostgreSQL Tables)

WASP doesn’t just have “memory” — it has a full cognitive architecture across 10+ named tiers:

LayerPurpose
EpisodicFull conversation history with timestamps
SemanticDistilled facts and preferences
ProceduralMulti-step solutions as named procedures
Knowledge GraphEntity + relation extraction per message
BehavioralRules learned from user corrections
TemporalWorld timeline with trend detection
VectorDense embeddings (cosine similarity)
Self-ModelSkill success rates, known failures
WorkingActive context for current task
Goal-ScopedPer-goal memory isolation

All memory persists across sessions, reboots, and model switches. This is the level of memory architecture you’d expect from cognitive science research, not a side project.

3. Capability Tiers + Anticipatory Simulation

Every skill has an explicit privilege level:

  • PUBLIC — safe operations anyone can trigger
  • CONTROLLED — standard operations with logging
  • RESTRICTED — sensitive ops requiring confirmation
  • PRIVILEGED — dangerous actions with simulation preview

For privileged operations, WASP runs anticipatory simulation — the LLM previews consequences before execution, with a 5-minute cache. Combined with a plan critic that validates every TaskGraph before execution, you get actual guardrails rather than “please don’t do bad things” prompt engineering.

The Comparison Table

WASP publishes a detailed comparison against Hermes Agent and OpenClaw. Here’s what stands out:

CapabilityWASPHermes AgentOpenClaw
Memory tiers10+ named (28 Postgres tables)Agent-curated + FTS5Workspace-scoped
Knowledge graphBuilt-inNot documentedNot documented
Background jobs41 pre-builtCron + background_reviewCron + nodes
Self-improvementPatch-based with persistenceAutonomous skill creationSkill installation
Capability tiers4 explicit levelsCommand-pattern approvalPer-tool allow/deny
Anticipatory simulationYes (5-min cache)Not documentedNot documented
Plan criticLLM validates TaskGraphsNot documentedNot documented
Channels12 platforms6 platforms22+ platforms
LLM providers5 native + Ollama300+ via Nous PortalMultiple with failover

Installation

One-liner that handles everything:

sudo bash -c "$(curl -fsSL https://agentwasp.com/install.sh)"

This detects your distro (8 Linux flavors + macOS), installs Docker if missing, pulls the source, generates secrets, and walks you through onboarding. Default install path: /opt/wasp.

For the security-conscious:

curl -fsSL https://agentwasp.com/install.sh -o install.sh
curl -fsSL https://agentwasp.com/install.sh.sha256 | sha256sum -c -
sudo bash install.sh

Requirements: 4GB RAM minimum (8GB recommended), 10GB disk, and at least one LLM provider key (or Ollama for fully self-hosted).

The 41 Background Jobs

WASP runs 41 scheduled jobs 24/7:

  • Memory consolidation — compresses and indexes memories across tiers
  • Perception — processes incoming sensory data from integrations
  • Autonomous goals — generates and pursues long-running objectives
  • Self-integrity monitor — validates system consistency
  • CPI monitor — tracks cognitive pressure index
  • Behavioral learner — extracts rules from user corrections

All jobs persist state and catch up on restart.

Skills and Integrations

37 built-in skills covering:

  • Web search and browser automation (nodriver + Selenium)
  • Email (Gmail with allowlist)
  • Scraping and RSS
  • Scheduling and reminders
  • File operations
  • Python execution (sandboxed)
  • Self-improvement

40+ integrations including:

  • Smart home (Home Assistant, Philips Hue, Eight Sleep)
  • Automation (Zapier, webhooks, cron)
  • Communication (Telegram, Discord, Slack, Teams, WhatsApp, Signal, Matrix, iMessage, and more)
  • Developer tooling (MCP)

Controlled Self-Improvement

WASP can modify its own source code. But unlike unrestricted self-modification, it uses:

  1. AST validation — blocks subprocess, eval, ctypes, pickle, importlib
  2. Sandbox execution — tests changes before applying
  3. Patch persistence — changes survive container rebuilds
  4. Reversibility — all patches can be rolled back

The default limit is 5 self-improvement operations per day.

Dashboard

151 HTTP endpoints power the dashboard across:

  • Chat interface
  • Traces and debugging
  • Scheduler management
  • Memory exploration
  • Knowledge graph visualization
  • World model timeline
  • Agent management
  • Integration configuration
  • Audit log

Dashboard binds to 127.0.0.1:8080 (loopback only) by default — use SSH tunnel or reverse proxy for remote access.

Security Model

WASP takes a fail-closed approach:

  • Telegram bridge refuses to start without a user allowlist — no public bot mode
  • Gmail requires per-address or domain allowlist — prevents prompt-injection exfiltration
  • SSRF prevention with DNS rebinding protection and manual redirect re-validation
  • Path traversal blocked via realpath containment
  • Argon2 hashing with Redis sessions (24h TTL) and 5-attempt lockout
  • CSRF protection with session-bound single-use tokens

When to Use WASP vs OpenClaw

Choose WASP if:

  • You need an agent that runs unattended 24/7
  • Truth-binding and audit trails are requirements
  • You want structured memory architecture out of the box
  • Self-improvement with safety rails matters

Choose OpenClaw if:

  • You want the largest skill ecosystem (5,400+)
  • You need 22+ communication channels
  • Community and ecosystem size matter more than built-in memory depth
  • You prefer Node.js over Python

Choose Hermes Agent if:

  • You want 300+ LLM providers via Nous Portal
  • You need multiple runtime isolation backends (Docker, SSH, Modal, Vercel Sandbox)
  • Isolated subagent spawn for parallel workstreams is critical

The Bottom Line

WASP is what happens when someone builds an agent runtime with operational reliability as the primary design goal rather than feature count. The truth-binding layer, capability tiers with anticipatory simulation, and 10+ memory tiers create a foundation for agents you can actually trust to run while you sleep.

The tradeoff is ecosystem size — WASP has 37 skills vs OpenClaw’s 5,400+. But for operators who need reliability over breadth, WASP fills a gap that’s been missing in the self-hosted agent space.

GitHub: github.com/agentwasp/agentwasp
Docs: docs.agentwasp.com
License: Apache 2.0