AgentWASP: The Self-Hosted Agent Runtime Built for Operators (vs OpenClaw & Hermes)

By Prahlad Menon Published 2026-05-31 5 min read

A new entrant just raised the bar for self-hosted autonomous agents. AgentWASP is a Docker-based runtime built around a single premise: agents that can actually be trusted to run unattended.

The project ships with 10+ persistent memory tiers, a truth-binding response layer, capability tiers per skill, and 41 background jobs running 24/7. It’s Apache 2.0 licensed and installs with one line.

What Makes WASP Different

Three architectural commitments separate WASP from other self-hosted agent harnesses:

1. Truth-Binding Response Layer

The biggest problem with autonomous agents? They confidently report actions they never took. WASP addresses this with five deterministic guards:

URL substitution — ensures links match actual fetched content
Action announcer — logs every action before execution
Response grounder — validates outputs against actions actually taken
Schedule honesty — prevents false claims about scheduled tasks
Prompt-leak redaction — blocks system prompt exfiltration

The key design choice: no LLM in the policy path. These are deterministic guards, not AI-powered validators that can be fooled.

2. Layered Persistent Memory (28 PostgreSQL Tables)

WASP doesn’t just have “memory” — it has a full cognitive architecture across 10+ named tiers:

Layer	Purpose
Episodic	Full conversation history with timestamps
Semantic	Distilled facts and preferences
Procedural	Multi-step solutions as named procedures
Knowledge Graph	Entity + relation extraction per message
Behavioral	Rules learned from user corrections
Temporal	World timeline with trend detection
Vector	Dense embeddings (cosine similarity)
Self-Model	Skill success rates, known failures
Working	Active context for current task
Goal-Scoped	Per-goal memory isolation

All memory persists across sessions, reboots, and model switches. This is the level of memory architecture you’d expect from cognitive science research, not a side project.

3. Capability Tiers + Anticipatory Simulation

Every skill has an explicit privilege level:

PUBLIC — safe operations anyone can trigger
CONTROLLED — standard operations with logging
RESTRICTED — sensitive ops requiring confirmation
PRIVILEGED — dangerous actions with simulation preview

For privileged operations, WASP runs anticipatory simulation — the LLM previews consequences before execution, with a 5-minute cache. Combined with a plan critic that validates every TaskGraph before execution, you get actual guardrails rather than “please don’t do bad things” prompt engineering.

The Comparison Table

WASP publishes a detailed comparison against Hermes Agent and OpenClaw. Here’s what stands out:

Capability	WASP	Hermes Agent	OpenClaw
Memory tiers	10+ named (28 Postgres tables)	Agent-curated + FTS5	Workspace-scoped
Knowledge graph	Built-in	Not documented	Not documented
Background jobs	41 pre-built	Cron + background_review	Cron + nodes
Self-improvement	Patch-based with persistence	Autonomous skill creation	Skill installation
Capability tiers	4 explicit levels	Command-pattern approval	Per-tool allow/deny
Anticipatory simulation	Yes (5-min cache)	Not documented	Not documented
Plan critic	LLM validates TaskGraphs	Not documented	Not documented
Channels	12 platforms	6 platforms	22+ platforms
LLM providers	5 native + Ollama	300+ via Nous Portal	Multiple with failover

Installation

One-liner that handles everything:

sudo bash -c "$(curl -fsSL https://agentwasp.com/install.sh)"

This detects your distro (8 Linux flavors + macOS), installs Docker if missing, pulls the source, generates secrets, and walks you through onboarding. Default install path: /opt/wasp.

For the security-conscious:

curl -fsSL https://agentwasp.com/install.sh -o install.sh
curl -fsSL https://agentwasp.com/install.sh.sha256 | sha256sum -c -
sudo bash install.sh

Requirements: 4GB RAM minimum (8GB recommended), 10GB disk, and at least one LLM provider key (or Ollama for fully self-hosted).

The 41 Background Jobs

WASP runs 41 scheduled jobs 24/7:

Memory consolidation — compresses and indexes memories across tiers
Perception — processes incoming sensory data from integrations
Autonomous goals — generates and pursues long-running objectives
Self-integrity monitor — validates system consistency
CPI monitor — tracks cognitive pressure index
Behavioral learner — extracts rules from user corrections

All jobs persist state and catch up on restart.

Skills and Integrations

37 built-in skills covering:

Web search and browser automation (nodriver + Selenium)
Email (Gmail with allowlist)
Scraping and RSS
Scheduling and reminders
File operations
Python execution (sandboxed)
Self-improvement

40+ integrations including:

Smart home (Home Assistant, Philips Hue, Eight Sleep)
Automation (Zapier, webhooks, cron)
Communication (Telegram, Discord, Slack, Teams, WhatsApp, Signal, Matrix, iMessage, and more)
Developer tooling (MCP)

Controlled Self-Improvement

WASP can modify its own source code. But unlike unrestricted self-modification, it uses:

AST validation — blocks subprocess, eval, ctypes, pickle, importlib
Sandbox execution — tests changes before applying
Patch persistence — changes survive container rebuilds
Reversibility — all patches can be rolled back

The default limit is 5 self-improvement operations per day.

Dashboard

151 HTTP endpoints power the dashboard across:

Chat interface
Traces and debugging
Scheduler management
Memory exploration
Knowledge graph visualization
World model timeline
Agent management
Integration configuration
Audit log

Dashboard binds to 127.0.0.1:8080 (loopback only) by default — use SSH tunnel or reverse proxy for remote access.

Security Model

WASP takes a fail-closed approach:

Telegram bridge refuses to start without a user allowlist — no public bot mode
Gmail requires per-address or domain allowlist — prevents prompt-injection exfiltration
SSRF prevention with DNS rebinding protection and manual redirect re-validation
Path traversal blocked via realpath containment
Argon2 hashing with Redis sessions (24h TTL) and 5-attempt lockout
CSRF protection with session-bound single-use tokens

When to Use WASP vs OpenClaw

Choose WASP if:

You need an agent that runs unattended 24/7
Truth-binding and audit trails are requirements
You want structured memory architecture out of the box
Self-improvement with safety rails matters

Choose OpenClaw if:

You want the largest skill ecosystem (5,400+)
You need 22+ communication channels
Community and ecosystem size matter more than built-in memory depth
You prefer Node.js over Python

Choose Hermes Agent if:

You want 300+ LLM providers via Nous Portal
You need multiple runtime isolation backends (Docker, SSH, Modal, Vercel Sandbox)
Isolated subagent spawn for parallel workstreams is critical

The Bottom Line

WASP is what happens when someone builds an agent runtime with operational reliability as the primary design goal rather than feature count. The truth-binding layer, capability tiers with anticipatory simulation, and 10+ memory tiers create a foundation for agents you can actually trust to run while you sleep.

The tradeoff is ecosystem size — WASP has 37 skills vs OpenClaw’s 5,400+. But for operators who need reliability over breadth, WASP fills a gap that’s been missing in the self-hosted agent space.

GitHub: github.com/agentwasp/agentwasp
Docs: docs.agentwasp.com
License: Apache 2.0