Hermes Agent

AI Agents Developer Tools Open Source AI

27 min read

Updated Jun 9, 2026

Suggest edit History

RawGraph

Last reviewed

Jun 9, 2026

Sources

No citations yet

Review status

Needs citations

Revision

v3 · 5,411 words

Hermes Agent
Developer	Nous Research
Repository	NousResearch/hermes-agent
Initial repository creation	July 22, 2025
First tagged public release	March 12, 2026 (`v0.2.0`)
License	MIT
Primary implementation	Python, with TypeScript and shell components
Minimum Python version	3.11
Supported host environments	Linux, macOS, WSL2, Android via Termux, native Windows (early beta)
Interfaces	CLI, TUI, messaging gateway, ACP integration, MCP server, REST API
Latest release (May 7, 2026)	v0.13.0 ("The Tenacity Release")
Website	https://hermes-agent.nousresearch.com/

Hermes Agent is an open-source AI agent platform developed by Nous Research. Official project materials describe it as a long-running agent runtime that combines a terminal interface, tool use, persistent memory, scheduled jobs, browser automation, and messaging integrations in one system. The project is MIT licensed, requires Python 3.11 or newer, and is distributed primarily through a one-line installer, a pip package, and a public GitHub repository.^[1]^[2]^[5]^[6]^[18]

Although it is built by the same lab that publishes the Hermes family of language models, Hermes Agent is not itself a standalone large language model. The documentation presents it as an orchestration layer that can run on multiple providers and endpoints, including Nous Portal, OpenRouter, OpenAI, Anthropic, AWS Bedrock, Hugging Face, DeepSeek Direct, xAI, GitHub Copilot, Gemini, and a growing list of regional providers. It can also expose its own messaging capabilities to outside clients through Model Context Protocol support.^[2]^[6]^[15]^[17]

The project's defining tagline is "the agent that grows with you," a reference to its built-in learning loop that creates reusable skills from successful task completions, refines them during use, and persists user-specific knowledge across sessions. This positioning distinguishes Hermes Agent from coding-focused assistants and stateless chatbot wrappers; it is designed to operate as a continuously running personal agent that accumulates capability over time rather than starting from zero with each conversation.^[1]^[18]^[19]

History

GitHub metadata shows that the NousResearch/hermes-agent repository was created on July 22, 2025.^[3] The first public release on GitHub is tagged v2026.3.12, labeled "Hermes Agent v0.2.0," published on March 12, 2026. That initial release notes 63 community contributors, 216 merged pull requests, and 119 resolved issues since the prior v0.1.0 foundation, indicating that substantial private development preceded the public debut.^[4]^[19]

Industry coverage and a contemporaneous Nous Research announcement place the broader public launch of Hermes Agent on February 25, 2026.^[20]^[21] By mid-April the project had passed 95,600 GitHub stars and by early May exceeded 152,000 stars, making it one of the fastest-growing agent repositories of 2026.^[3]^[22]

In May 2026, OpenRouter tracking reportedly placed Hermes Agent at the top of its global daily app and agent rankings, generating roughly 224 billion daily tokens versus 186 billion for the previously dominant OpenClaw project. Several outlets framed the milestone as the moment a community-built, self-improving agent overtook the early local-first AI assistant that had defined the category in late 2025 and early 2026.^[20]^[22]

Release timeline

Tag	Public name	Date	Notable additions
`v2026.3.12`	v0.2.0	March 12, 2026	First tagged public release; 70+ bundled skills; native MCP client/server; multi-platform messaging; ACP editor integration; native Windows support.^[4]^[19]
`v2026.3.17`	v0.3.0	March 17, 2026	Maintenance and stability follow-up to v0.2.0.^[4]
`v2026.3.23`	v0.4.0	March 24, 2026	Continued documentation and provider work.^[4]
`v2026.3.28`	v0.5.0	March 28, 2026	Memory and gateway iterations.^[4]
`v2026.3.30`	v0.6.0	March 30, 2026	Tooling refinements.^[4]
`v2026.4.3`	v0.7.0	April 3, 2026	Provider and toolset expansion.^[4]
`v2026.4.8`	v0.8.0	April 8, 2026	Voice mode and messaging upgrades.^[4]
`v2026.4.13`	v0.9.0	April 13, 2026	Stability and skill ecosystem hardening.^[4]
`v2026.4.16`	v0.10.0	April 16, 2026	Nous Tool Gateway: bundled web search, image generation, TTS, and browser automation for paid Portal subscribers; 180+ commits.^[4]^[23]
`v2026.4.23`	v0.11.0	April 23, 2026	Provider and adapter additions.^[4]
`v2026.4.30`	v0.12.0	April 30, 2026	Autonomous Curator agent for skill libraries on a seven-day cycle; class-first grading rubric for the self-improvement loop; ComfyUI v5, TouchDesigner-MCP, and LM Studio promoted to first-class integrations; Spotify, Yuanbao, Microsoft Teams, and Google Meet adapters; Vercel Sandbox backend.^[4]^[24]
`v2026.5.7`	v0.13.0	May 7, 2026	"The Tenacity Release": multi-agent Kanban with durable task board and retry budgets; `/goal` command for cross-turn objectives; video analysis tool; xAI voice cloning; seven UI locales; Google Chat as 20th messaging platform; Checkpoints v2 state persistence; post-write delta linting.^[4]^[25]

Origins and developer

Hermes Agent is published by Nous Research, an open-source AI lab best known for its Hermes line of language models, the Nous Chat product, the Nous Portal API, the Psyche network, and a long-running emphasis on "open source language model capabilities." Nous Research describes its mission as advancing "human rights and freedoms by creating and proliferating open source language models, supporting their unrestricted availability and use, and furthering their scientific and popular understanding."^[26]

The project sits inside Nous Research's wider stack, which includes its Hermes 4 model family, the Nous Portal subscription, and the Psyche distributed training network. Hermes Agent is the lab's first major effort to ship a full agent runtime rather than a model checkpoint, and it is explicitly designed to interoperate with both Nous-hosted models and third-party providers.^[2]^[6]^[26]

Relationship to OpenClaw

Hermes Agent is widely characterized as the spiritual and technical successor to OpenClaw, the local-first AI assistant originally published by Austrian developer Peter Steinberger in November 2025 under the name Clawdbot. After an Anthropic trademark notice prompted renames to Moltbot and then OpenClaw, Steinberger announced in February 2026 that he was joining OpenAI as a product and engineering lead for personal agents, transferring OpenClaw to an independent foundation with OpenAI as a financial and technical sponsor.^[20]^[22]

Nous Research positioned Hermes Agent as a fully open, community-governed alternative within this transition. The project ships a built-in hermes claw migrate command that imports SOUL.md personality files, memories, skills, API keys, command allowlists, messaging settings, TTS assets, and workspace instructions from existing OpenClaw installations. The repository's topic tags on GitHub still include clawdbot, moltbot, and openclaw, reflecting the genealogy.^[3]^[18]

Independent comparisons published in April and May 2026 generally describe Hermes Agent as preserving OpenClaw's local-first, terminal-centric philosophy while adding longer-running orchestration, a self-improvement loop, and a broader provider matrix. Hermes also expands the runtime model from a developer laptop tool to one that can operate as a continuously running gateway across messaging platforms and remote sandboxes.^[20]^[22]^[27]

Architecture

The official architecture guide describes Hermes Agent as a platform with multiple entry points feeding a common AIAgent core defined in run_agent.py. The documented entry points include a classic CLI, a --tui modern terminal UI, a messaging gateway with around 20 platform adapters, ACP integration for editors such as VS Code, Zed, and JetBrains IDEs, a batch trajectory runner, an API server, and a Python library interface. Session state is stored in SQLite with FTS5 full-text search, while tool discovery is centralized in a registry-based tool system that holds more than 70 registered tools across roughly 28 toolsets.^[17]

Subsystem	Role in the documented architecture
`AIAgent` core	Handles prompt construction, provider resolution, tool execution, retries, context compression, prompt caching, and persistence.^[17]
CLI and TUI	Provide an interactive terminal-first interface with multiline editing, slash commands, resume support, mouse support in `--tui`, streaming tool output, and status bars showing model, tokens, and cost.^[7]^[17]
Messaging gateway	Routes messages from connected platforms through per-chat sessions, runs the cron scheduler every 60 seconds, and delivers responses back through adapters.^[8]^[17]
Session storage	Uses SQLite and FTS5 in `~/.hermes/state.db` for session history, lineage tracking across compressions, profile-scoped separation, and recall.^[9]^[17]
Tool registry	Self-registering tool modules organized into categories and toolsets, with separate execution backends for terminal, browser, web, memory, and orchestration tasks.^[11]^[17]
Provider resolver	A shared runtime resolver maps `(provider, model)` tuples to `(api_mode, api_key, base_url)` across 18+ providers, supporting OAuth flows and credential pools.^[6]^[17]
Integrations layer	Connects to external systems through ACP, MCP, provider APIs, and platform adapters.^[15]^[17]
Context compression	A dedicated module summarizes earlier conversation turns when context exceeds configured thresholds.^[17]
Prompt caching	Applies cache breakpoints for providers that support them, including Anthropic-style prefix caching.^[17]

The CLI documentation emphasizes that the command-line interface is a full terminal UI rather than a browser interface. It supports interactive chat, model selection via --model and --provider, toolset selection via --toolsets, resumable sessions through --continue or --resume, worktree-based parallel runs with -w, voice input shortcuts when voice mode is enabled, and a separate --tui mode for a more graphical terminal experience with modal overlays.^[7]

The self-improvement loop

The feature most often singled out in independent coverage of Hermes Agent is its closed learning loop. Whereas typical agent frameworks treat each task as an independent run, Hermes is engineered to convert successful task completions into reusable knowledge artifacts that future runs can load on demand. The loop has three documented components: persistent memory files, a session search layer, and an autonomous skills system.^[1]^[9]^[10]^[28]

The agentskills.io specification that Hermes follows treats a "skill" as a Markdown document with YAML frontmatter describing when the skill should be loaded and how it should be used. Skills can be authored manually, generated autonomously by the agent after complex workflows (typically defined as five or more tool calls or any non-trivial solution worth preserving), or downloaded from public registries.^[28]

In benchmarks published by Nous Research, agents with 20 or more self-created skills reportedly completed research tasks 40 percent faster than fresh instances starting with no prior skills, without any manual prompt engineering.^[21] Beginning in v0.12.0, an autonomous Curator agent maintains the skill library on a seven-day cycle, applying a class-first grading rubric to prune, merge, and rewrite skills that have not held up in subsequent use.^[24]

Persistent memory

Hermes Agent stores two distinct memory files under ~/.hermes/memories/. MEMORY.md captures environment facts, conventions, and lessons the agent has learned about a workflow, with a default 2,200 character (roughly 800 token) cap. USER.md captures user preferences, communication style, and personal expectations, with a default 1,375 character (roughly 500 token) cap. Both files are read once at session start and injected as a frozen snapshot into the system prompt to preserve prompt caching behavior across turns.^[9]

Updates the agent makes during a session persist to disk immediately but do not appear in the system prompt until the next session begins, a deliberate trade-off that keeps the prefix stable for caching. The agent manipulates memory through three operations exposed by the memory tool: add, replace, and remove. There is no read action because memory content is already in the conversation context.^[9]

Session recall

Past CLI and messaging sessions are stored in ~/.hermes/state.db and become searchable through the session_search tool, which uses SQLite FTS5 plus LLM summarization (the documentation references Gemini Flash) for cross-session recall. The session layer also tracks lineage across context compressions so that long-running threads can be resumed without losing their history.^[9]^[17]

Skills system

Skills live in ~/.hermes/skills/. Each SKILL.md file uses YAML frontmatter for fields such as name, description, version, optional platforms, metadata.hermes.tags, metadata.hermes.config, and required_environment_variables. The body of the file typically contains "When to Use," "Procedure," "Pitfalls," and "Verification" sections.^[28]

Loading uses a progressive disclosure pattern with three levels. Level 0 lists skill metadata through skills_list() and costs roughly three thousand tokens. Level 1 calls skill_view(name) to retrieve the full content of one skill. Level 2 uses skill_view(name, path) to pull specific reference files attached to a skill. The system tracks the SHA hash of bundled skills so that user modifications can be distinguished from upstream updates, and hermes skills reset can restore bundled defaults.^[28]

Skills can also be installed from external registries, including the official optional skills bundle, the skills.sh directory, well-known endpoints at /.well-known/skills/index.json, direct GitHub repositories, and ClawHub and LobeHub marketplaces. All hub skills undergo security scanning before installation.^[28]

External memory providers

The memory provider guide lists eight optional plugin backends that can run alongside the built-in memory rather than replacing it: Honcho, OpenViking, Mem0, Hindsight, Holographic, RetainDB, ByteRover, and Supermemory. Each provider adds capabilities such as semantic search, knowledge graphs, or dialectic user modeling on top of the built-in MEMORY.md and session search baseline.^[10]

Tool system

Hermes Agent's tool catalog covers web access, terminal and filesystem operations, browser automation, multimodal media, memory, orchestration, scheduling, and outbound messaging. The official documentation describes more than 70 tools across roughly 28 toolsets, with self-registration at import time and a registry that supports per-session toolset filtering.^[11]^[17]

Category	Representative tools
Web	`web_search`, `web_extract`
Terminal and files	`terminal`, `process`, `read_file`, `patch`, `search_files`
Browser automation	`browser_navigate`, `browser_snapshot`, `browser_vision`, `browser_console`
Media and multimodal	`vision_analyze`, `image_generate`, `text_to_speech`, video analysis (v0.13.0)
Orchestration	`todo`, `clarify`, `execute_code`, `delegate_task`, `skill_manage`
Memory and recall	`memory`, `session_search`, `honcho`
Automation and delivery	`cronjob` (create/list/update/pause/resume/run/remove), `send_message`
Integrations	`ha_` for Home Assistant, `mcp_` for MCP servers, `rl_*` for reinforcement learning trajectories, `spotify`, `discord`, `discord_admin`

Terminal backends

The terminal tool runs commands through one of seven backends. The choice is made at configuration time and can be switched per session.^[11]

Backend	Purpose	Typical use
`local`	Default, runs directly on the host	Personal development on a trusted machine
`docker`	Single persistent container per session with hardened defaults	Local sandboxing of agent commands
`ssh`	Tunnels into a remote host	Cloud VMs and air-gapped servers; prevents self-modification of the agent host
`singularity`	Rootless HPC containers	Academic and shared cluster environments
`modal`	Serverless cloud functions via Modal	Burst compute and ephemeral isolation
`daytona`	Persistent remote development workspaces	Long-running cloud sandboxes
`vercel_sandbox`	Cloud microVM with snapshot-backed filesystem	Always-on automation with low idle cost

Docker and Modal backends mount credential files read-only or sync them before each command. Both can selectively forward environment variables declared in skill frontmatter or config.yaml.^[11]^[16]

Browser automation

The browser guide documents three cloud backends and three local options. Cloud backends include Browserbase (with random fingerprints, viewport randomization, residential proxies, and CAPTCHA solving), Browser Use (deprioritized if Browserbase credentials are also present), and Firecrawl (cloud or self-hosted, with built-in scraping). Local options include Chrome through CDP via /browser connect, the Camofox anti-detection Firefox stack via Docker, and a local Chromium fallback through the agent-browser CLI.^[12]

Pages are rendered as accessibility trees, with interactive elements assigned reference IDs such as @e1 and @e2. The browser_vision tool falls back to a screenshot plus vision model when the accessibility tree is insufficient, which is especially useful for CAPTCHAs and visually complex layouts. JavaScript execution is exposed through browser_console. When cloud providers are configured, Hermes automatically routes private and LAN addresses to local Chromium while sending public URLs through the cloud backend.^[12]

Subagent delegation

The delegate_task tool spawns isolated child agents with their own sessions, terminal contexts, and toolsets. The default ceiling is three concurrent subagents, configurable through delegation.max_concurrent_children, with the system using a ThreadPoolExecutor to run them in parallel. Subagents receive only the goal and context parameters supplied by the parent; they have no access to parent conversation history, and only their final summary is returned to the parent context.^[13]

Certain tools are unavailable to leaf subagents: delegation itself (so they cannot recurse), clarify, memory, code_execution, and send_message. An opt-in orchestrator role allows subagents to delegate further, gated by max_spawn_depth, which defaults to 1 and is capped at 3. The default per-child timeout is 600 seconds, and a /agents overlay in the TUI shows a live tree view of running subagents with cost and token rollups. Delegation is synchronous: interrupting the parent cancels active children immediately.^[13]

Programmatic tool calling

The execute_code tool runs Python scripts that import from hermes_tools, with Hermes generating a stub module whose calls travel over a Unix domain socket RPC layer to the agent process. Only print() output returns to the LLM, which keeps intermediate results out of the conversation context. The documentation says this is the preferred path when a workflow requires three or more tool calls with processing logic between them, bulk filtering or conditional branching, or loops over results.^[14]

Available tools inside scripts include web_search, web_extract, read_file, write_file, search_files, patch, and foreground terminal calls. Default limits are 300 seconds of wall time, 50 kilobytes of standard output, and 50 tool calls per execution. Code execution is restricted to Linux and macOS because of its dependency on Unix domain sockets; on Windows the agent automatically falls back to sequential tool calls.^[14]

Messaging and automation

A defining feature of Hermes Agent is its messaging gateway, a single background process that connects to all configured platforms, manages per-chat sessions, runs the cron scheduler, and delivers voice messages when voice features are enabled. The cron loop ticks every 60 seconds. Background prompts run in isolated sessions and report results back to the originating chat when they finish.^[8]

Supported messaging surfaces

As of v0.13.0 the gateway documents at least twenty platform adapters and several auxiliary surfaces:^[8]^[25]

Platform	Notes
Telegram	Full toolset with media, voice transcription, and slash commands; allowlist via `TELEGRAM_ALLOWED_USERS`.
Discord	Full voice integration, admin commands, and DM pairing.
Slack	Full voice integration and threaded messaging.
Google Chat	Added as the 20th adapter in v0.13.0.
WhatsApp	Text plus media; no native transcription.
Signal	Text and media; no native transcription.
SMS via Twilio	Text only.
Email (IMAP/SMTP)	Long-form replies and file attachments.
Home Assistant	Smart home automation and event listening.
Mattermost, Matrix	Team-chat platforms with full voice support on Matrix.
DingTalk, Feishu/Lark, WeCom, WeCom Callback, Weixin, Yuanbao, QQ Bot	Enterprise and consumer surfaces popular in mainland China.
BlueBubbles	iMessage bridge.
Microsoft Teams, Teams Meetings	Added as a plugin-shipped adapter in v0.12.0.
LINE, SimpleX Chat	Messenger and privacy-focused platforms.
Open WebUI, Webhooks	Generic chat-style frontends and outbound HTTP callbacks.

Each platform uses a distinct toolset such as hermes-telegram or hermes-discord, all of which retain terminal access except for the API server toolset, which excludes clarify, send_message, and text_to_speech. Feature parity varies by platform: Microsoft Teams lacks file support, WeCom Callback lacks images, files, and threads, and SMS supports only text.^[8]

Slash commands and background tasks

The gateway exposes a consistent slash-command vocabulary across platforms. Core commands include /new or /reset for fresh conversations, /model [provider:model] to display or switch models, /voice [on|off|tts|join|leave|status] to manage voice features, /background <prompt> to launch isolated background runs, /status, /usage, and /insights [days] for monitoring, /approve and /deny for dangerous-command confirmation, and /goal (added in v0.13.0) for persistent cross-turn objectives.^[8]^[25]

Background tasks spawn separate agent instances with their own session history, inheriting the configured model and toolset but receiving no context from the originating chat. Results return to the same channel when the task completes. Administrators can partition users into admins (full command access) and regular users (restricted slash-command scope), and the partition can differ between direct messages and group chats.^[8]

Scheduled tasks

The cron system lets users schedule tasks in either natural language or standard cron syntax, attach one or more skills that should load before execution, and deliver results back to chats, files, or platform targets. Common uses described in the documentation include morning briefings, periodic backups, log digests, and unattended research jobs. A no_agent watchdog mode added in v0.13.0 lets the scheduler invoke deterministic actions without involving an LLM.^[8]^[15]^[25]

Interoperability

Hermes Agent is designed to work with external model providers and external tool servers.

Model providers

The quickstart guide documents provider setup flows for Nous Portal, OpenAI Codex, Anthropic Claude with OAuth or API keys, OpenRouter, AWS Bedrock, DeepSeek Direct, GitHub Copilot, Google Gemini, xAI, Hugging Face, NovitaAI, NVIDIA NIM (Nemotron), Xiaomi MiMo, z.ai/GLM, Kimi/Moonshot, MiniMax, LM Studio (promoted to first-class in v0.12.0), and arbitrary OpenAI-compatible endpoints such as vLLM, SGLang, or Ollama. The system requires a model with at least 64,000 tokens of context.^[6]^[24] The project pyproject.toml also lists optional extras for messaging, voice, MCP, ACP, Modal, Daytona, Bedrock, and other integrations.^[5]

The v0.10.0 release introduced the Nous Tool Gateway, which bundles web search via Firecrawl, image generation via FAL with FLUX 2 Pro, OpenAI-hosted text-to-speech, and browser automation via Browser Use into a single subscription-detected entitlement for paid Nous Portal customers, removing the need to manage separate API keys for those services.^[23]

Model Context Protocol

The MCP guide describes two related roles. As an MCP client, Hermes connects to local stdio servers (subprocesses communicating over stdin/stdout for low-latency local access) and remote HTTP servers (for hosted MCP endpoints), discovers their tools automatically, and exposes them to the agent with namespaced identifiers such as mcp_filesystem_read_file or mcp_github_create_issue. The system honors notifications/tools/list_changed for live capability updates.^[15]

As an MCP server, Hermes lets outside clients including Claude Code, Cursor, OpenAI Codex, and any other MCP-capable agent use Hermes-managed messaging channels, approval workflows, and other tools. Per-server include and exclude filters let operators whitelist or blacklist tool names and separately control resource and prompt wrappers.^[15]

Agent Communication Protocol

ACP integration lets editors such as VS Code, Zed, and JetBrains IDEs talk to Hermes Agent over JSON-RPC, allowing it to operate as the chat backend for editor extensions. ACP support was one of the headline additions in the v0.2.0 release.^[17]^[19]

Security model

The official security guide describes Hermes Agent as using a seven-layer defense-in-depth model. The documented layers cover user authorization, dangerous command approval, container isolation, MCP credential filtering, context-file scanning for prompt injection, cross-session isolation, and input sanitization for terminal backends.^[16]

Command approval modes

The command approval system supports three documented modes set through approvals.mode. manual (the default) always prompts before destructive operations. smart uses an auxiliary LLM to assess risk, auto-approving low-risk commands, auto-denying clearly dangerous ones, and escalating uncertain cases for human review. off disables all approval checks, the same posture as session-level YOLO mode.^[16]

YOLO mode is triggered with the --yolo CLI flag, the /yolo slash command, or the HERMES_YOLO_MODE=1 environment variable, and the /yolo toggle is per-session. A hardline blocklist remains active regardless of mode: even YOLO will not run rm -rf /, fork bombs, direct device writes, or other catastrophic operations the project considers a hard floor.^[16] Default triggers include recursive deletes, world-writable permissions, SQL DROP/DELETE/TRUNCATE, system configuration overwrites, service termination, shell piping such as curl | sh, and privilege escalation patterns.^[16]

Container hardening

Docker containers run with all Linux capabilities dropped except DAC_OVERRIDE, CHOWN, and FOWNER. They enforce no-new-privileges, a 256-process cap, and /tmp mounted with size limits and noexec. Default resource limits are 5 GB of memory and 50 GB of disk, both configurable. Inside the container, the dangerous-command checks are skipped because the container itself forms the security boundary.^[16]

Authorization and pairing

Gateway authorization is checked in a strict order: per-platform allow-all flag, DM pairing approved list, platform-specific allowlists, global allowlist, global allow-all, and default deny. The DM pairing system issues eight-character codes drawn from a 32-character unambiguous alphabet with a one-hour TTL, a one-per-ten-minute rate limit per requester, and a lockout after five failed attempts. Default behavior is to deny unknown users unless an allowlist or pairing approval is present.^[16]

Credentials and prompt injection

MCP subprocesses receive a filtered environment: only PATH, HOME, USER, LANG, LC_ALL, TERM, SHELL, TMPDIR, and XDG_* variables pass through unless explicitly overridden. Credential files such as OAuth tokens mount read-only into Docker containers or sync to Modal before each command. Error messages are scanned for GitHub personal access tokens, OpenAI-style keys, bearer tokens, and similar patterns before being returned to the LLM. Tirith integration detects homograph spoofing, pipe-to-interpreter patterns, and terminal injection attacks before execution, with auto-install and SHA-256 verification of the Tirith binary. Context files including AGENTS.md, .cursorrules, and SOUL.md are scanned for prompt injection attempts and invisible Unicode characters before inclusion in the system prompt.^[16] URL tools validate against RFC 1918 private networks, loopback, link-local, CGNAT space, cloud metadata hostnames, and reserved addresses, with redirect chains revalidated at each hop.^[16]

Positioning and comparisons

Official project materials consistently position Hermes Agent as a self-hostable, long-running personal agent rather than a chatbot wrapper or a coding-only copilot.^[1]^[2]^[18] Independent reviews emphasize the same point and contrast Hermes with two adjacent categories.^[20]^[22]^[27]

Design choice	What it implies
Multiple entry points, one core agent	Hermes runs in terminals, editors, background services, and external integrations rather than only in a chat window.^[17]
Built-in memory plus session search	The product is optimized for continuity across sessions rather than stateless prompt-response interaction.^[9]
Skills as procedural memory	Successful task runs accrete reusable capability, shifting performance from "static capability based on prompt quality" to "cumulative capability that grows with usage."^[21]^[28]
Built-in scheduler and gateway	Hermes is meant to continue operating after a single conversation ends and across many messaging surfaces simultaneously.^[8]^[15]
Multiple execution backends	The agent can run locally, in containers, on remote servers, or on cloud execution backends depending on the deployment model.^[11]
Provider-agnostic model setup	The system is designed to switch among many external model providers instead of depending on one fixed model family.^[6]
MIT licensing and open development	Skills, providers, and adapters can be authored by anyone, and the project ships migration tools to reduce switching costs from competing agents such as OpenClaw.^[3]^[18]

Versus coding copilots

Unlike single-purpose coding assistants such as GitHub Copilot or terminal-bound coding agents, Hermes is positioned as a general-purpose personal agent. While it can be used for code editing through ACP and terminal tools, the project frames itself as something that "talks to you from Telegram while it works on a cloud VM," with first-class support for non-coding tasks such as briefings, automations, scheduling, and home assistant integration.^[1]^[17]

Versus chatbot wrappers

Unlike products that wrap a single model behind a chat UI, Hermes brings its own runtime, tool registry, scheduler, gateway, and memory system. It is engineered to run for weeks or months on a $5 VPS or serverless infrastructure that costs "nearly nothing when idle," persisting across model swaps and provider changes through its abstraction layers.^[1]^[6]^[21]

Versus OpenClaw

As noted earlier, Hermes Agent inherits much of OpenClaw's local-first ethos and offers a direct migration path. Comparison pieces published in spring 2026 generally agree that Hermes extends the OpenClaw model with a stronger runtime, a richer skill ecosystem, broader messaging support, and a self-improvement loop that OpenClaw does not provide natively. By May 2026, public usage trackers showed Hermes generating more daily OpenRouter tokens than OpenClaw, a milestone widely cited as evidence that the community-led successor had eclipsed the original.^[20]^[22]^[27]

Use cases

Releases, marketing copy, and independent reviews describe a broad range of intended use cases.^[1]^[8]^[21]^[25]

Use case	How Hermes supports it
Personal automation	Natural-language cron tasks, scheduled briefings, periodic backups, log digests, and home assistant control.^[8]^[15]
Long-running research	Background sessions with isolated subagents, web extraction, browser automation, trajectory export, and persistent skills capturing methodology.^[11]^[12]^[13]
Cross-platform messaging	A single agent reachable from Telegram, Discord, Slack, WhatsApp, Signal, email, and more, with consistent slash commands and shared memory.^[8]
Coding and editor workflows	ACP integration with VS Code, Zed, and JetBrains, plus worktree-based parallel runs and post-write delta linting added in v0.13.0.^[17]^[19]^[25]
Voice-first interaction	Voice memo transcription on supported platforms, text-to-speech replies, and xAI voice cloning added in v0.13.0.^[8]^[25]
Multimodal projects	Image generation, vision analysis, video analysis (v0.13.0), and ComfyUI v5 integration for creative workflows.^[11]^[24]^[25]
Enterprise pilots	Multi-user gateway with allowlists, DM pairing, scoped slash commands, and hardened Docker or Modal sandboxes.^[8]^[11]^[16]
Self-hosting on small infrastructure	$5 VPS deployments, serverless backends with low idle cost, and migration tooling that imports OpenClaw configurations wholesale.^[1]^[18]

Adoption and reception

By mid-May 2026, the NousResearch/hermes-agent repository reported over 152,000 stars, 24,000 forks, and 11,000 open issues, with Python comprising roughly 88 percent of the codebase and TypeScript another 9 percent.^[3] At launch on February 25, 2026, Nous Research presented Hermes Agent as the lab's flagship application of its open-source mission.^[20] By April it had reportedly become the fastest-growing AI agent framework on GitHub, hitting 95,600 stars in seven weeks.^[21] In May, multiple outlets reported that Hermes Agent had passed OpenClaw on OpenRouter's global rankings by daily token volume, with industry coverage framing the milestone as a turning point for community-built personal agents.^[20]^[22]

Independent reviewers in the AI press generally praised the combination of memory, skills, and cross-platform messaging, while flagging two operational considerations: the cumulative cost of LLM inference for an always-on agent (estimated around USD 0.30 per complex task on budget models) and the operational discipline required to run gateway processes safely with terminal access exposed to multiple users.^[21]^[27]

Availability

The documented installation path on Linux, macOS, WSL2, and Termux is pip install hermes-agent followed by hermes postinstall for optional dependencies including Node.js, ripgrep, ffmpeg, and the bundled browser. The git-based installer remains available through curl -fsSL https://raw.githubusercontent.com/NousResearch/hermes-agent/main/scripts/install.sh | bash, and a native Windows beta uses a PowerShell irm | iex installer. Post-install, provider and tool configuration runs through hermes model, hermes tools, or hermes setup.^[6]^[18]

The source repository is public on GitHub under the MIT license.^[3]^[5] The project also publishes a community Discord, a skills hub at agentskills.io, and an OpenClaw-compatible WeChat bridge called HermesClaw for users migrating from earlier WeChat bot configurations.^[18]^[28]

References

Hermes Agent homepage, Nous Research. Accessed May 16, 2026. https://hermes-agent.nousresearch.com/ ↩
Hermes Agent documentation overview. Accessed May 16, 2026. https://hermes-agent.nousresearch.com/docs/ ↩
GitHub API metadata for `NousResearch/hermes-agent`. Accessed May 16, 2026. https://api.github.com/repos/NousResearch/hermes-agent ↩
GitHub releases for `NousResearch/hermes-agent`. Accessed May 16, 2026. https://api.github.com/repos/NousResearch/hermes-agent/releases ↩
`pyproject.toml` in `NousResearch/hermes-agent`, main branch. Accessed May 16, 2026. https://raw.githubusercontent.com/NousResearch/hermes-agent/main/pyproject.toml ↩
Quickstart, Hermes Agent docs. Accessed May 16, 2026. https://hermes-agent.nousresearch.com/docs/getting-started/quickstart/ ↩
CLI Interface, Hermes Agent docs. Accessed May 16, 2026. https://hermes-agent.nousresearch.com/docs/user-guide/cli/ ↩
Messaging Gateway, Hermes Agent docs. Accessed May 16, 2026. https://hermes-agent.nousresearch.com/docs/user-guide/messaging/ ↩
Persistent Memory, Hermes Agent docs. Accessed May 16, 2026. https://hermes-agent.nousresearch.com/docs/user-guide/features/memory/ ↩
Memory Providers, Hermes Agent docs. Accessed May 16, 2026. https://hermes-agent.nousresearch.com/docs/user-guide/features/memory-providers ↩
Tools and Toolsets, Hermes Agent docs. Accessed May 16, 2026. https://hermes-agent.nousresearch.com/docs/user-guide/features/tools/ ↩
Browser Automation, Hermes Agent docs. Accessed May 16, 2026. https://hermes-agent.nousresearch.com/docs/user-guide/features/browser/ ↩
Subagent Delegation, Hermes Agent docs. Accessed May 16, 2026. https://hermes-agent.nousresearch.com/docs/user-guide/features/delegation/ ↩
Code Execution, Hermes Agent docs. Accessed May 16, 2026. https://hermes-agent.nousresearch.com/docs/user-guide/features/code-execution/ ↩
MCP (Model Context Protocol), Hermes Agent docs. Accessed May 16, 2026. https://hermes-agent.nousresearch.com/docs/user-guide/features/mcp/ ↩
Security, Hermes Agent docs. Accessed May 16, 2026. https://hermes-agent.nousresearch.com/docs/user-guide/security/ ↩
Architecture, Hermes Agent docs. Accessed May 16, 2026. https://hermes-agent.nousresearch.com/docs/developer-guide/architecture/ ↩
`README.md` for `NousResearch/hermes-agent`, main branch. Accessed May 16, 2026. https://github.com/NousResearch/hermes-agent ↩
GitHub release notes for `v2026.3.12` (v0.2.0). Accessed May 16, 2026. https://github.com/NousResearch/hermes-agent/releases/tag/v2026.3.12 ↩
"OpenClaw vs Hermes Agent: Why Nous Research's Self-Improving Agent Now Leads OpenRouter's Global Rankings," MarkTechPost, May 10, 2026. https://www.marktechpost.com/2026/05/10/openclaw-vs-hermes-agent-why-nous-researchs-self-improving-agent-now-leads-openrouters-global-rankings/ ↩
"Hermes Agent Review: 95.6K Stars, Self-Improving AI Agent (April 2026)," TokenMix Blog and DEV Community, April 2026. https://tokenmix.ai/blog/hermes-agent-review-self-improving-open-source-2026 ↩
"Nous Research's Hermes Agent Dethrones OpenClaw as the World's Most-Used Open-Source AI Agent," TechTimes, May 15, 2026. https://www.techtimes.com/articles/316694/20260515/nous-researchs-hermes-agent-dethrones-openclaw-worlds-most-used-open-source-ai-agent.htm ↩
GitHub release notes for `v2026.4.16` (v0.10.0). Accessed May 16, 2026. https://github.com/NousResearch/hermes-agent/releases/tag/v2026.4.16 ↩
GitHub release notes for `v2026.4.30` (v0.12.0). Accessed May 16, 2026. https://github.com/NousResearch/hermes-agent/releases/tag/v2026.4.30 ↩
GitHub release notes for `v2026.5.7` (v0.13.0). Accessed May 16, 2026. https://github.com/NousResearch/hermes-agent/releases/tag/v2026.5.7 ↩
Nous Research home page. Accessed May 16, 2026. https://nousresearch.com ↩
"Hermes Agent vs OpenClaw: Which Open-Source AI Agent Wins in 2026?" KuCoin, 2026. https://www.kucoin.com/blog/hermes-agent-vs-openclaw-which-open-source-ai-agent-wins-in-2026 ↩
Skills, Hermes Agent docs. Accessed May 16, 2026. https://hermes-agent.nousresearch.com/docs/user-guide/features/skills/ ↩

Improve this article

Add missing citations, update stale details, or suggest a clearer explanation.

Suggest edit

What links here

Computer-use agent LangChain

History

Release timeline

Origins and developer

Relationship to OpenClaw

Architecture

The self-improvement loop

Persistent memory

Session recall

Skills system

External memory providers

Tool system

Terminal backends

Browser automation

Subagent delegation

Programmatic tool calling

Messaging and automation

Supported messaging surfaces

Slash commands and background tasks

Scheduled tasks

Interoperability

Model providers

Model Context Protocol

Agent Communication Protocol

Security model

Command approval modes

Container hardening

Authorization and pairing

Credentials and prompt injection

Positioning and comparisons

Versus coding copilots

Versus chatbot wrappers

Versus OpenClaw

Use cases

Adoption and reception

Availability

References

Improve this article

Related Articles

LangChain

CrewAI

Semantic Kernel

Claude Agent SDK

LangGraph

Cline (AI coding agent)

What links here

Related Articles

LangChain

CrewAI

Semantic Kernel

Claude Agent SDK

LangGraph

Cline (AI coding agent)

What links here