Agent-Sudo: Adaptive High-Friction Guardrails for Autonomous Agents

Traditional HITL asks: "Do you agree?" — Users click Yes without reading.

SUDO.md asks: "Prove you mean it." — Users must actively demonstrate intent.

The Problem: Alert Fatigue

Traditional Human-in-the-Loop (HITL) systems suffer from a critical flaw: they degenerate into formalism.

When agents prompt users for every action, users develop muscle memory and approve without reading. The human is in the loop, but the brain is not. In security research, this is called "Alert Fatigue".

Why Sandboxing Is Not Enough

Sandboxes prevent malicious code escape. But AI agents are authorized users.

When Codex misunderstands your intent and decides to DROP TABLE users, the sandbox won't stop it — because Codex has permission. You need something that catches authorized stupidity.

Agent-Sudo solves this by requiring Proof of Intent, not just Authorization.

The Solution: Cognitive Forcing Functions

We propose "Adaptive High-Friction Guardrails" — a protocol that doesn't just ask for permission, but forces cognitive engagement through progressively challenging verification mechanisms.

Traditional HITL	Agent-Sudo
Passive approval: "Click OK to continue"	Active challenge-response
Susceptible to habituation	Breaks autopilot mode
Same friction for all actions	Friction scales with risk
Authorization only	Proof of Intent

The Friction Protocol: Three Layers

Level	Name	Mechanism	Purpose
L2	⏳ Temporal Friction	5-second mandatory cooldown	Forces users out of "fast thinking" mode
L3	🧠 Cognitive Friction	Semantic Echo (type to confirm)	Forces reading and understanding
L4	🔐 Strong Authentication	Passkey, TOTP, YubiKey, Push, SMS/Email	Out-of-band verification for critical ops

L0 (read-only) and L1 (reversible writes) pass through with minimal or standard confirmation.

The `SUDO.md` Specification

Just as AGENTS.md provides context for coding agents, Agent-Sudo introduces the SUDO.md standard. This file resides in the root of a repository to define safety boundaries.

Example SUDO.md:

version: "1.0"
security_rules:
  # L3: Database deletion requires typing to confirm
  - pattern: "DROP TABLE .*"
    risk_level: "L3"
    challenge: "semantic_echo"
    message: "⚠️ DATABASE DELETION DETECTED"

  # L2: System restart requires 5s delay
  - command: "systemctl restart .*"
    risk_level: "L2"
    delay_seconds: 5

  # L4: Large fund transfers require biometric
  - tool: "transfer_funds"
    condition: "amount > 50"
    risk_level: "L4"
    auth: "biometric"

Architecture

Agent-Sudo acts as a middleware/interceptor between the LLM reasoning loop and the tool execution environment.

graph TD
    User[Human Operator]
    Agent[LLM Agent]
    Interceptor[🛡️ Agent-Sudo Layer]
    Execution[Sandbox/Env]

    User -->|Prompts| Agent
    Agent -->|Tool Call| Interceptor

    Interceptor -->|1. Regex/Static Check| RiskCalc{Risk Level?}
    Interceptor -->|2. Semantic Audit| RiskCalc

    RiskCalc -->|L0-L1| Execution
    RiskCalc -->|L2-L4| Guardrail[🚧 Friction Challenge]

    Guardrail -->|⏳ Wait 5s / 🧠 Type Key / 🔐 Biometric| User
    User -->|Verify| Guardrail

    Guardrail -->|Success| Execution
    Guardrail -->|Fail/Timeout| Agent

Who Should Adopt SUDO.md?

For Agent Developers

If you're building an AI agent (coding assistant, computer use agent, workflow automation), implement SUDO.md protocol:

Codex, Cursor, Windsurf → Read project's SUDO.md before risky git/shell operations
Manus, Claude Computer Use → Check SUDO.md before file deletion, email actions
n8n, Zapier AI → Enforce SUDO.md for financial workflows

For Project/System Owners

Add SUDO.md to your repository or system to define safety rules:

# Example: Backend project SUDO.md
version: "1.0"
security_rules:
  - command: "git push --force"
    risk_level: "L3"
    semantic_key: "force-push"
  - pattern: "DROP TABLE"
    risk_level: "L4"
    auth_methods: ["passkey", "totp"]

Getting Started

For Agent Developers:

Parse SUDO.md files in user projects
Implement friction UI for L1-L4 levels
Intercept tool calls and match against rules

For Project Owners:

Add SUDO.md to your repository root
Define rules for risky operations
Agents that support SUDO.md will enforce your rules

License

MIT License — Open Source and Free Forever.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
components		components
pages		pages
public		public
styles		styles
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
SUDO.md		SUDO.md
next-env.d.ts		next-env.d.ts
next.config.ts		next.config.ts
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
postcss.config.mjs		postcss.config.mjs
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Agent-Sudo: Adaptive High-Friction Guardrails for Autonomous Agents

The Problem: Alert Fatigue

Why Sandboxing Is Not Enough

The Solution: Cognitive Forcing Functions

The Friction Protocol: Three Layers

The `SUDO.md` Specification

Architecture

Who Should Adopt SUDO.md?

For Agent Developers

For Project/System Owners

Getting Started

License

About

Uh oh!

Releases

Packages

Languages

License

Agent-Sudo-Org/agent-sudo

Folders and files

Latest commit

History

Repository files navigation

Agent-Sudo: Adaptive High-Friction Guardrails for Autonomous Agents

The Problem: Alert Fatigue

Why Sandboxing Is Not Enough

The Solution: Cognitive Forcing Functions

The Friction Protocol: Three Layers

The SUDO.md Specification

Architecture

Who Should Adopt SUDO.md?

For Agent Developers

For Project/System Owners

Getting Started

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

The `SUDO.md` Specification

Packages