Introduction
Building agents is easy. Running them in production is hard. Polos handles sandboxes, durability, approvals, triggers, and observability - so you can focus on writing the agent. No DAGs. No graph syntax. Just Python or TypeScript.Polos vs. Typical Agent Frameworks
| Challenge | Typical agent framework | With Polos |
|---|---|---|
| Sandboxing | None - DIY or run unsandboxed | Docker, E2B + built-in tools (exec, files, search) |
| Durability | Agent crashes, start over | Auto-retry, resume from exact step |
| Approvals | Build it yourself | Slack, UI, terminal - one tap |
| Slack | Build a bot from scratch | @mention an agent, get responses in-thread |
| Triggers | Glue code for every webhook | Built-in: HTTP, webhooks, cron, events |
| Observability | Grep through logs | Full tracing, every tool call, every decision |
| Cost | Re-run failed LLM calls from scratch | Durable execution, prompt caching, 60-80% savings |
What You Get With Polos
- Sandboxed Execution - Isolated Docker and E2B environments. Built-in tools: exec, read, write, edit, glob, grep, web_search. No tool code to write, no sandbox lifecycle to manage.
- Human-in-the-Loop - Approval flows for any tool call. Reach your team via Slack, Discord, email. Paused agents consume zero compute.
- Durable Workflows - Auto-retry, log-replay, concurrency control. Resume from the exact step that failed. 60-80% cost savings via prompt caching.
- Slack Integration - @mention your bot to trigger any agent. Responses stream back to the thread. Tool approvals show up as Approve/Reject buttons in Slack.
- Triggers - Webhook URLs, HTTP API, cron schedules, event-driven.
- Observability - OpenTelemetry tracing for every step, tool call, and approval. Full execution history. Visual dashboard for monitoring and debugging.
- Bring Your Stack - Any LLM via Vercel AI SDK and LiteLLM. CrewAI, LangGraph, and Mastra compatible. Python or TypeScript. Open source - run anywhere.
CLI and UI to monitor and run your agents

See It in Action
Watch a coding agent built with Polos - sandboxed execution, tool approvals, and real-time observability.What Can You Build?
- Coding Agent - Clones repos, writes code, runs tests, and opens PRs - all inside a Docker sandbox with human approval before anything ships.
- Data Analyst - Connects to data warehouses, executes SQL in sandboxed environments, builds charts, sends approval pages before sharing with stakeholders.
- Research Agent - Crawls multiple sources, extracts findings, builds knowledge bases. Durable execution checkpoints after each source. Notifies via Slack or Discord.
- Order Processing - Handles incoming orders, runs fraud checks, suspends for human review on flagged transactions. Exactly-once execution guarantees no double charges.
- Scheduled Workflows - Cron-triggered agents that run nightly data syncs, generate reports, or clean up stale resources. Durable state means missed runs pick up where they left off.
Next Steps
Quickstart
Build and run a sandboxed coding agent in minutes
Core Concepts
Learn how Polos handles state and durability
Examples
See how to build HITL agents and multi-agent systems
Polos is 100% open source. Star us on GitHub if you find it useful, and join our Discord to connect with the community.