GitHub - factspark23-hash/Agent-OS: 🌐 203 browser tools for AI agents (stealth, self-healing, captcha-solving, production-grade automation server)

Quick Start · Connectors · Stealth · Adaptive · Commands · Architecture · Deployment

Agent-OS gives AI agents a real browser — persistent, stealthy, and self-hosted. It ships 203 tools for navigation, form filling, data extraction, CAPTCHA bypass, adaptive scraping, and more. Works with Claude, GPT-4, Codex, OpenClaw, and any agent that can send an HTTP request.

One command to install. One config to connect. No API key needed when using the MCP Passthrough connector.

curl -sSL https://raw.githubusercontent.com/factspark23-hash/Agent-OS/main/install.sh | bash

Why Agent-OS?

Problem	Agent-OS Solution
AI agents can't interact with websites	Real Chromium browser with 203 tools
Bot detection blocks automation	26+ anti-detection vectors, Cloudflare bypass
Website changes break selectors	Adaptive scraper — learns element fingerprints, auto-relocates
Manual login required	Login handoff — pause AI, human logs in, resume
Single IP gets blocked	Proxy rotation with 4 strategies + health tracking
LLM token waste on browser output	SmartCompressor — 87% token savings
Need multiple AI platforms	7 connectors: MCP Passthrough, MCP Server, OpenAI, Claude API, OpenClaw, CLI, HTTP REST

⚡ Quick Start

Option 1: One-Command Install

curl -sSL https://raw.githubusercontent.com/factspark23-hash/Agent-OS/main/install.sh | bash

# With options
curl -sSL .../install.sh | bash -s -- --token my-secret-token
curl -sSL .../install.sh | bash -s -- --headed          # Show browser
curl -sSL .../install.sh | bash -s -- --port 9000       # Custom port

Option 2: Manual Install

git clone https://github.com/factspark23-hash/Agent-OS.git
cd Agent-OS
python3 -m venv venv && source venv/bin/activate
pip install -r requirements.txt
python3 -m patchright install chromium

export JWT_SECRET_KEY=$(python3 -c 'import secrets; print(secrets.token_urlsafe(48))')
python3 main.py --agent-token "your-token"

Option 3: Docker

git clone https://github.com/factspark23-hash/Agent-OS.git
cd Agent-OS
export POSTGRES_PASSWORD="strong-password"
docker compose up -d

First Commands

# Health check
curl http://localhost:8001/health

# Navigate
curl -X POST http://localhost:8001/command \
  -H "Content-Type: application/json" \
  -d '{"token":"your-token","command":"navigate","url":"https://github.com"}'

# Screenshot
curl -X POST http://localhost:8001/command \
  -H "Content-Type: application/json" \
  -d '{"token":"your-token","command":"screenshot"}'

# Click by text (no CSS selector needed)
curl -X POST http://localhost:8001/command \
  -H "Content-Type: application/json" \
  -d '{"token":"your-token","command":"smart-click","text":"Sign in"}'
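The same calls can be made from Python. The following is a minimal sketch using only the standard library; the helper names `build_command_request` and `send_command` are hypothetical (not part of Agent-OS), and it assumes the server replies with a JSON body, which the examples above do not confirm.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8001"  # same host and port as the curl examples


def build_command_request(token, command, base_url=BASE_URL, **params):
    """Build (but do not send) a POST /command request.

    Hypothetical helper: mirrors the JSON body used by the curl examples.
    """
    payload = {"token": token, "command": command, **params}
    return urllib.request.Request(
        f"{base_url}/command",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send_command(token, command, **params):
    """Send a command to a running server and decode the reply.

    Assumption: the response body is JSON.
    """
    request = build_command_request(token, command, **params)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the server running, `send_command("your-token", "navigate", url="https://github.com")` corresponds to the "Navigate" curl call, and `send_command("your-token", "smart-click", text="Sign in")` to the "Click by text" call.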

🔌 Connectors

Tool counts per connector (the CLI and HTTP REST connectors expose 202 of the 203 tools):

Connector	Tools	Use With	API Key?
MCP Passthrough ⭐	203	Claude Desktop, Claude Code, Codex	❌ No
MCP Server	203	Claude Desktop, Claude Code, Codex	Optional
OpenAI	203	GPT-4, GPT-4o, any OpenAI-compatible	Yes
Claude API	203	Claude API (tool-use format)	Yes
OpenClaw	203	OpenClaw agent framework	Optional
CLI (Bash)	202	Any language (Python, Node, Go...)	Token
HTTP REST	202	Direct API calls	Token

MCP Passthrough (Zero API Key) ⭐

./run_mcp.sh --token "my-secret-token"

Claude Desktop config:

{
  "mcpServers": {
    "agent-os": {
      "command": "python3",
      "args": ["/path/to/Agent-OS/connectors/mcp_passthrough.py"],
      "env": {
        "AGENT_OS_URL": "http://localhost:8001",
        "AGENT_OS_TOKEN": "my-secret-token",
        "AGENT_OS_COMPRESS": "aggressive"
      }
    }
  }
}

🛡️ Stealth Engine

Agent-OS defeats bot detection with a 4-layer defense system:

┌─────────────────────────────────────────────────────────┐
│ Layer 1: Network                                         │
│ Chrome TLS fingerprint (JA3/JA4) via curl_cffi          │
│ HTTP/2 matching • Bot scripts blocked at network level   │
├─────────────────────────────────────────────────────────┤
│ Layer 2: CDP (Chrome DevTools Protocol)                  │
│ Page.addScriptToEvaluateOnNewDocument injection          │
│ User-Agent metadata spoofing • Timezone override         │
├─────────────────────────────────────────────────────────┤
│ Layer 3: JavaScript (19 injection modules)               │
│ navigator.webdriver removal • CDP property filtering     │
│ WebGL/Canvas/Audio fingerprint spoofing                  │
│ WebRTC IP leak prevention • Function toString masking    │
├─────────────────────────────────────────────────────────┤
│ Layer 4: Behavior                                        │
│ Bezier-curve mouse movements • Realistic typing rhythms  │
│ Word pause simulation • Typo + correction (3% rate)      │
└─────────────────────────────────────────────────────────┘

Blocked vendors: DataDome, PerimeterX, Imperva, Akamai, Cloudflare Bot Management, Turnstile, Kasada, Shape Security, F5, Arkose Labs, ThreatMetrix, hCaptcha, reCAPTCHA
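To illustrate the "Bezier-curve mouse movements" item in Layer 4, here is a minimal sketch of how a curved pointer path could be generated. It is not the Agent-OS implementation; the function name `bezier_path` and the `spread` parameter are assumptions made for this example.

```python
import random


def bezier_path(start, end, steps=20, spread=100.0, rng=None):
    """Return points along a cubic Bezier curve from start to end.

    Two control points are jittered around the straight line, so each
    generated path bends a little differently (illustrative only).
    """
    rng = rng or random.Random()
    (x0, y0), (x3, y3) = start, end
    # Control points near 1/3 and 2/3 of the way, each with a random offset.
    x1 = x0 + (x3 - x0) / 3 + rng.uniform(-spread, spread)
    y1 = y0 + (y3 - y0) / 3 + rng.uniform(-spread, spread)
    x2 = x0 + 2 * (x3 - x0) / 3 + rng.uniform(-spread, spread)
    y2 = y0 + 2 * (y3 - y0) / 3 + rng.uniform(-spread, spread)

    points = []
    for i in range(steps + 1):
        t = i / steps
        u = 1 - t
        # Cubic Bezier: B(t) = u^3*P0 + 3*u^2*t*P1 + 3*u*t^2*P2 + t^3*P3
        x = u**3 * x0 + 3 * u**2 * t * x1 + 3 * u * t**2 * x2 + t**3 * x3
        y = u**3 * y0 + 3 * u**2 * t * y1 + 3 * u * t**2 * y2 + t**3 * y3
        points.append((x, y))
    return points
```

A caller would move the pointer through the returned points in order, typically with small, uneven delays between steps.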

🧠 Adaptive Scraper

When a website changes its DOM structure, traditional selectors break. Agent-OS remembers element fingerprints and relocates them automatically:

1. Find element with CSS selector → ✅ Found → Save fingerprint (tag, attrs, text, path, parent)
2. Website redesigns, selector breaks → ❌ Not found
3. Load stored fingerprint → Scan all page elements → Score similarity
4. Best match above 40% threshold → ✅ Element relocated!

Fingerprint components:

Component	Weight	What it captures
Tag name	30%	`div`, `span`, `a`, etc.
Attributes	30%	class, id, name, href
Text content	20%	Inner text (survives minor changes)
DOM path	10%	Tag chain from root
Parent context	10%	Parent tag + attributes
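The weighted scoring and the 40% threshold described above can be sketched as follows. This is an illustration under stated assumptions, not the project's code: the fingerprint is modeled as a plain dict, and the per-component similarity measures (exact tag match, Jaccard overlap for attributes, `difflib` ratio for strings) are choices made for this example.

```python
from difflib import SequenceMatcher

# Weights and threshold taken from the table and steps above.
WEIGHTS = {"tag": 0.30, "attrs": 0.30, "text": 0.20, "path": 0.10, "parent": 0.10}
THRESHOLD = 0.40


def _ratio(a, b):
    """String similarity in [0, 1]."""
    return SequenceMatcher(None, a, b).ratio()


def _attr_overlap(a, b):
    """Jaccard overlap of (name, value) attribute pairs."""
    pairs_a, pairs_b = set(a.items()), set(b.items())
    if not pairs_a and not pairs_b:
        return 1.0
    return len(pairs_a & pairs_b) / len(pairs_a | pairs_b)


def similarity(saved, candidate):
    """Weighted similarity between a stored fingerprint and a page element."""
    parts = {
        "tag": 1.0 if saved["tag"] == candidate["tag"] else 0.0,
        "attrs": _attr_overlap(saved["attrs"], candidate["attrs"]),
        "text": _ratio(saved["text"], candidate["text"]),
        "path": _ratio(saved["path"], candidate["path"]),
        "parent": _ratio(saved["parent"], candidate["parent"]),
    }
    return sum(WEIGHTS[name] * score for name, score in parts.items())


def relocate(saved, candidates):
    """Return the best-scoring candidate above the threshold, else None."""
    best = max(candidates, key=lambda c: similarity(saved, c), default=None)
    if best is not None and similarity(saved, best) > THRESHOLD:
        return best
    return None
```

Under this sketch, an element that keeps its tag and text after a redesign already scores at least 0.50, so it is relocated even when its class names and DOM path have changed.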

Commands:

# Find element adaptively
{"command": "adaptive-find", "selector": ".product-title", "identifier": "product-name"}

# Save element fingerprint manually
{"command": "adaptive-save", "selector": "#login-btn", "identifier": "login-button"}

# View stored fingerprints
{"command": "adaptive-stats"}

# Clean old fingerprints
{"command": "adaptive-cleanup", "max_age_days": 30}

🔄 Proxy Rotation

Thread-safe proxy rotator with 4 strategies:

Strategy	How it works	Best for
Cyclic	Sequential round-robin	General scraping
Weighted	Higher weight = more requests	Premium vs budget proxies
Random	Random selection	Anti-pattern detection
Sticky	Same proxy per domain	Session-based scraping

Health tracking: the rotator records each proxy's success rate, latency, and consecutive failures. Unhealthy proxies are skipped automatically, and requests fail over to another proxy.

from src.tools.proxy_rotator import ProxyRotator

rotator = ProxyRotator(
    proxies=["http://proxy1:8080", "http://proxy2:8080", "http://proxy3:8080"],
    strategy="weighted"
)

proxy = rotator.get_proxy()                    # Get next proxy
proxy = rotator.get_proxy(domain="google.com") # Sticky per domain
proxy = rotator.get_proxy(country="US")        # Geo-targeted

rotator.record_result(proxy_id, success=True, latency_ms=120)
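For readers who want to see how the four strategies differ, here is a toy selector written for this README. `MiniRotator` is a hypothetical class, not the `ProxyRotator` shipped in `src/tools/proxy_rotator.py`; it omits health tracking, geo-targeting, and thread safety.

```python
import itertools
import random


class MiniRotator:
    """Toy rotator illustrating the four selection strategies."""

    def __init__(self, proxies, strategy="cyclic", weights=None, seed=None):
        self.proxies = list(proxies)
        self.strategy = strategy
        self.weights = weights or [1] * len(self.proxies)
        self._cycle = itertools.cycle(self.proxies)
        self._sticky = {}                 # domain -> proxy chosen for it
        self._rng = random.Random(seed)

    def get_proxy(self, domain=None):
        if self.strategy == "cyclic":
            # Sequential round-robin.
            return next(self._cycle)
        if self.strategy == "weighted":
            # Higher weight means proportionally more picks.
            return self._rng.choices(self.proxies, weights=self.weights, k=1)[0]
        if self.strategy == "random":
            return self._rng.choice(self.proxies)
        if self.strategy == "sticky":
            # The first request for a domain picks a proxy; later ones reuse it.
            if domain not in self._sticky:
                self._sticky[domain] = self._rng.choice(self.proxies)
            return self._sticky[domain]
        raise ValueError(f"unknown strategy: {self.strategy}")
```

Sticky selection matters for session-based scraping because cookies and login state are usually tied to the IP address that created them.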

🌐 Browser Automation

203 tools, grouped into the 21 categories below:

Category	Tools	Highlights
Navigation	6	`navigate`, `smart-navigate` (auto HTTP/browser)
Interaction	17	`click`, `fill-form`, `drag-drop`, `scroll`
Smart Finder	4	Find by visible text — no CSS selectors
Content	9	`screenshot`, `get-dom`, `evaluate-js`
Page Analysis	9	`page-seo`, `page-emails`, `page-accessibility`
Network	8	Capture XHR, export HAR
Security	3	`scan-xss`, `scan-sqli`, `scan-sensitive`
Workflows	6	Multi-step automation with variables
Sessions	8	Save/restore cookies, auto-login
Proxy	18	Pool management, health checks, rotation
Adaptive	4	Element fingerprinting + relocation
Smart Wait	7	7 wait strategies
Auto-Heal	10	Self-healing selectors
Auto-Retry	10	Circuit breaker + exponential backoff
Recording	18	Record, replay, export workflows
Multi-Agent	19	Shared sessions, task queues, locks
Login Handoff	8	Pause AI → human logs in → resume
LLM	7	Built-in `llm-complete`, `llm-summarize`
AI Content	6	Structured extraction, schema.org
CAPTCHA	6	Preempt, solve, monitor
TLS HTTP	4	Chrome TLS fingerprint without browser
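The Auto-Retry row mentions a circuit breaker combined with exponential backoff. As a rough illustration of that pattern (not the Agent-OS code), the sketch below retries a callable with doubling delays and stops calling it altogether after repeated failures. The class name `RetryWithBreaker` and all default values are assumptions chosen for the example.

```python
import time


class CircuitOpenError(RuntimeError):
    """Raised when the breaker is open and calls are being rejected."""


class RetryWithBreaker:
    def __init__(self, max_attempts=4, base_delay=0.5, failure_limit=3,
                 cooldown=30.0, sleep=time.sleep, clock=time.monotonic):
        self.max_attempts = max_attempts
        self.base_delay = base_delay
        self.failure_limit = failure_limit
        self.cooldown = cooldown
        self.sleep = sleep
        self.clock = clock
        self._failures = 0       # consecutive failures across calls
        self._opened_at = None   # time the breaker tripped, or None if closed

    def call(self, func):
        if self._opened_at is not None:
            if self.clock() - self._opened_at < self.cooldown:
                raise CircuitOpenError("circuit open; call rejected")
            # Cooldown elapsed: close the breaker and allow calls again.
            self._opened_at = None
            self._failures = 0
        for attempt in range(self.max_attempts):
            try:
                result = func()
            except Exception:
                self._failures += 1
                if self._failures >= self.failure_limit:
                    self._opened_at = self.clock()  # trip the breaker
                    raise
                if attempt == self.max_attempts - 1:
                    raise  # out of attempts
                # Exponential backoff: base, 2x base, 4x base, ...
                self.sleep(self.base_delay * 2 ** attempt)
            else:
                self._failures = 0
                return result
```

The backoff spaces out retries against a struggling site, while the breaker prevents a failing target from consuming every attempt of every caller.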

🔐 Authentication

3-layer auth system:

# Layer 1: JWT (recommended)
curl -X POST http://localhost:8001/auth/register \
  -H "Content-Type: application/json" \
  -d '{"email":"you@example.com","username":"admin","password":"StrongPass123!"}'

curl -X POST http://localhost:8001/auth/login \
  -H "Content-Type: application/json" \
  -d '{"username":"admin","password":"StrongPass123!"}'

# Layer 2: API Keys
curl -X POST http://localhost:8001/auth/api-keys \
  -H "Authorization: Bearer YOUR_JWT" \
  -H "Content-Type: application/json" \
  -d '{"name":"my-key","scopes":["browser"]}'

# Layer 3: Legacy Tokens (dev only)
python3 main.py --agent-token "dev-token"
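The Tech Stack table lists JWT with HS256, which is why `JWT_SECRET_KEY` has to be long, random, and private: anyone holding it can mint valid tokens. The sketch below shows what HS256 signing and verification involve, using only the standard library. It is an illustration, not Agent-OS's auth code; the function names are hypothetical, and it deliberately skips expiry and header checks that a maintained JWT library would perform.

```python
import base64
import hashlib
import hmac
import json


def _b64url(data: bytes) -> str:
    """Base64url without padding, as used by JWT."""
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")


def _b64url_decode(text: str) -> bytes:
    return base64.urlsafe_b64decode(text + "=" * (-len(text) % 4))


def _mac(signing_input: str, secret: str) -> str:
    digest = hmac.new(secret.encode("utf-8"), signing_input.encode("utf-8"),
                      hashlib.sha256).digest()
    return _b64url(digest)


def sign_hs256(claims: dict, secret: str) -> str:
    """Return header.payload.signature for the given claims."""
    header = {"alg": "HS256", "typ": "JWT"}
    signing_input = ".".join(
        _b64url(json.dumps(part, separators=(",", ":")).encode("utf-8"))
        for part in (header, claims)
    )
    return f"{signing_input}.{_mac(signing_input, secret)}"


def verify_hs256(token: str, secret: str) -> dict:
    """Check the signature and return the claims; raise ValueError if invalid."""
    signing_input, _, signature = token.rpartition(".")
    expected = _mac(signing_input, secret)
    # Constant-time comparison avoids leaking how many characters matched.
    if not hmac.compare_digest(expected.encode("utf-8"), signature.encode("utf-8")):
        raise ValueError("invalid signature")
    return json.loads(_b64url_decode(signing_input.split(".")[1]))
```

Because the signature depends only on the shared secret, rotating `JWT_SECRET_KEY` invalidates every token signed with the old value.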

🏗️ Architecture

┌──────────────────────────────────────────────────────────────┐
│  External Clients                                             │
│  Claude Desktop │ GPT-4 │ Codex │ CLI │ HTTP/WS             │
└────────┬────────┴───┬─────┴───┬───┴──┬──┴──────┬─────────────┘
         │            │         │      │         │
         ▼            ▼         ▼      ▼         ▼
┌──────────────────────────────────────────────────────────────┐
│  Connectors (203 tools each)                                 │
│  MCP │ OpenAI │ Claude │ OpenClaw │ CLI │ REST+WebSocket    │
└────────┬──────┴───┬─────┴────┬─────┴──┬──┴──────┬───────────┘
         └──────────┴────┬─────┴────────┴─────────┘
                         ▼
┌──────────────────────────────────────────────────────────────┐
│  Agent Server (aiohttp)                                      │
│  Auth │ Rate Limiter │ Validator │ Command Router            │
└────────────────────────┬─────────────────────────────────────┘
              ┌──────────┼──────────┐
              ▼          ▼          ▼
┌──────────────┐ ┌──────────────┐ ┌──────────────┐
│ Browser      │ │ Tools Layer  │ │ Infrastructure│
│ (Patchright  │ │ Adaptive     │ │ PostgreSQL   │
│  + Stealth)  │ │ Auto-Heal    │ │ Redis        │
│ 26+ vectors  │ │ Workflows    │ │ JWT Auth     │
│              │ │ LLM Provider │ │ Logging      │
└──────────────┘ └──────────────┘ └──────────────┘

🚀 Deployment

Production Checklist

# 1. Set JWT secret
export JWT_SECRET_KEY=$(python3 -c 'import secrets; print(secrets.token_urlsafe(48))')

# 2. Start with production flags
python3 main.py \
  --agent-token "strong-random-token" \
  --port 8000 \
  --max-ram 500 \
  --json-logs

# 3. Verify
curl http://localhost:8001/health

Docker Compose (Full Stack)

export POSTGRES_PASSWORD="strong-db-password"
docker compose --profile with-nginx up -d

Scaling

Config	Concurrent Users	Memory
1 instance × 50 contexts	50	~800 MB
3 instances × 50 contexts	150	~2.4 GB
5 instances × 50 contexts	250	~4 GB
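The table scales linearly: about 800 MB per instance, or roughly 16 MB per browser context. A small helper makes the arithmetic explicit for other user counts. It assumes one context per concurrent user and that the linear trend in the table continues, neither of which is guaranteed; `plan_capacity` is a name invented for this example.

```python
import math

CONTEXTS_PER_INSTANCE = 50   # from the table above
MB_PER_INSTANCE = 800        # approximate figure from the table above


def plan_capacity(concurrent_users: int) -> dict:
    """Estimate instances and memory, assuming one context per user."""
    instances = max(1, math.ceil(concurrent_users / CONTEXTS_PER_INSTANCE))
    return {
        "instances": instances,
        "capacity": instances * CONTEXTS_PER_INSTANCE,
        "memory_mb": instances * MB_PER_INSTANCE,
    }
```

For example, `plan_capacity(150)` reproduces the second row of the table (3 instances, about 2.4 GB), and 201 users would round up to 5 instances.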

📁 Project Structure

Agent-OS/
├── main.py                          # Entry point
├── install.sh                       # One-command installer
├── docker-compose.yml               # Full Docker stack
├── requirements.txt                 # Python dependencies
│
├── src/
│   ├── core/                        # Browser engine
│   │   ├── browser.py               #   Main browser (Patchright/Chromium)
│   │   ├── stealth.py               #   Anti-detection JS (1264 lines)
│   │   ├── cdp_stealth.py           #   CDP-level stealth
│   │   ├── stealth_god.py           #   GOD MODE (26+ vectors)
│   │   ├── llm_provider.py          #   12 LLM providers
│   │   └── config.py                #   YAML configuration
│   │
│   ├── tools/                       # Feature engines
│   │   ├── adaptive_scraper.py      #   ⭐ Adaptive element relocation
│   │   ├── proxy_rotator.py         #   ⭐ 4-strategy proxy rotation
│   │   ├── auto_heal.py             #   Self-healing selectors
│   │   ├── workflow.py              #   Multi-step workflows
│   │   ├── session_recording.py     #   Record & replay
│   │   └── ...                      #   15+ more tools
│   │
│   ├── security/                    # Stealth & evasion
│   │   ├── evasion_engine.py        #   Fingerprint generation
│   │   ├── captcha_solver.py        #   CAPTCHA solving
│   │   └── cloudflare_bypass.py     #   Cloudflare bypass
│   │
│   └── agents/
│       └── server.py                # WebSocket + HTTP (202 commands)
│
├── connectors/                      # AI Platform Connectors
│   ├── _tool_registry.py            #   203 tool definitions
│   ├── mcp_server.py                #   MCP (Claude/Codex)
│   └── openai_connector.py          #   OpenAI function-calling
│
└── tests/                           # Test suite

🛠️ Tech Stack

Component	Technology
Browser	Patchright (stealth Playwright) + Chromium
HTTP Client	curl_cffi (Chrome TLS fingerprint)
Database	PostgreSQL (SQLAlchemy async)
Cache	Redis (with in-memory fallback)
Auth	JWT (HS256) + API keys
Validation	Pydantic v2
Logging	structlog
Runtime	Python 3.10+ / asyncio

🤝 Contributing

git clone https://github.com/factspark23-hash/Agent-OS.git
cd Agent-OS
python3 -m venv venv && source venv/bin/activate
pip install -r requirements.txt
python3 -m patchright install chromium

# Run tests
python3 -m pytest tests/ -v

# Start dev server
python3 main.py --headed --debug --agent-token "dev-token"

❓ Troubleshooting

Problem	Solution
Port in use	`python3 main.py --port 9000`
Chromium not found	`python3 -m patchright install chromium`
JWT warning	`export JWT_SECRET_KEY=$(python3 -c 'import secrets; print(secrets.token_urlsafe(48))')`
Site detects bot	Try `--device iphone_14` or add `--proxy`
High RAM	`python3 main.py --max-ram 500`

📄 License

MIT License — free for commercial and personal use.

Third-Party Code

Scrapling by Karim Shoair — Adaptive scraping algorithm and proxy rotation engine. Used under BSD 3-Clause License.
