Open source · MIT Licensed · 0 cloud accounts needed

The web was built for humans.

Ghost was built for agents.

Turn any website into structured data your AI agent can use — automatically, in milliseconds.

via auto-generated MCP tools

0ms

cached tools

success rate

faster than screenshot agents

Built for

Claude

Anthropic

ChatGPT

OpenAI

Gemini

Google

DeepSeek

DeepSeek AI

Cohere

NVIDIA

NVIDIA AI

Grok

xAI

Claude

Anthropic

ChatGPT

OpenAI

Gemini

Google

DeepSeek

DeepSeek AI

Cohere

NVIDIA

NVIDIA AI

Grok

xAI

Claude

Anthropic

ChatGPT

OpenAI

Gemini

Google

DeepSeek

DeepSeek AI

Cohere

NVIDIA

NVIDIA AI

Grok

xAI

Claude

Anthropic

ChatGPT

OpenAI

Gemini

Google

DeepSeek

DeepSeek AI

Cohere

NVIDIA

NVIDIA AI

Grok

xAI

The difference

Configuration vs. auto-generation

Traditional scraping APIs require manual setup for every site. Ghost generates typed tools automatically from the live DOM.Example: scraping product data from Nike.com

Parse.bot (YC F'25) — Manual config, 15 min, failed

// Parse.bot API Configuration
// Sign up → Read docs → Configure selectors

{
  "url": "nike.com/products",
  "selectors": {
    "title": ".product-title",
    "price": ".product-price",
    "image": ".product-image img",
    "link":  ".product-card a"
  },
  "pagination": { "type": "scroll" },
  "auth": { "apiKey": "pk_live_..." }
}

// Time: 15 minutes → Failed ✗
// Cost: $50–500/mo + overages

Ghost — Auto-generated, 20ms, success

// Ghost Auto-Generated Tool
// Install extension → Browse naturally

const product = await ghost.nike_get_product(
  "air-jordan-1"
);

// Returns:
{
  name: "Air Jordan 1 Mid",
  price: "$125.00",
  inStock: true,
  colorways: 5,
  url: "/product/air-jordan-1-mid"
}

// Time: 20ms → Success ✓
// Cached — next call: 0.03ms, 0 tokens
// Cost: $0 (runs locally)

Traditional API

Sign upCreate account, verify email

Read docsLearn proprietary API format

Configure selectorsWrite CSS selectors manually

API callSend request to cloud service

Wait 15 minProcessing on their servers

✗

FailedNike blocks cloud scrapers

Ghost

Install extensionOne click, no account needed

Browse naturallyVisit nike.com like a human

Auto-generatedTools created in 20ms

MCP tool callStructured data returned

20msRuns locally on your machine

✓

Success94.6% across 1,000+ tools

Switch

Compare Ghost to your current tool

I currently use

Architecture

It just works. Here's why.

Three-tier cascade. Each tier is a fallback for the one above. Ghost auto-selects the fastest path for every request.

Instant pattern matching27ms

Heuristic Generation

Recognizes common page layouts on sight. No AI call needed. Covers 80%+ of sites instantly.

Smart fallbacks5 strategies

Multi-Strategy Selectors

If the first approach breaks, it automatically tries five other strategies. Self-healing, no intervention needed.

AI polishbackground

LLM Refinement

For truly unusual pages, an LLM refines selectors in the background. Your tools work immediately; accuracy improves async.

One command. Zero configuration.

Install Ghost and start generating tools in under 10 seconds.

Terminal — ghost

Interactive

See Ghost in action

Pick a scenario and watch Ghost navigate, analyze, and generate tools in real time.

ghost \u2014 interactive demo

idle

ghost v0.1.0|Extract top stories from Hacker News

~/.ghost/registry/|MCP connected

Intelligence

Powered by Claude Opus 4.6

Ghost uses Claude Opus 4.6 with extended thinking for its most critical operations. Not just API calls — deep reasoning that makes tools smarter over time.

Self-Healingextended thinking

Opus 4.6 Selector Repair

When a website changes its DOM and CSS selectors break, Opus 4.6 reasons about the breakage using extended thinking (chain-of-thought), then generates repaired selectors that adapt to the new layout. Zero human intervention.

2,048 thinking tokensauto-retry

Researchextended thinking

Deep Research Planning

Ghost's ultra-research mode uses Opus 4.6 extended thinking to plan multi-site research strategies. The model reasons about which sites to visit, what data to extract, and how to synthesize findings — all with visible chain-of-thought.

4,096 thinking tokensmulti-site synthesis

Why extended thinking matters: Standard LLM calls generate tools that work 80% of the time. With extended thinking, Opus 4.6 reasons through edge cases — shadow DOM, dynamic class names, framework-specific patterns — producing selectors that survive site redesigns. Ghost's 94.6% success rate is built on this deeper reasoning.

Benchmarks

1,000+ tools. 50+ sites. No cherry-picking.

Every number is from real-world testing against production websites.

Tools Generated

Validation Rate

0ms

Avg Generation

0ms

Avg Execution

Ghost vs Parse.bot (YC F'25)

Same task. Same site. Wildly different results.

	Parse.bot (YC F'25)	Ghost
Nike.com products	15+ min (failed)	20ms (0.03ms cached) ✅
Setup	Sign up, API keys, docs	Install extension
Cost	$50–500/mo + overages	$0 (local)
Maintenance	Manual selector updates	Self-healing
Privacy	Cloud (data on their servers)	100% local
API keys	Required	0 required

Ghost gets faster the more you use it

Every other tool runs the same work on repeat. Ghost generates once, caches forever.

	Day 1	Day 30
Avg generation	20ms	0.03ms
Tokens used	Minimal	Zero
Tools cached	0	1,000+
Maintenance needed	None	Still none

Parse.bot on day 30? Same as day 1. Same latency. Same cost. Same token burn.

Use Cases

Built for developers building agents

Not “save time.” Not “easy to use.” Specific technical capabilities for specific technical problems.

Automated UI testing without Selenium boilerplate

Ghost-generated tools replace fragile Selenium selectors. Self-healing means tests don't break on redesigns.

Multi-step agent workflows

Search → filter → select → purchase. Ghost generates a tool for each step. Chain them in your agent.

Real-time competitive price monitoring

Generate tools for competitor product pages. Run them on a schedule. Structured pricing data, zero maintenance.

Form automation for enterprise SaaS

Ghost maps form fields automatically. Fill, submit, and verify across any web application.

Web scraping without maintenance overhead

No selector updates when sites change. Quality scoring detects degradation. Auto-regeneration on failures.

RAG pipelines from any web source

Extract structured content from documentation sites, wikis, forums. Clean data for your retrieval pipeline.

What your agent code looks like

// Auto-generated Nike.com tool — no config needed
const product = await ghost.nike_get_product("air-jordan-1");
// { name: "Air Jordan 1", price: "$170", inStock: true }

// Competitive monitoring — runs on schedule
const prices = await ghost.amazon_get_price("B09V3KXJPB");
const competitor = await ghost.bestbuy_get_price("6505727");

// Form automation — maps fields automatically
await ghost.salesforce_fill_lead({
  name: "Acme Corp",
  email: "deals@acme.com",
  stage: "Qualified"
});

Technical

Zero fluff. Here's what's under the hood.

0 config

MCP-native

Works with Claude Code, Cline, Continue, Cursor. Any MCP client gets structured web tools automatically.

5 fallbacks

5-strategy selectors

data-testid → aria-label → id → semantic HTML → text content. If one breaks, the next takes over.

0 data sent

Local-first

Tools, sessions, data — everything in ~/.ghost/ on your device. No cloud. No telemetry. No third-party processing.

undetected

Browser pool

Stealth mode, auth persistence, multi-tab. Handles anti-bot measures and authenticated sessions.

27ms primary

Heuristic + LLM hybrid

Instant heuristic generation in 27ms. LLM refines in background. Tools work immediately, improve async.

0 maintenance

Self-healing

Quality scoring per selector. Detects degradation automatically. Regenerates broken tools without intervention.

Join the beta. Build the future.

100 beta spots. Chrome extension launching March 2026.

+37

joined the waitlist

Get notified when the Chrome extension launches: