Question 1

What's the difference between AHTML and Firecrawl?

Accepted Answer

Firecrawl reads your site from the outside. AHTML emits structured data from the inside.
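To make the outside/inside distinction concrete, here is a minimal Python sketch. It does not use Firecrawl's or AHTML's actual APIs; `PRODUCT`, `render_page`, `crawl_from_outside`, and `emit_from_inside` are hypothetical names chosen only for illustration. The outside path has to recover fields by parsing rendered HTML, while the inside path serializes data the application already holds.

```python
import json
from html.parser import HTMLParser

# Hypothetical application data; a real app would load this from its own store.
PRODUCT = {"name": "Desk lamp", "price_usd": 39.0}


def render_page(product):
    """Render the HTML that a browser (or an external crawler) would receive."""
    return (
        "<html><body>"
        f"<h1>{product['name']}</h1>"
        f"<span class='price'>${product['price_usd']:.2f}</span>"
        "</body></html>"
    )


class _TextCollector(HTMLParser):
    """Collect text nodes keyed by the tag that directly encloses them."""

    def __init__(self):
        super().__init__()
        self._tag = None
        self.text_by_tag = {}

    def handle_starttag(self, tag, attrs):
        self._tag = tag

    def handle_endtag(self, tag):
        self._tag = None

    def handle_data(self, data):
        if self._tag:
            self.text_by_tag[self._tag] = data


def crawl_from_outside(html):
    """Outside-in: parse markup and re-infer the types of each field."""
    parser = _TextCollector()
    parser.feed(html)
    return {
        "name": parser.text_by_tag.get("h1"),
        "price_usd": float(parser.text_by_tag.get("span", "$0").lstrip("$")),
    }


def emit_from_inside(product):
    """Inside-out: serialize what the app already knows, with no parsing step."""
    return json.loads(json.dumps(product))
```

Both functions return the same dictionary here, but only the outside path depends on the page's markup staying stable.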

Question 2

When should I use Firecrawl instead of AHTML?

Accepted Answer

Use Firecrawl when you need to scrape sites you don’t own, you can’t deploy a plugin to the target site, and you’re fine paying per request for a hosted service.

Question 3

When should I use AHTML?

Accepted Answer

Use AHTML when you own the site and want zero per-request inference cost; when agents need to take typed actions, not just read; when you want MCP and OpenAPI emitted alongside the snapshot; when you want sub-100ms response times (no remote crawl in the loop); or when you want cryptographic provenance (signed snapshots, planned for v0.2).

Feature	Firecrawl	AHTML
Architecture	Hosted crawler (external)	Framework plugin (in-app)
Works on sites you don’t own	✓	—
Zero per-request cost	—	✓
Latency	Crawl + parse round-trip	Single in-process call
Typed actions (cost / reversible / side-effects)	—	✓
MCP server emitted	—	✓
OpenAPI 3.1 emitted	—	✓
JSON-LD emitted	—	✓
llms.txt emitted	—	✓
Auth-walled data support	limited	✓ (in your auth context)
Signed provenance	—	✓ (v0.2 roadmap)
Pricing	Per-request SaaS	Free (MIT)
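The "Typed actions" row can be pictured with a short Python sketch. This is not AHTML's actual schema: the `TypedAction` class, its field names, and the `safe_to_auto_run` helper are assumptions derived only from the three attributes the table names (cost, reversible, side effects). The idea is that an agent can inspect these attributes before deciding whether to call an action unattended.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TypedAction:
    """Hypothetical action descriptor; every field name here is an assumption."""
    name: str
    cost: float              # assumed: estimated cost of one call, arbitrary units
    reversible: bool         # assumed: whether the action can be undone
    side_effects: tuple = () # assumed: labels for what the action touches


def safe_to_auto_run(action, budget):
    """Allow unattended execution only for reversible actions within budget."""
    return action.reversible and action.cost <= budget


# Two illustrative actions an e-commerce site might expose.
add_to_cart = TypedAction("add_to_cart", cost=0.0, reversible=True,
                          side_effects=("cart",))
place_order = TypedAction("place_order", cost=0.0, reversible=False,
                          side_effects=("payment", "email"))
```

Under this policy, `add_to_cart` could run without confirmation while `place_order` would be held for a human, because it is marked irreversible.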

site-emitted vs externally-crawled

The honest table.

Three minutes to install. Decide for yourself.