Engagemii AEO Data Marketplace
Engagemii grades every brand on the web from 0 to 10 for how well AI engines β ChatGPT, Claude, Perplexity, Gemini β can find, parse, and cite them. The score breaks down into six sub-scores covering structured data, content structure, entity clarity, E-E-A-T signals, technical AEO, and AI discoverability. We pair every record with the brand's industry, audience type, tech stack, geography, social URLs, and how many times AI bots have actually crawled them.
Agencies use it to find clients who need fixing. SaaS companies embed the scores into their products. Researchers track which industries are falling behind. Then pull on-demand Fix-It Kits for any brand and walk into sales meetings with the exact patches to apply.
Every column in the CSV
28 columns per brand.
The AEO score
aeoScore
Overall AEO score from 0 to 10 β how well AI engines like ChatGPT, Claude, and Perplexity can find, parse, and cite this brand.
structuredData
Sub-score 0-10 for JSON-LD / schema.org markup quality.
contentStructure
Sub-score 0-10 for headings, sections, and answerable copy.
entityClarity
Sub-score 0-10 for how clearly the brand identifies itself as an entity.
eeaT
Sub-score 0-10 for Experience, Expertise, Authority, and Trust signals.
technicalAeo
Sub-score 0-10 for robots.txt, llms.txt, sitemaps, and crawler accessibility.
aiDiscoverability
Sub-score 0-10 for AI-engine-specific signals (allowlist, ai.txt, etc.).
AI engine crawl activity
aiBotHitsTotal
How many times AI bots (GPTBot, ClaudeBot, PerplexityBot, Applebot, AmazonBot, Meta AI) have crawled this brand. Real demand signal.
aiBotsSeen
Which AI bots have actually visited (semicolon-separated).
Identity & classification
companyName
Brand name as it appears on their site.
domain
Root domain (e.g. shopify.com).
websiteUrl
Full URL we crawled.
businessCategory
One of 20 canonical industries (Legal, E-commerce, Healthcare, Home & Living, etc.).
audienceType
B2B, B2C, or Mixed β classified from on-page signals.
audienceConfidence
Classifier confidence 0-1.
detectedPlatform
What runs the site: shopify, wordpress, wix, webflow, woocommerce, squarespace, ghost, hubspot, etc.
Geography
city
City (when known).
state
US state, lowercased.
country
Country.
Social URLs (direct links)
LinkedIn URL found on the site.
Instagram URL.
Facebook URL.
X (Twitter) URL.
tiktok
TikTok URL.
youtube
YouTube channel URL.
Timestamps
addedToDb
When we first added this brand.
lastUpdated
Last time any field changed.
scoredAt
Last time we re-scored this brand.
Sign up free, browse the data and build your filter inside the portal, then pick a tier and pay only when you're ready to download.
No subscriptions. No auto-renew.
Starter
$1,000
one-time
100,000 records
Test the data on a single vertical or geography.
β’ 100,000 records, your filter applied
β’ Full CSV export, 90-day download link
β’ All 28 columns per row
β’ Fix-It Kits not included at this tier
Agency
$5,000
one-time
1,000,000 records
For agencies pitching clients and lead-gen teams.
β’ 1,000,000 records, your filter applied
β’ Fix-It Kit access unlocked
β’ 50 Fix-It Kits included (worth $4,950)
β’ $5 per additional kit
Enterprise
$10,000
one-time
Full dataset
Everything we have. Product integrations + broad plays.
β’ Every scored brand in the database
β’ Fix-It Kit access unlocked
β’ 200 Fix-It Kits included
β’ $3 per additional kit
β’ API access for programmatic use
How we built this
We crawl and score every brand on the web continuously. The scorer pulls each site's homepage, parses on-page content, JSON-LD structured data, and meta tags, then runs the AEO scoring engine that powers our paid Fix-It Kit product at engagemii.com/aeo. Each brand gets one overall 0-10 score plus six sub-scores. Scores are refreshed when sites change.
We track every AI bot hit (GPTBot, ClaudeBot, PerplexityBot, Applebot, AmazonBot, Meta AI) and tie those hits to the brand pages they visited. The aiBotHitsTotal and aiBotsSeen columns let you see which brands are already being ingested by AI engines β real demand signal you can't get elsewhere.
Brands are classified into 20 canonical industries and a B2B/B2C/Mixed audience type using a sentence-transformer model. The detectedPlatform column tells you the stack (Shopify, WordPress, Webflow, etc.) so you can filter to clients on the platforms you know.