Engagemii AEO Data Marketplace

License the AEO scoring data behind 2 million+ brands.

Engagemii grades every brand on the web from 0 to 10 for how well AI engines (ChatGPT, Claude, Perplexity, Gemini) can find, parse, and cite them. The score breaks down into six sub-scores covering structured data, content structure, entity clarity, E-E-A-T signals, technical AEO, and AI discoverability. We pair every record with the brand's industry, audience type, tech stack, geography, social URLs, and how many times AI bots have actually crawled them.

Agencies use it to find clients who need fixing. SaaS companies embed the scores into their products. Researchers track which industries are falling behind. Once you have the data, pull on-demand Fix-It Kits for any brand and walk into sales meetings with the exact patches to apply.

Every column in the CSV

28 columns per brand.

The AEO score

aeoScore

Overall AEO score from 0 to 10: how well AI engines like ChatGPT, Claude, and Perplexity can find, parse, and cite this brand.

structuredData

Sub-score 0-10 for JSON-LD / schema.org markup quality.

contentStructure

Sub-score 0-10 for headings, sections, and answerable copy.

entityClarity

Sub-score 0-10 for how clearly the brand identifies itself as an entity.

eeaT

Sub-score 0-10 for Experience, Expertise, Authoritativeness, and Trustworthiness signals.

technicalAeo

Sub-score 0-10 for robots.txt, llms.txt, sitemaps, and crawler accessibility.

aiDiscoverability

Sub-score 0-10 for AI-engine-specific signals (allowlist, ai.txt, etc.).
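
To show how the six sub-scores might be used together, here is a minimal Python sketch that picks out a brand's weakest area. It assumes each sub-score arrives as a numeric string in the CSV row; the example row and its values are made up for illustration and are not real data.

```python
# The six sub-score columns listed above.
SUB_SCORE_COLUMNS = [
    "structuredData", "contentStructure", "entityClarity",
    "eeaT", "technicalAeo", "aiDiscoverability",
]

def weakest_sub_score(row):
    """Return (column_name, value) for the lowest of the six sub-scores."""
    scores = {col: float(row[col]) for col in SUB_SCORE_COLUMNS}
    name = min(scores, key=scores.get)  # first column wins on a tie
    return name, scores[name]

# Hypothetical row; every value here is invented for the example.
example_row = {
    "domain": "example.com", "aeoScore": "4.2",
    "structuredData": "2.0", "contentStructure": "6.5", "entityClarity": "5.0",
    "eeaT": "4.0", "technicalAeo": "3.5", "aiDiscoverability": "4.0",
}
print(weakest_sub_score(example_row))  # ('structuredData', 2.0)
```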

AI engine crawl activity

aiBotHitsTotal

How many times AI bots (GPTBot, ClaudeBot, PerplexityBot, Applebot, AmazonBot, Meta AI) have crawled this brand. Real demand signal.

aiBotsSeen

Which AI bots have actually visited (semicolon-separated).
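
Because aiBotsSeen is semicolon-separated, it needs splitting before you can filter on it. The sketch below is one way to do that in Python. The exact spelling of bot names in the data and the handling of blank cells are assumptions, not documented behavior.

```python
def parse_bots_seen(cell):
    """Split a semicolon-separated aiBotsSeen cell into a list of bot names.

    Assumption: an empty or missing cell means no bots were seen.
    """
    return [name.strip() for name in (cell or "").split(";") if name.strip()]

print(parse_bots_seen("GPTBot; ClaudeBot;PerplexityBot"))
# ['GPTBot', 'ClaudeBot', 'PerplexityBot']
print(parse_bots_seen(""))
# []
```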

Identity & classification

companyName

Brand name as it appears on their site.

domain

Root domain (e.g. shopify.com).

websiteUrl

Full URL we crawled.

businessCategory

One of 20 canonical industries (Legal, E-commerce, Healthcare, Home & Living, etc.).

audienceType

B2B, B2C, or Mixed, classified from on-page signals.

audienceConfidence

Classifier confidence 0-1.

detectedPlatform

What runs the site: shopify, wordpress, wix, webflow, woocommerce, squarespace, ghost, hubspot, etc.
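
The classification columns combine naturally into a lead filter. The following Python sketch, using only the standard library, keeps brands that match an audience type and platform, clear a confidence cutoff, and score at or below a threshold. The sample rows, the 0.8 confidence cutoff, and the 5.0 score threshold are illustrative assumptions.

```python
import csv
import io

# Made-up sample rows using a subset of the real column names.
SAMPLE_CSV = """domain,aeoScore,audienceType,audienceConfidence,detectedPlatform
alpha.example,3.1,B2B,0.92,wordpress
beta.example,7.8,B2B,0.95,wordpress
gamma.example,2.4,B2C,0.88,shopify
delta.example,3.9,B2B,0.41,wordpress
"""

def find_leads(rows, audience, platform, max_score=5.0, min_confidence=0.8):
    """Return domains matching audience and platform with a low AEO score."""
    leads = []
    for row in rows:
        if (row["audienceType"] == audience
                and row["detectedPlatform"] == platform
                and float(row["audienceConfidence"]) >= min_confidence
                and float(row["aeoScore"]) <= max_score):
            leads.append(row["domain"])
    return leads

rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
print(find_leads(rows, "B2B", "wordpress"))  # ['alpha.example']
```

Raising min_confidence trades coverage for precision: fewer brands pass, but fewer are misclassified.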

Geography

city

City (when known).

state

US state, lowercased.

country

Country.

Social URLs (direct links)

linkedin

LinkedIn URL found on the site.

instagram

Instagram URL.

facebook

Facebook URL.

twitter

X (Twitter) URL.

tiktok

TikTok URL.

youtube

YouTube channel URL.

Timestamps

addedToDb

When we first added this brand.

lastUpdated

Last time any field changed.

scoredAt

Last time we re-scored this brand.
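
Before loading an export into a pipeline, it can help to confirm that the header carries all 28 columns. This Python sketch assumes the CSV header uses the column names exactly as listed above; the column order in the real file is not specified here, so the check ignores order.

```python
import csv
import io

# The 28 column names documented above.
EXPECTED_COLUMNS = [
    "aeoScore", "structuredData", "contentStructure", "entityClarity",
    "eeaT", "technicalAeo", "aiDiscoverability",
    "aiBotHitsTotal", "aiBotsSeen",
    "companyName", "domain", "websiteUrl", "businessCategory",
    "audienceType", "audienceConfidence", "detectedPlatform",
    "city", "state", "country",
    "linkedin", "instagram", "facebook", "twitter", "tiktok", "youtube",
    "addedToDb", "lastUpdated", "scoredAt",
]

def check_header(csv_file):
    """Return (missing, unexpected) column names for an open CSV file."""
    header = next(csv.reader(csv_file))
    missing = [c for c in EXPECTED_COLUMNS if c not in header]
    unexpected = [c for c in header if c not in EXPECTED_COLUMNS]
    return missing, unexpected

# Simulated export that is missing its last column.
sample = io.StringIO(",".join(EXPECTED_COLUMNS[:-1]) + "\n")
print(len(EXPECTED_COLUMNS))  # 28
print(check_header(sample))   # (['scoredAt'], [])
```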

One-time pricing.

Sign up free, browse the data and build your filter inside the portal, then pick a tier and pay only when you're ready to download.

No subscriptions. No auto-renew.

Starter

$1,000

one-time

100,000 records

Test the data on a single vertical or geography.

• 100,000 records, your filter applied

• Full CSV export, 90-day download link

• All 28 columns per row

• Fix-It Kits not included at this tier

Most Popular

Agency

$5,000

one-time

1,000,000 records

For agencies pitching clients and lead-gen teams.

• 1,000,000 records, your filter applied

• Fix-It Kit access unlocked

• 50 Fix-It Kits included (worth $4,950)

• $5 per additional kit


Enterprise

$10,000

one-time

Full dataset

Everything we have. Product integrations + broad plays.

• Every scored brand in the database

• Fix-It Kit access unlocked

• 200 Fix-It Kits included

• $3 per additional kit

• API access for programmatic use
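
To compare tiers for a given number of Fix-It Kits, the arithmetic can be sketched in Python as follows. The prices and kit allowances come from the tiers above. The assumption that additional kits are simply billed on top of the tier price at the listed per-kit rate is ours, and the function name is hypothetical.

```python
# Tier figures as listed above; records=None means the full dataset.
TIERS = {
    "Starter":    {"price": 1000,  "records": 100_000,   "kits_included": 0,   "extra_kit_price": None},
    "Agency":     {"price": 5000,  "records": 1_000_000, "kits_included": 50,  "extra_kit_price": 5},
    "Enterprise": {"price": 10000, "records": None,      "kits_included": 200, "extra_kit_price": 3},
}

def total_cost(tier_name, kits_needed):
    """One-time tier price plus any kits beyond the included allowance."""
    tier = TIERS[tier_name]
    extra = max(0, kits_needed - tier["kits_included"])
    if extra and tier["extra_kit_price"] is None:
        raise ValueError(f"{tier_name} does not include Fix-It Kit access")
    return tier["price"] + extra * (tier["extra_kit_price"] or 0)

print(total_cost("Agency", 300))      # 5000 + 250 * 5 = 6250
print(total_cost("Enterprise", 300))  # 10000 + 100 * 3 = 10300

# Price per record for the two tiers with a fixed record count.
print(TIERS["Starter"]["price"] / TIERS["Starter"]["records"])  # 0.01
print(TIERS["Agency"]["price"] / TIERS["Agency"]["records"])    # 0.005
```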


How we built this

We crawl and score every brand on the web continuously. The scorer pulls each site's homepage, parses on-page content, JSON-LD structured data, and meta tags, then runs the AEO scoring engine that powers our paid Fix-It Kit product at engagemii.com/aeo. Each brand gets one overall 0-10 score plus six sub-scores. Scores are refreshed when sites change.

We track every AI bot hit (GPTBot, ClaudeBot, PerplexityBot, Applebot, AmazonBot, Meta AI) and tie those hits to the brand pages they visited. The aiBotHitsTotal and aiBotsSeen columns let you see which brands are already being ingested by AI engines: real demand signal you can't get elsewhere.

Brands are classified into 20 canonical industries and a B2B/B2C/Mixed audience type using a sentence-transformer model. The detectedPlatform column tells you the stack (Shopify, WordPress, Webflow, etc.) so you can filter to clients on the platforms you know.
