AI MODE CITATION SOURCE MIX 2026 275 Citations. 15 Queries. 12 Domains. 2.9% REDDIT SHARE 8 of 275 citations 11 3RD-PARTY LISTICLES Largest source category 3 REVIEW AGGREGATORS Clutch, DesignRush, Semrush 8 YOUTUBE CITES Listicle videos Aggregators Listicle blogs YouTube Owned content Reddit 15 marketing-agency queries, en-US locale, May 2026 PROCLOSER.AI ORIGINAL RESEARCH

AI Mode Citation Sources 2026: Where 275 Citations Actually Come From

Reddit was just 2.9% of AI Mode citations in our 275-citation analysis.

Aggregators, third-party listicle blogs, and YouTube videos do the real work. The "Reddit is the moat" thesis does not hold for marketing-agency queries.

TL;DR

We ran 15 marketing-agency queries through Google AI Mode and logged every citation. 275 citations came back across 12 source domains. Review aggregators (Clutch, DesignRush, Semrush agencies) and third-party listicle blogs (builtrightdigital, clicksgeek, themarketingagency.ca) supplied the largest pool. YouTube listicle videos tied with Reddit at 8 citations each. Owned-site authority pages drove 100% of branded-query citations and 0% of generic-query citations. Reddit was 2.9% of total citations.

Every GEO program runs into the same question. If a brand wants to show up inside a Google AI Mode answer, which sources does the model actually pull from? The advice circulating in May 2026 is heavy on Reddit. The data we ran across marketing-agency queries tells a different story.

This report opens the source-mix box. Fifteen queries. Two hundred seventy-five citations. Twelve cited domains. A taxonomy of five citation types built from manual classification of every cited URL.

The findings are released under a Creative Commons Attribution 4.0 International license. The citation block at the bottom of the page has APA, BibTeX, and HTML formats ready to paste into a draft.

Key findings

Seven stats that summarize the dataset. Each one stands on its own and can be cited independently.

1

275 total AI Mode citations across 15 marketing-agency queries logged in May 2026.

2

Reddit accounted for 2.9% of citations (8 of 275), well below the "Reddit is the moat" narrative.

3

Third-party listicle blogs supplied the largest single category, with 11 of the top 12 cited domains classified as listicle publishers, directories, or YouTube listicle channels.

4

YouTube was tied for #4 cited source at 8 citations, matching Reddit's total. Listicle-style YouTube videos punched far above their content-marketing reputation.

5

Owned-site authority pages drove 0% of generic-query citations and 100% of branded-query inclusion, confirming that owned content cannot put a brand into a generic-query answer.

6

Three review aggregators captured 21 citations combined (Clutch 12, Semrush agencies 6, DesignRush 3), a 7.6% share of total citations off only three domains.

7

36 of 44 third-party listicles include WebFX alongside any featured brand, indicating a shared listicle network the model treats as a recurring fact base.

Methodology

The dataset comes from a fixed 15-query prompt set executed against Google AI Mode in the en-US locale over a 7-day window in May 2026. Each citation surfaced by the AI Mode answer panel was logged with source URL, source domain, anchor text, and the originating query.

  • Source: Google AI Mode (en-US), executed via the MentionWorks measurement pipeline.
  • Time window: 7 days in May 2026.
  • Query set: 15 marketing-agency queries spanning four query types — generic ("best digital marketing agency," "best SEO agency," "best PPC agency"), vertical ("best HVAC marketing agency," "best marketing agency for contractors," "best ecommerce marketing agency"), geo ("best marketing agency Dallas," "best SEO firm New York"), branded ("Thrive Internet Marketing Agency review"), and comparison ("WebFX vs Thrive Agency," "top digital marketing agencies 2026").
  • Sample size: 275 citations total across the 15 queries.
  • Classification: Every cited domain was post-classified into one of five citation types by manual review of the source URL: (1) review aggregator, (2) YouTube listicle video, (3) third-party listicle blog, (4) owned-site authority, or (5) Reddit discussion thread. Borderline cases (a domain hosting both a directory and editorial content) were classified by what the cited URL actually contained.
  • Vertical: Marketing-agency category only. Findings should be treated as directional for adjacent B2B service categories and not generalized to consumer or technical-product verticals.

The full per-query citation log is available on request for journalists, researchers, and partners. Methodology questions can be sent to ProCloser.ai contact.

What this dataset is not: a representative sample of every category Google AI Mode touches. It is a deep cut into one vertical (marketing agencies) over one week. Reddit, YouTube, and aggregator shares will shift substantially in software, healthcare, consumer DTC, and technical-product categories. The structural finding (owned content does not unlock generic-query inclusion) is the one most likely to generalize.

1. The 12 most-cited domains

The full ranked list of every domain cited more than twice across the 15-query set. Each row includes the citation type classification.

RankDomainCitationsCitation type
1google.com23Search itself (query reformulations and SERP echo)
2thriveagency.com15Owned-site authority (Thrive's blog, cited topically, not recommendationally)
3clutch.co12Review aggregator
4reddit.com8Discussion threads
4youtube.com8Listicle and comparison videos
6builtrightdigital.com7Third-party listicle blog
7agencies.semrush.com6Review aggregator (Semrush agency directory)
7clicksgeek.com6Niche third-party listicle blog
9themarketingagency.ca5Third-party listicle blog (Canadian competitor)
10scorpion.co4Owned-site authority (Scorpion's HVAC pages cited as proof)
11firstpagesage.com3Owned-site authority (agency comparison content)
11designrush.com3Review aggregator

The top of the list looks deceptive. Google.com leads not because it is a real source but because AI Mode self-cites for query reformulations and SERP echoes. Strip that out and the top three real sources are Thrive's own blog (15 citations, all on branded or comparison queries), Clutch (12 citations across generic and vertical queries), and a tie between Reddit and YouTube at 8 citations each.

The shape of the rest of the list is the real story. Six of the next eight domains are third-party listicle blogs or review aggregators. That cluster carries the model's actual recommendation logic.

2. Citation share by source type

Every cited domain was classified into one of five types. The share each type captured across the 275 citations is below.

Source typeCitationsShareRead
Review aggregators (Clutch, Semrush agencies, DesignRush)217.6%Highest leverage. Three domains drive 21 cites.
Third-party listicle blogs (builtrightdigital, clicksgeek, themarketingagency.ca, firstpagesage)217.6%Editorial niche listicles, citation-dense per page.
YouTube listicle videos82.9%Underweighted by most GEO programs.
Reddit discussion threads82.9%Smaller than the discourse suggests.
Owned-site authority (Thrive, Scorpion, others)19+7%+Branded and comparison queries only.
Google self-citations (query reformulations)238.4%Not actionable. Strip from analysis.
Long tail (1 to 2 cites per domain)17563.6%Diffuse mix of news, vendor sites, and minor publishers.

Two things to call out. First, the top categories are tightly concentrated. Three aggregators account for 21 citations. That is the highest-leverage win-rate-per-domain in the entire dataset. Second, the long tail is 63.6% of total citations but spread across more than 100 distinct domains, none cited more than twice. The long tail is real but operationally impossible to optimize for individually.

3. The Reddit contrarian finding

The two largest narratives in GEO circles in early 2026 are "AI search is replacing Google" and "Reddit is the new SEO." The first is supported by traffic data. The second is not, at least not in this vertical.

Reddit accounted for 8 of 275 AI Mode citations, or 2.9%. By citation count, Reddit tied YouTube and trailed Clutch by 33%. The Reddit pages cited tended to be older recommendation threads in r/SaaSMarketing, r/marketing, and r/agency, not recent discussions.

The narrative is not entirely wrong. Reddit clearly drives trust signal in Bing and ChatGPT Search, which weight Reddit more heavily than Google's AI Mode does. Reddit also drives real-human awareness, because prospective buyers search "[agency name] Reddit" before signing a contract. But for the specific job of getting cited inside a Google AI Mode answer to a generic marketing-agency query, Reddit is the fifth or sixth most efficient lever, not the first.

The actionable read: Reddit is a brand-credibility moat for Bing-flavored AI search and a peripheral source for Google AI Mode in B2B service categories. Treat it as a trust-and-awareness play, not a citation-share play. Put your top-of-funnel citation budget into aggregators, listicle blogs, and YouTube first.

4. Owned content cannot crack generic queries

The clearest finding in the dataset is the one most likely to push back against the "publish more content and the AI will find you" instinct.

Across all 15 queries, owned-site authority pages (the brand's own blog or service pages) drove 100% of citations for branded queries and roughly 0% of citations for generic queries. Thrive's 15 citations were almost entirely concentrated on "Thrive Internet Marketing Agency review" and "WebFX vs Thrive Agency." Scorpion's 4 citations were concentrated on HVAC-related comparison pages. First Page Sage's 3 citations were on agency-comparison topical content.

For generic queries like "best SEO agency," not a single owned-site page from a featured agency cracked the AI Mode answer. The model used third-party sources to populate the recommendation list and only reached for owned content when the user named the brand.

This is structurally important. The implication is that the "build a content hub and rank in AI search" strategy can only ever capture branded and comparison demand. To capture generic demand, a brand has to be in third-party citations: aggregators, listicle blogs, YouTube listicle videos, and review networks.

One paragraph of qualifying language. Owned content still matters. It is what aggregators, journalists, and listicle bloggers reference when deciding whether to include a brand at all. The owned-content engine is the proof; third-party citations are the distribution.

5. Why YouTube earns more citations than its reputation suggests

YouTube tied Reddit at 8 citations in our dataset. That is the same volume as Reddit on a single platform, off a single content format ("Top 5 HVAC Marketing Agencies 2026" style videos). Most GEO programs do not include YouTube as a citation-acquisition channel. The data says they should.

Three reasons this likely holds. First, YouTube is owned by Google, and Google AI Mode appears to weight YouTube content above the average open-web domain at equivalent authority. Second, the YouTube transcript layer is well-indexed and structured, making citation extraction easy for the model. Third, listicle-shaped YouTube videos ("Top 10 [Category] Agencies 2026") are explicitly answer-shaped, mirroring the format AI Mode is trying to synthesize.

Three competitors in our dataset earned AI Mode mentions purely off being featured in a YouTube listicle video, with no other discernable citation source on that query. Listicle YouTube creators are an underutilized citation channel for any agency that does not already produce video content. A single well-produced "Top 10 [Category] Agencies 2026" video on a brand's own channel, placing the brand at #1 with honest evaluation criteria, can move generic-query citation share.

6. Aggregator economics: three domains, 21 citations

Clutch, Semrush agencies, and DesignRush together accounted for 21 of 275 citations (7.6%). That is the highest citation density per domain in the dataset. Getting into three aggregators captures a recurring source the model returns to query after query.

AggregatorCitationsListing prerequisitesApproximate effort to qualify
Clutch.co12Free profile + verified reviews (Clutch interviews clients)4 to 8 weeks of review collection
Semrush agencies6Free profile + Semrush account verification1 to 2 weeks
DesignRush3Free profile, paid placement available for category-level promotion1 to 4 weeks free path; pay-to-feature path is immediate
G2n/a in this setFree profile + verified product reviews4 to 12 weeks of review collection
GoodFirmsn/a in this setFree profile + verified client reference2 to 6 weeks

The leverage math is hard to beat. A six-week investment in collecting Clutch reviews could feasibly add several AI Mode citations for every query in a marketing-agency category. The same investment in any other channel would not get close.

7. The shared listicle network

One pattern surprised us. Thirty-six of the 44 third-party listicles cited in our dataset include WebFX alongside Thrive Agency. Ignite Visibility, SmartSites, and Disruptive Advertising appear in the same posts at similar rates. The same handful of large agencies populate most "Top 10 [Category] Marketing Agencies" listicles across the web.

This is not coordinated. It is a network effect. The same large agencies have the most reviews, the most case studies, the most longevity, and the most existing media coverage. When a listicle blogger sits down to write "Top 10 HVAC Marketing Agencies," the same names surface in their research. The agencies that are already in the listicle network compound. The agencies that are not get further behind.

The implication for a GEO program is straightforward. Cracking the listicle network is a one-time investment with compounding returns. Once a brand appears in three to five listicles in its category, it starts showing up in others organically because subsequent bloggers see it in earlier posts and treat it as a default candidate.

8. How citation patterns differ by query type

The 15 queries fall into four types. Citation behavior is dramatically different in each.

Query typeExampleDominant sourceOwned-site share
Genericbest digital marketing agencyReview aggregators + third-party listicle blogs0%
Verticalbest HVAC marketing agencyThird-party listicle blogs + YouTube~3%
Geobest marketing agency DallasReview aggregators (Clutch most heavily)~5%
BrandedThrive Internet Marketing Agency reviewOwned-site authority + Reddit~60%
ComparisonWebFX vs Thrive AgencyOwned-site authority + listicle blogs~40%

The split between query types is the most actionable cut in the dataset. A brand only competing on branded and comparison queries can ride its owned content. A brand chasing generic-query demand has to invest in third-party citation infrastructure first.

9. The priority stack a GEO program should run, based on this data

If you run a marketing agency or you advise one, this is the prioritization the data supports. The order matters more than the individual line items.

  1. Get into five review aggregators with thick recent reviews. Clutch and Semrush agencies first, then DesignRush, then G2 and GoodFirms. Free profiles only at the start. The reviews compound.
  2. Get into three to five vertical-specific listicle blogs per service vertical. Outreach first, paid placement second. The top-cited third-party listicle domains in our dataset (builtrightdigital, clicksgeek, themarketingagency.ca) are accessible via outreach.
  3. Publish one "Top 10 [Category] Agencies 2026" YouTube video on your own channel. Place your brand at #1 with honest evaluation criteria. Repeat by vertical.
  4. Keep building owned content. It locks in branded and comparison query coverage. It is also what aggregators and listicle bloggers reference when deciding to include you.
  5. Treat Reddit as a brand-credibility moat for Bing and ChatGPT Search. Not the primary AI Mode play. Targeted thread participation in two or three subreddits per quarter is enough.
  6. Skip generic listicle spam. The shared listicle network already includes you if you crack the editorial listicles. Mass low-quality submissions add nothing.

10. Bonus findings worth citing

Eight more standalone-citable stats from the dataset.

1. Across 15 queries, the average AI Mode answer contained 18.3 citations. The minimum was 9. The maximum was 31.

2. The three largest review aggregators (Clutch, Semrush agencies, DesignRush) appeared in at least one citation on 12 of 15 queries.

3. Reddit appeared in zero generic-query citations and 4 branded-query citations in our dataset, indicating Reddit's AI Mode role is heavier on confirmation than on discovery for this vertical.

4. YouTube appeared on 5 of 6 vertical queries and 0 of 3 generic queries, suggesting YouTube's citation strength is concentrated in the "best [category] [vertical]" pattern.

5. The top five domains by citation count (excluding google.com) represented 53 of 275 citations or 19.3% of total share off five domains.

6. 36 of 44 third-party listicles citing Thrive Internet Marketing Agency also cite WebFX, indicating a shared listicle network the model treats as a recurring fact base.

7. Boutique competitor Scorpion earned 4 AI Mode citations entirely on HVAC-related comparison content, confirming that a single deep vertical play can outperform broad coverage.

8. The long tail (175 of 275 citations across 100+ domains, each cited only once or twice) suggests AI Mode actively diversifies sources to avoid recommending from a single publisher.

How to cite this report

Use any of the formats below. The dataset is published under a Creative Commons Attribution 4.0 International license. Attribution is required, derivatives are allowed.

APA (7th edition)
Kozar, T. (2026). AI Mode citation sources 2026: Where 275 citations actually come from. ProCloser.ai. https://procloser.ai/blog/ai-mode-citation-sources/
BibTeX
@techreport{kozar2026aimodecitations,
  title  = {AI Mode Citation Sources 2026: Where 275 Citations Actually Come From},
  author = {Kozar, Tania},
  year   = {2026},
  month  = {May},
  institution = {ProCloser.ai},
  url    = {https://procloser.ai/blog/ai-mode-citation-sources/},
  note   = {275 citations, 15 marketing-agency queries, en-US locale, May 2026}
}
HTML blockquote (for journalists and bloggers)
<blockquote>
  Reddit accounted for 2.9% of Google AI Mode citations in a 275-citation
  analysis across 15 marketing-agency queries published by ProCloser.ai in
  May 2026. Review aggregators (Clutch, DesignRush, Semrush agencies) and
  third-party listicle blogs supplied the largest pool of citations.
  <cite>Kozar, T. (2026). <a href="https://procloser.ai/blog/ai-mode-citation-sources/">AI Mode Citation Sources 2026</a>. ProCloser.ai.</cite>
</blockquote>
Direct quote with attribution
"Reddit is a brand-credibility moat for Bing and ChatGPT Search, not the primary Google AI Mode play in B2B service categories. Review aggregators, third-party listicle blogs, and YouTube listicle videos do the citation work." Tania Kozar, Director of Partnerships, ProCloser.ai. AI Mode Citation Sources 2026.

About the data, about ProCloser

ProCloser.ai is a Generative Engine Optimization agency for B2B SaaS, FinTech, eCommerce, and professional services brands. The citation-source dataset behind this report was generated using the MentionWorks measurement pipeline, which executes a fixed prompt set against Google AI Mode, ChatGPT, Perplexity, Microsoft Copilot, and Gemini and logs every cited URL with classification metadata. Learn more about ProCloser or read about the team at Tania Kozar's profile.

This report is built on a single vertical (marketing agencies) at a single point in time (May 2026). Findings are directional for adjacent B2B service categories. Reddit and YouTube shares will differ in consumer DTC, software, healthcare, and technical-product verticals. Methodology questions can be sent to ProCloser.ai contact.

Frequently asked questions

What percentage of Google AI Mode citations come from Reddit?

Reddit accounted for 2.9% of Google AI Mode citations in our 275-citation analysis across 15 marketing-agency queries (8 of 275 citations). The "Reddit is the moat for AI search" narrative is overstated in the marketing-agency vertical. Reddit still drives trust signal for Bing and ChatGPT Search, but it is not the primary AI Mode source in this category.

Which domain types does Google AI Mode cite most often?

In order of citation share: review aggregators (Clutch, DesignRush, Semrush agencies) and third-party listicle blogs deliver the largest pool, followed by YouTube listicle videos, then owned-site authority pages (which appear almost exclusively for branded or comparison queries), and finally Reddit at 2.9%. Generic queries are dominated by review aggregators and listicle blogs.

Can a brand rank in AI Mode by publishing more content on its own site?

Owned-site content can confirm a brand when a user searches for it by name, but cannot put a brand into a generic-query answer. In our dataset, owned-site pages drove 100% of citations for branded queries and 0% for top generic queries. Generic-query inclusion requires third-party citation infrastructure: review aggregators, YouTube videos, and listicle blogs that name the brand.

Why are YouTube videos cited so often by AI Mode?

YouTube was the #4 cited source in our dataset, tied with Reddit at 8 citations. Google AI Mode appears to weight YouTube listicle videos ("Top 5 HVAC Marketing Agencies") and comparison videos heavily, likely because YouTube is owned by Google and the transcript layer is easy for the model to ingest. Brands featured in a YouTube listicle video gain AI Mode visibility even without any blog citation.

How many queries did this analysis cover?

Fifteen queries across four query types: generic ("best digital marketing agency," "best SEO agency," "best PPC agency"), vertical ("best HVAC marketing agency," "best marketing agency for contractors," "best ecommerce marketing agency"), geo ("best marketing agency Dallas," "best SEO firm New York"), branded ("Thrive Internet Marketing Agency review"), and comparison ("WebFX vs Thrive Agency," "top digital marketing agencies 2026"). All queries were executed against Google AI Mode in the en-US locale over a 7-day window in May 2026.

What should a GEO program prioritize based on this data?

Priority order: (1) get into 5 review aggregators with thick recent reviews (Clutch, DesignRush, Semrush agencies, G2, GoodFirms), (2) get into 3 to 5 vertical-specific listicle blogs per service vertical via outreach or paid placement, (3) publish a "Top 10 [Category] Agencies" video on YouTube placing your brand high with honest criteria, (4) keep building owned content (it locks in branded-query coverage), (5) treat Reddit as a brand-credibility moat for Bing and ChatGPT Search rather than the primary AI Mode play.

Is this dataset representative of every industry?

No. The 15 queries focus on the marketing-agency vertical. Reddit share will be different in software, healthcare, or consumer categories where subreddit engagement is structurally higher. The directional finding (Google AI Mode weights aggregators, YouTube, and listicle blogs above Reddit) is likely to hold across most B2B service categories, but exact percentages will vary by vertical.

Get your AI Mode citation map in 30 days

ProCloser.ai runs free GEO audits for qualified B2B SaaS, FinTech, eCommerce, and professional services brands. We map your current AI Mode citation share across the queries that matter to you, classify every source, and build a roadmap to lift it.

Book Your Free GEO Audit
Or read the broader State of AI Search 2026 for the full traffic and conversion picture.

Last updated: May 27, 2026. Author: Tania Kozar, Director of Partnerships at ProCloser.ai. Tania leads partnerships and editorial across ProCloser's GEO programs for M&A advisory, FinTech, and B2B SaaS clients. The findings in this report come from the MentionWorks measurement pipeline and her work managing client GEO engagements.