New: Real-time hallucination alerts are live. Learn more →

LLM Metrix

Glossary of Terms

117 definitions covering AEO, GEO, AI visibility, and the terminology behind AI-powered search optimization.

A B C D E F G H IJK L M N O P Q R S T U V WXYZ

A

AEO

Answer Engine Optimization — The practice of optimizing content to appear prominently in AI-generated answers and responses.

Answer Engine

An AI system that generates direct answers to user queries rather than returning a list of links. Examples: ChatGPT, Claude, Perplexity, Gemini.

AI Overviews

Google's AI-generated summaries placed above traditional search results, powered by Gemini with live web retrieval — one of the highest-value AEO placement opportunities.

AI Agent

An AI system that autonomously plans and executes multi-step research tasks — browsing pages, following links, and synthesizing findings — rather than answering a single query. Changes the rules for brand visibility as engines like ChatGPT and Perplexity adopt agentic modes.

Anchor Text

The clickable text of a hyperlink — when other sites link to your pages with category-relevant anchor text, it signals to AI retrieval systems what your brand and pages are about, influencing category association and retrieval ranking.

AI Recommendation

A suggestion made by an AI engine for a product, service, or tool in response to a user query — the AI equivalent of a trusted referral, delivered at scale. Higher purchase-intent than most advertising because the user is actively asking for a recommendation in a defined context.

Author Authority

The recognized expertise and credibility of the individual who wrote a piece of content — a key E-E-A-T signal. AI retrieval systems score documents partly on author credibility, making named bylines, author bio pages, and Person schema markup directly actionable for improving citation rates.

Answer Box

A formatted display in Google SERPs that presents a direct answer — extracted from a third-party page — at the top of results. The behavioral and structural precursor to AI-generated answers. Pages that consistently win answer boxes typically have the direct-answer structure that AI engines also prefer for citations.

AI Mode

Google's dedicated AI-powered conversational search interface — a full-page multi-turn research experience powered by Gemini, separate from standard Google Search. Announced at Google I/O 2025. Higher value for considered-purchase and B2B brands because users conduct deeper research in AI Mode than in standard AI Overviews.

B

Brand Sentiment

The overall tone (positive, neutral, negative) with which AI engines discuss your brand in their responses.

Brand Recall

The strength with which AI models associate your brand with a particular category or use case — determines how often your brand surfaces in category-level queries without being explicitly named.

Brand Safety

The practice of detecting and correcting harmful, inaccurate, or misleading AI-generated representations of your brand — before they reach customers at scale across every engine.

Brand Entity Disambiguation

The process by which AI systems resolve which specific entity a brand name refers to — especially important for brands with common words in their names. Strong disambiguation signals (consistent category anchoring, Wikidata entries, schema markup) reduce hallucination and category confusion in AI responses.

Brand Mention

Any appearance of your brand name in an AI-generated response, whether or not a URL link is included. The broader category containing citations as a subset — all citations are brand mentions, but not all brand mentions are citations.

Brand Salience

The degree to which a brand comes to mind in a relevant buying situation. In the AI era, brand salience is increasingly mediated by AI engines — brands absent from AI category recommendations never enter the buyer's mental consideration set, making AI visibility a new dimension of salience strategy.

Brand Equity

The commercial value a brand name adds beyond its product's functional utility. In AI visibility: high-equity brands have richer training data signals and receive more confident AI recommendations. Brand equity also creates a durable competitive moat — displacing a well-known brand from AI consideration sets requires generating far more training data volume.

BM25

A classical document retrieval algorithm that ranks documents by term frequency and inverse document frequency — used in traditional search engines and as a first-pass retrieval stage in many AI search pipelines. The technical grounding for why vocabulary matching and using audience-native terminology still matters in AI-indexed content.

Brand Voice

The consistent personality, tone, and terminology with which a brand communicates. Relevant to AI visibility because consistent brand voice creates coherent entity associations in AI training data — and content written with authoritative, direct voice is more strongly associated with confident AI recommendations than hedging or vague language.

Backlink Profile

The complete collection of external links pointing to a website — including quantity, quality, domain diversity, anchor text, and topical relevance. A strong backlink profile is a primary input into domain authority scores that AI retrieval systems use as trust signals when selecting content to cite.

C

Citation

A reference to an external source that an AI engine includes in its response, either as an inline link, footnote, or attribution.

Content Gap

A topic or query where competitors are cited by AI engines but your brand is not — identifying and closing content gaps is one of the highest-leverage AEO strategies.

Competitive Displacement

When a competitor's brand appears in an AI response in place of yours — the AI-era equivalent of losing search rankings, with compounding effects across all instances of a query.

Content Freshness

The recency of web content as a ranking signal for AI engines — particularly RAG-powered systems that prefer recently updated pages when retrieving sources for citation.

Context Window

The maximum amount of text an LLM can process in one interaction — including query, retrieved documents, and system instructions. Determines how much of your content is actually read and cited in a single AI response.

Chunking

The process of splitting long documents into smaller segments for RAG indexing — chunks are what retrieval systems actually retrieve, not whole pages. Heading structure and section clarity determine where chunk boundaries fall and which content gets cited.

Citation Velocity

The rate at which your brand gains new AI citations over time — how quickly you're being added to AI-generated responses. A rising velocity signals an effective AEO strategy; flat or declining velocity signals a plateau even if total citation count is high.

Content Velocity

The rate at which a brand publishes new, indexable content over time. High content velocity within a focused topic cluster builds topical authority faster and increases AI citation coverage — compounding over time as each new piece expands query surface area.

Crawl Budget

The number of pages a web crawler will fetch from your site within a given period. AI crawlers (GPTBot, PerplexityBot) have independent crawl budgets — pages not crawled are not eligible for AI citation. Wasted budget on duplicate or thin pages means important content may go unindexed.

Consideration Set

The shortlist of brands a buyer actively evaluates before making a purchase decision — typically 2–5 options. AI engines now form consideration sets directly: the brands named in an AI response to a category query become the buyer's shortlist, making AI mention presence prerequisite to entering evaluation.

Conversational Search

Querying an AI system using natural language sentences rather than keyword fragments — the dominant query mode for AI engines. Conversational queries contain context, constraints, and persona signals that change which brands get recommended. Content written to address specific scenarios and constraints performs better for conversational retrieval.

Citation Diversity

A measure of how many different source types and domains mention your brand — vs. concentrated coverage from few sources. High citation diversity signals broad market recognition to AI engines and provides resilience; concentrated coverage from few sources may not generalize across query types.

Canonical URL

The preferred, definitive URL for a page — specified with a `<link rel='canonical'>` tag. Consolidates crawl budget and link equity to a single URL when duplicate or near-duplicate versions exist. Missing or incorrect canonical tags fragment authority across URL variants, reducing AI retrieval priority.

Content Hub

A dedicated site section aggregating related content — articles, guides, glossary entries, case studies — organized around a central topic. Creates topical authority signals at scale through content breadth and internal link density. More AI-visibility-effective than a chronological blog because all content remains equally discoverable.

Category Leadership

The position of being recognized as the foremost brand in a product or service category. In AI responses: category leaders are named first in broad category queries and used as benchmarks against which other brands are compared. Challenger brands can target category leadership in specific niches before competing for broad category ownership.

Content Audit

A systematic review of all published content on a website — assessing quality, accuracy, relevance, and performance — to determine what to keep, update, consolidate, or remove. Essential for AEO because AI crawlers index your entire content footprint; outdated or thin content wastes crawl budget and creates brand safety risks.

Cosine Similarity

A metric measuring how similar two vectors are — used in AI search to compare query embeddings to document embeddings. Documents with higher cosine similarity to the query are retrieved first. Explains why semantically precise, vocabulary-consistent content outperforms keyword-dense content in AI retrieval.

D

Domain Authority

A metric of your website's overall trustworthiness and credibility based on backlink profile, age, and content quality — influences AI visibility.

Digital PR

The practice of earning online press coverage, editorial mentions, and backlinks through story-driven outreach. One of the highest-leverage AEO activities because it produces third-party citations on high-authority domains — the kind of signal AI engines weight most heavily for brand authority.

Duplicate Content

Substantial content blocks appearing at multiple URLs — within a site or across domains. Fragments crawl budget and dilutes link equity by splitting authority across URL variants. Fixed by canonical tags, 301 redirects, and page consolidation. Common sources: URL parameters, HTTP/HTTPS variants, and syndicated content without canonical tags.

Dwell Time

The amount of time a user spends on a page after arriving. A content quality signal that correlates with AI citation-friendly attributes: comprehensive coverage, direct answers, and structured depth. Content optimized for human engagement quality tends to also meet AI retrieval quality standards.

E

E-A-T

Expertise, Authoritativeness, Trustworthiness — Google's quality framework that also strongly influences how AI systems evaluate and cite content.

Entity

A uniquely identifiable concept — brand, person, product, or place — recognized by AI systems as a distinct object with known attributes and category relationships.

Embeddings

Mathematical vector representations of text that encode meaning rather than exact words — the core technology behind semantic search and RAG retrieval, enabling AI engines to find conceptually relevant content regardless of keyword overlap.

E-E-A-T

Experience, Expertise, Authoritativeness, Trustworthiness — Google's content quality framework. RAG-powered AI engines, especially Google AI Overviews, use similar signals to decide which sources to cite. Author attribution, original research, external citations, and factual accuracy all contribute.

Evergreen Content

Content designed to remain accurate and valuable over an extended period without frequent rewrites. The foundation of AI visibility strategy: evergreen pages accumulate citation signals, stay in RAG indexes, and generate stable brand-concept associations in LLM training data — compounding in value over months and years.

F

First Mention

When your brand is the first one named in an AI response — the most valuable position for visibility and recall.

Fine-tuning

The process of training a pre-trained AI model on a domain-specific dataset to adapt its behavior — relevant for enterprise AI deployments and specialized industry models.

Foundation Model

A large AI model trained on broad general-purpose data that underlies specific AI products — GPT-4, Claude, Gemini, and Llama are foundation models. Major model updates can shift brand visibility even when your own content hasn't changed.

Featured Snippet

A selected excerpt Google displays above organic search results to directly answer a query — also called 'position zero.' The structural precursor to AI-generated answers. Content that wins featured snippets is typically well-positioned for AI citation because both reward the same direct-answer format.

Few-Shot Prompting

A technique that guides AI model behavior by including 2–5 examples in the prompt — showing the model the desired pattern before asking it to perform a task. Useful in AEO research for designing persona-accurate monitoring queries and investigating how AI engines represent your brand under realistic buyer framing conditions.

G

GEO

Generative Engine Optimization — Strategies for improving brand visibility and representation in generative AI systems like ChatGPT, Claude, and Gemini.

Grounding

The process by which AI engines connect their responses to real-world sources and facts, often through retrieval-augmented generation (RAG).

Geographic Variation

The phenomenon where the same AI query returns different brand mentions, rankings, and citations depending on the user's country or region — a major blind spot for brands with international presence.

GPTBot

OpenAI's official web crawler that indexes content for ChatGPT's browse mode and RAG features. Blocking it in robots.txt removes your content from ChatGPT's retrieval system. User agent: GPTBot/1.0. Allowing access is recommended for any brand pursuing ChatGPT citation presence.

H

Hallucination

When an AI model generates factually incorrect or fabricated information with the same confidence as accurate facts — a key risk for brand accuracy in AI responses.

I

Impression Rate

The percentage of tracked queries for which your brand appears in AI-generated responses — the broadest measure of AI brand presence.

Indexability

The degree to which AI engines and search systems can discover and crawl your content — a prerequisite for appearing in RAG-powered AI responses.

Inference

The real-time process by which a trained AI model generates a response to a user's prompt. Distinct from training (which happens once in advance). Brand visibility is determined at inference time by training data recall, live retrieval, and system prompt configuration.

Internal Linking

Linking from one page on your website to another page on the same domain. Distributes link equity, guides AI crawlers to important content, and signals topical relationships between pages. Well-structured internal linking is one of the most controllable factors in improving AI retrieval coverage for strategically important content.

In-Context Learning

An LLM's ability to adapt to new tasks or incorporate new information from content provided within the current session — without retraining. Explains why well-structured, dense content improves citation quality: when retrieved by RAG, it provides richer in-context information for more accurate and specific AI responses.

K

Knowledge Graph

A structured database of entities and their relationships used by search engines and AI systems to represent factual knowledge — inclusion anchors your brand's attributes and reduces hallucinations.

Knowledge Cutoff

The date after which an AI model has no training data — events, products, and changes after this date are unknown to the model unless retrieved via RAG.

L

Listed Mention

When your brand appears in a list alongside competitors, typically with less emphasis than a first or prominent mention.

LLM

Large Language Model — The foundational AI technology behind chatbots like ChatGPT, Claude, and Gemini that powers natural language understanding and generation.

Lift

The measurable improvement in AI visibility metrics resulting from a specific content action or optimization — used both as a forecast (projected lift) before acting and as an attribution metric after shipping.

LLMO

Large Language Model Optimization — an emerging industry term for improving brand visibility in LLM outputs, used interchangeably with AEO and GEO by different practitioners and publications.

Link Equity

The authority value that flows from one page to another through hyperlinks — the higher the linking page's authority, the more equity passes. For AI visibility: link equity from relevant, authoritative sources improves retrieval ranking for RAG engines; the semantic context of linking text also shapes brand-category associations.

M

Mention Positioning

The location and prominence of your brand within an AI response — can be first mention (most valuable), prominent mention (medium), or listed mention (lower value).

Model Alignment

Training that shapes an LLM to be helpful, harmless, and honest — using techniques like RLHF. Alignment policies govern how models handle brand recommendations and can suppress or qualify mentions in sensitive categories, independent of retrieval quality.

Multimodal

AI models that process multiple data types — text, images, charts, video, and code. As AI search becomes multimodal, product screenshots, branded infographics, and video transcripts become part of your citable content surface.

Multi-Turn Conversation

An AI interaction spanning multiple exchanges where context accumulates across messages. Follow-up queries in a session are often higher-intent and more specific than opening queries — brands with content addressing refined, contextual queries appear in these higher-value moments.

Model Distillation

A technique for creating smaller, faster AI models by training them to replicate a larger model's behavior. Consumer AI products often run distilled models that may have less brand-specific knowledge than full-size models — especially for niche or newer brands whose training data signals were compressed out in the distillation process.

N

NLP

Natural Language Processing — the AI field enabling machines to understand and generate human language; the foundational technology behind semantic search, entity recognition, intent detection, and LLMs.

Named Entity Recognition (NER)

An NLP technique that automatically identifies and classifies named entities in text — brands, people, places, products — as distinct labeled objects. The mechanism by which AI engines learn which category your brand belongs to based on how it's named alongside category language across the web.

O

Open Graph

An HTML metadata protocol that defines how a page appears when shared or fetched — specifying title, description, image, and content type via `og:` meta tags. AI crawlers that fetch page metadata read OG tags as a structured summary; well-written OG descriptions improve how crawlers understand and categorize page content.

Organic Search

Unpaid search engine results earned through SEO rather than paid advertising. Organic search and AI-generated answers increasingly compete for user attention, with AI Overviews appearing above organic results for many queries. Strong organic search authority creates the infrastructure that AI visibility requires — but AI responses increasingly intercept users before they reach organic results.

P

Prominent Mention

When your brand is featured in a key position (introduction, recommendation, or highlighted section) of an AI response.

Prompt

A natural language input submitted to an AI engine by a user — understanding how prompts are phrased and which brands they surface is foundational to AEO research.

Position Drift

The gradual or sudden shift in where your brand appears within AI responses over time — moving from first mention to listed mention, or from prominent to buried — a key early warning signal for eroding AI visibility.

Passage Indexing

The ability of retrieval systems to index and rank individual passages within a page independently — meaning a single highly relevant section can be retrieved and cited even if the overall page isn't a top result.

Prompt Engineering

The practice of designing and refining text inputs to AI systems to produce more accurate or targeted outputs. In AEO, relevant for designing effective monitoring queries that approximate real buyer behavior, and for understanding how system prompts shape AI product behavior.

Pillar Page

A comprehensive, long-form content piece that covers a broad topic area in full, serving as the authority hub for a topic cluster with links to all supporting cluster pages. Retrieved frequently by RAG systems for a wide range of related queries due to its broad semantic coverage.

PerplexityBot

The web crawler operated by Perplexity AI that indexes content for its RAG-first retrieval system. Perplexity displays inline citation links prominently, making PerplexityBot citations particularly traffic-visible. Allowing PerplexityBot access in robots.txt is recommended for any brand pursuing AI citation presence.

Page Authority

A metric predicting how well a specific page will rank, based on the quantity and quality of links pointing to that individual URL — distinct from domain authority. For AI retrieval, pages with higher page authority are retrieved more frequently. Direct link-building to strategic content pages improves both PA and AI retrieval priority.

Q

Query Cluster

A group of semantically related queries that users might ask AI engines about your brand, product category, or industry.

Query Intent

The underlying goal behind a user's search or AI prompt — informational (learn), navigational (find), commercial investigation (compare), or transactional (act). Matching your content format to the intent of target queries is one of the highest-leverage AEO tactics.

Query Expansion

A retrieval technique that broadens a user's original query with related terms and synonyms before searching — enabling RAG engines to find relevant content that doesn't match the exact query wording. Explains why topical depth (covering related concepts) beats exact-phrase matching in AI search.

R

RAG

Retrieval-Augmented Generation — A technique where AI systems retrieve relevant documents before generating responses, improving accuracy and citations.

Re-ranking

A second-pass scoring step in RAG pipelines that refines the initial retrieved candidates using a more precise relevance model — determines the final citation order and which sources make it into the LLM's context window.

RLHF

Reinforcement Learning from Human Feedback — a training technique that shapes AI models to produce outputs consistent with human preferences. Influences how AI engines frame brand recommendations: RLHF-trained models often present brands with balanced pros/cons framing and may be conservative in sensitive product categories.

robots.txt

A plain-text file at your site root that tells web crawlers which pages they may or may not access. A direct gate on AI visibility: blocking GPTBot, PerplexityBot, or Googlebot in robots.txt removes your content from those engines' citation systems entirely. Should be audited regularly for unintended AI crawler blocks.

S

SERP

Search Engine Results Page — Traditional Google search results. Different from AI responses but related to overall online visibility strategy.

Share of Voice

The percentage of AI responses that mention your brand compared to competitors for a given set of queries or topics.

Semantic Search

Search methodology that interprets the meaning and intent behind a query rather than matching exact keywords — the foundation of how all modern AI engines process user queries.

Structured Data

Machine-readable Schema.org markup added to web pages that explicitly declares entity attributes and relationships — a direct technical lever for how AI systems represent your brand.

System Prompt

Hidden instructions given to an LLM before any user interaction that configure its behavior, citation preferences, and content policies — the primary reason the same model behaves differently across ChatGPT, Perplexity, Claude, and Copilot.

Semantic Triple

A structured factual statement consisting of subject, predicate, and object — the fundamental unit of knowledge graphs. Example: '[Brand] → is a type of → [category]'. AI engines learn brand attributes through these relationships, encoded in schema markup, Wikidata, and factual text.

Share of Search

The percentage of all branded searches in a category that include your brand name — a leading indicator of market share. Complements AI share of voice: share of search captures navigational intent from buyers who know you; AI share of voice captures discovery intent from buyers who don't yet.

Source Authority

The credibility score AI retrieval systems assign to a domain or page when selecting content to cite — determined by backlink profile, publishing history, author attribution, content quality, and topical consistency. The single most impactful structural factor in RAG-based citation selection.

SGE (Search Generative Experience)

Google's public testing name for what became AI Overviews — AI-generated answer summaries in Google Search. Tested in Search Labs 2023–2024 before full launch as AI Overviews at Google I/O 2024. Still widely used in the SEO/AEO industry as shorthand for AI-generated answers in Google Search.

Search Intent

The underlying goal behind a user's search query — informational, navigational, commercial investigation, or transactional. Synonymous with query intent. AI engines match content to intent before retrieving; a mismatch between content type and query intent prevents citation regardless of content quality.

T

Topic Authority

Your recognized expertise in a specific domain or subject area — established through comprehensive content, expert authorship, and consistent publishing.

Training Data

The corpus of text and information that AI models learn from. Your content in training data influences how AI systems represent your brand.

Topical Depth

The comprehensiveness of content coverage on a given subject — AI engines strongly favor sources with deep topical coverage when generating and citing responses.

Token

The basic unit of text LLMs process — roughly 0.75 words each. Context windows, retrieval budgets, and API costs are all measured in tokens; understanding tokens explains why only portions of long pages get read by AI engines.

Temperature

An LLM parameter controlling output randomness — low temperature produces consistent, predictable responses; high temperature produces varied ones. The primary reason the same query returns different brand mentions across monitoring runs.

Topic Cluster

A content architecture where a central pillar page covers a broad topic, surrounded by supporting cluster pages covering related subtopics — all interlinked. Signals topical authority to AI retrieval systems by demonstrating comprehensive coverage of a subject area.

Thought Leadership

Content that shapes how an industry thinks about a topic — establishing the author or brand as an original authority rather than just a practitioner. Produces outsized AI citation value because it generates third-party citation chains, earns high-authority backlinks, and may shape the vocabulary AI engines use to describe a domain.

Topical Map

A structured plan mapping all topics, subtopics, and content pieces needed to establish full topical authority in a domain — defining pillar pages, cluster pages, and their interlinkage. Enables comprehensive query cluster coverage that builds the topical authority signals AI retrieval systems use to score domain expertise.

Thin Content

Web pages with insufficient depth, originality, or factual density to merit strong AI citation — high word count without specific citable claims, restated information without original perspective, or superficial coverage of too-broad topics. Wastes crawl budget, dilutes topical authority, and increases hallucination risk when retrieved.

U

Unlinked Mention

An instance of your brand name appearing on a third-party page without a hyperlink. Retains significant AI visibility value because LLMs absorb training data regardless of link presence — a brand name mentioned in context builds category associations and entity recognition even without a URL.

V

Visibility Score

A 0-100 metric that quantifies how prominently your brand appears across AI engines, factoring in mention frequency, positioning, and sentiment.

Vector Database

A storage system designed to index and retrieve high-dimensional vector embeddings — the mathematical representations AI systems use to encode meaning in text. Powers the semantic retrieval step in RAG pipelines that determines which content gets cited.

Voice Search

Querying a search engine or AI assistant by speaking rather than typing. Voice queries are the most conversational form of search — longer, more natural, and often local or action-oriented. As AI assistants shift to LLM-powered voice modes, the same AEO content signals that improve text AI visibility also improve voice answer inclusion.

W

Wikidata

A free, community-maintained knowledge base storing structured entity data — used heavily in LLM training and as a source for Google's Knowledge Graph. Having a Wikidata entry anchors your brand as a recognized entity with defined properties, reducing hallucination risk.

Z

Zero-click Visibility

Brand awareness and impression value delivered when your brand appears in AI-generated answers without any user click — the dominant form of value in AI search.

Zero-Shot Learning

An AI model's ability to perform tasks or answer questions it was never explicitly trained on, by generalizing from related knowledge. Brands with strong training data presence benefit from zero-shot generalization — the model can recommend them for new query phrasings it hasn't seen before.

Back to Knowledge Base