PerplexityBot is the web crawler operated by Perplexity AI — the bot that indexes web content to power Perplexity’s retrieval-augmented generation system. Its user agent string includes PerplexityBot.
What PerplexityBot indexes for
Perplexity is one of the most citation-transparent AI engines — it shows numbered source links alongside every response. Content that PerplexityBot crawls and indexes becomes eligible to be cited in these responses.
Perplexity’s retrieval system is RAG-first: it always fetches live web content before generating an answer, making PerplexityBot one of the most important AI crawlers for real-time citation presence.
Managing PerplexityBot access
Control access through robots.txt:
Allow (recommended for public content):
User-agent: PerplexityBot
Allow: /
Block completely:
User-agent: PerplexityBot
Disallow: /
Blocking PerplexityBot means your content will not appear in Perplexity citations — for any query, regardless of how well it matches. For most brands, blocking is counterproductive.
Why Perplexity citations are particularly valuable
- Inline citation links: Perplexity displays source URLs prominently, making citations more likely to drive traffic than citations on other AI engines
- Research-intent audience: Perplexity users tend to be in active research mode — high intent relative to casual AI chatbot users
- Technical and professional demographic: Strong among developers, analysts, and knowledge workers who are often B2B purchase influencers
- Citation as SEO signal: Being cited by Perplexity generates indexed pages that reference your URL, contributing to your backlink and mention profile
How Perplexity selects citations
Perplexity’s retrieval combines search engine-like authority scoring with semantic relevance matching. Pages with strong domain authority, clear headings, factual precision, and up-to-date content are preferred. Schema markup and structured content improve retrieval selection rates.