Perplexity is the most transparent of the major AI answer engines about its source selection process because every answer includes visible footnote citations. This makes it possible to study which types of content get cited and why — providing direct insight into the signals Perplexity's retrieval and synthesis system uses to select sources.
The Two-Stage Selection Process
Perplexity uses a two-stage process for each query. First, the retrieval stage: a search query is run against the web index, returning a set of candidate pages ranked by relevance. Second, the synthesis stage: the language model reads the candidate pages and selects specific passages to include in the answer, attributing each to its source. Content that reaches the answer gets cited. Content that was retrieved but not used in the synthesis does not appear as a citation.
What Gets Through Both Stages
The retrieval stage favors: indexed pages (IndexNow submission accelerates this), pages on domains with topical relevance to the query, and pages with title and meta description signals that match the query intent. The synthesis stage favors: content that directly and specifically answers the query point being synthesized, content structured for extractability, and content that provides information not already covered by the other retrieved sources.
The Specificity Advantage
One consistently observable pattern in Perplexity citations is that specific, detailed answers are cited more often than general overviews. An article that answers "how much does LTL freight reclassification typically cost" with specific percentage ranges and dollar figures is more likely to be cited than an article that says "reclassification can be expensive." The Omni AEO content system builds specificity and extractability into every article by design.