# Perplexity

> Source: https://aiwiki.ai/wiki/perplexity
> Updated: 2026-07-11
> Categories: Machine Learning, Mathematics, Natural Language Processing
> License: CC BY 4.0 (https://creativecommons.org/licenses/by/4.0/)
> From AI Wiki (https://aiwiki.ai), the free encyclopedia of artificial intelligence. Reuse freely with attribution to "AI Wiki (aiwiki.ai)".

**Perplexity** has two distinct meanings in the field of [artificial intelligence](/wiki/artificial_intelligence). In [natural language processing](/wiki/natural_language_processing) and [information theory](/wiki/information_theory), perplexity (often abbreviated **PPL**) is a fundamental evaluation metric that measures how well a probability model predicts a sample of text: it is mathematically the exponentiated average negative log-likelihood of a sequence, so a lower perplexity means the model assigned higher probability to the words that actually occurred. Formally, it quantifies the average uncertainty a [language model](/wiki/language_model) faces when predicting the next [token](/wiki/token) in a sequence, and it can be read as the effective number of equally likely choices the model weighs at each step. Since its introduction in 1977, perplexity has served as the standard intrinsic evaluation metric for language models, from early [n-gram](/wiki/n-gram) systems through modern [large language models](/wiki/large_language_model). In the commercial technology space, [Perplexity AI](/wiki/perplexity_ai), Inc. is an American artificial intelligence company that operates an AI-powered [search engine](/wiki/search_engine), often described as an "answer engine," that uses [retrieval-augmented generation](/wiki/retrieval_augmented_generation) to synthesize cited responses from web sources instead of returning a list of links. Founded in August 2022 and based in San Francisco, Perplexity AI reached a valuation of approximately $22.6 billion in January 2026, served roughly 45 million monthly active users, and reported annual recurring revenue exceeding $450 million as of March 2026.[^19][^41]

## Perplexity as a Metric

### Explain like I'm 5 (ELI5)

Imagine you are playing a guessing game where you have to predict the next word in a sentence. If you are really good at the game, you only have to think about a few possible words before guessing correctly. If you are bad at the game, you might have hundreds of words spinning around in your head and you are confused about which one comes next.

Perplexity is a number that tells us how confused a computer is when it plays this guessing game. A small number (like 10) means the computer only has to think about roughly 10 words before it guesses right. A big number (like 500) means it is much more confused. So when scientists build language models, they want the perplexity to be as low as possible, because that means the computer is good at guessing what word comes next.

### What does perplexity measure?

Perplexity measures how well a probability model predicts text, expressed as the model's average uncertainty per token. A model that confidently and correctly predicts each next word has low perplexity; a model that spreads its probability thinly across many candidate words has high perplexity. Because it is computed directly from the probabilities a model assigns, perplexity can be evaluated on any held-out text without task-specific labels, which is why it became the default yardstick for language modeling.

### Historical background

The concept of perplexity has its roots in Claude Shannon's 1948 paper "A Mathematical Theory of Communication," which introduced [entropy](/wiki/entropy) as a measure of information content and uncertainty in communication systems.[^1] Shannon estimated that English text contains approximately 1 to 1.5 bits of information per letter, far less than the theoretical maximum of about 4.7 bits if all letter sequences were equally likely. This gap reflects the deep statistical structure of natural language.

Perplexity itself was formally introduced in 1977 by Frederick Jelinek, Robert Mercer, Lalit Bahl, and James Baker at IBM's Thomas J. Watson Research Center.[^2] Their paper, "Perplexity: A Measure of the Difficulty of Speech Recognition Tasks," presented at the 94th meeting of the Acoustical Society of America, argued that vocabulary size and static branching factors were inadequate measures of speech recognition complexity. They proposed perplexity (whose logarithm is the familiar entropy) as a more appropriate measure of equivalent choice.

During the n-gram era of language modeling, perplexity served as the primary benchmark. Researchers would train n-gram models (bigram, trigram, and higher-order) on text corpora and compare their perplexity scores on held-out [test sets](/wiki/test_set). Kneser-Ney smoothing and other techniques were developed in part to reduce perplexity on standard benchmarks. Throughout the 1980s and 1990s, perplexity became the primary benchmark for evaluating n-gram language models, used to compare different smoothing techniques, backoff strategies, and model orders. When [neural network](/wiki/neural_network)-based language models emerged in the 2000s, starting with Bengio et al.'s feedforward neural language model (2003), perplexity remained the standard yardstick, allowing direct comparison between statistical and neural approaches.[^3] The [Transformer](/wiki/transformer) architecture (2017) and subsequent large language models continued the tradition, reporting perplexity on established benchmarks such as Penn Treebank, WikiText-2, and WikiText-103, though the field has increasingly adopted task-specific evaluation alongside perplexity.

### Mathematical formulation

Perplexity is mathematically defined as the exponentiated average negative log-likelihood of a sequence. Given a test sequence W = w_1, w_2, ..., w_N, the perplexity of a language model is:

$$
\text{PPL}(W) = \exp\left( -\frac{1}{N} \sum_{i=1}^{N} \ln P(w_i \mid w_1, \ldots, w_{i-1}) \right)
$$

Where:
- N is the total number of tokens in the test sequence
- P(w_i | w_1, ..., w_{i-1}) is the conditional probability the model assigns to token w_i given all preceding tokens
- The inner sum computes the average negative log-likelihood across all tokens

Using base-2 logarithms, this can be written equivalently as:

$$
\text{PPL}(W) = 2^{-\frac{1}{N} \sum_{i=1}^{N} \log_2 P(w_i \mid w_1, \ldots, w_{i-1})}
$$

This is also equivalent to $$\text{PPL}(W) = 2^{H(W)}$$, where $$H(W)$$ is the cross-entropy of the model on the test sequence, measured in bits. In this form, perplexity represents the number of bits needed on average to encode each word using an optimal code based on the model's predicted distribution.

An equivalent product form, often used with n-gram models, is:

$$
\text{PPL}(W) = \left( \prod_{i=1}^{N} \frac{1}{P(w_i \mid \text{context})} \right)^{1/N}
$$

This is the inverse geometric mean of the token probabilities. The formula makes the intuition clear: if a model assigns high probability to each observed token, the product of inverse probabilities will be small, yielding a low perplexity. For a bigram model specifically, the perplexity of a test sequence is $$\text{PPL}(W) = \left( \prod_{i=1}^{N} \frac{1}{P(w_i \mid w_{i-1})} \right)^{1/N}$$. This product form is mathematically equivalent to the exponential form above but can be more intuitive for understanding how individual token probabilities contribute to the overall score.

### Relationship to cross-entropy and entropy

Perplexity is directly linked to [cross-entropy](/wiki/cross-entropy). Cross-entropy measures the average number of bits required to identify an event from a set of possibilities, given a coding scheme optimized for a predicted probability distribution rather than the true distribution. The cross-entropy $$H(p, q)$$ between the true distribution $$p$$ and the model distribution $$q$$ is:

$$
H(p, q) = -\frac{1}{N} \sum_{i=1}^{N} \log_2 P(w_i \mid \text{context})
$$

Perplexity is simply the exponentiation of the cross-entropy:

$$
\text{PPL} = 2^{H(p, q)}
$$

Or equivalently, using natural logarithms:

$$
\text{PPL} = \exp(H_{\text{nat}}(p, q))
$$

This relationship is significant for neural network training. When a neural language model is trained by minimizing the cross-entropy loss (the standard practice), it is implicitly minimizing perplexity on the training data. The average negative log-likelihood computed during training is precisely the cross-entropy, so perplexity is simply the exponential of the training loss. A model that achieves lower cross-entropy loss during training will, by definition, also achieve lower perplexity on the same data.

Perplexity is also related to entropy, which represents the theoretical lower bound on cross-entropy for a given language. While entropy $$H(p)$$ measures the inherent uncertainty in the true distribution, cross-entropy $$H(p, q)$$ is always greater than or equal to entropy. The difference between cross-entropy and entropy is the Kullback-Leibler divergence, which measures how much the model distribution q differs from the true distribution p. A perfect model (where $$q = p$$) would achieve cross-entropy equal to the true entropy, and its perplexity would represent the inherent unpredictability of the language itself.

### Interpretation and intuition

Perplexity can be interpreted as the **effective number of equally likely choices** the model considers at each prediction step, or as the weighted branching factor of a language model. A model with a perplexity of 50 is, on average, as uncertain as if it had to choose uniformly among 50 equally likely tokens for each word. If a model has a perplexity of 10 on a given dataset, it means that, on average, the model is as uncertain as if it had to choose uniformly among 10 equally likely options for each word. A perplexity of 1 would mean the model always predicts the correct next word with absolute certainty (a theoretical ideal that is unreachable in practice for natural language). This interpretation as a "weighted branching factor" gives the metric a concrete, intuitive meaning.

Key properties of perplexity include:

| Property | Description |
|---|---|
| Lower bound | Perplexity is always at least 1.0. A perplexity of 1 means the model predicts every token with 100% confidence and is always correct. |
| Upper bound | The theoretical maximum equals the vocabulary size $$V$$, corresponding to a uniform distribution over all tokens. |
| Lower is better | A lower perplexity indicates the model is less "surprised" by the test data and has learned better language patterns. |
| Same test set required | Perplexity scores are only meaningful when comparing models evaluated on the same test dataset. |
| Tokenizer dependence | Scores are not comparable across models with different [tokenizers](/wiki/tokenization), because different segmentation changes both the number of predictions and the difficulty of each prediction. |
| Dataset dependence | Perplexity scores are only meaningful when compared across models evaluated on the same test dataset using the same tokenization scheme. |

### Bits per character, bits per byte, and bits per word

Because different models use different tokenization schemes (word-level, subword, character-level, byte-level), raw perplexity scores are often not directly comparable. Several normalized metrics address this issue.

**Bits per character (BPC)** measures the average number of bits needed to encode one character of text under the model's predicted distribution. It is computed as the total cross-entropy loss divided by the number of characters. Good character-level language models typically achieve 1.0 to 1.5 BPC on English text. In 2019, Transformer-XL achieved a then state-of-the-art 0.99 BPC on the enwik8 benchmark.[^6]

**Bits per byte (BPB)** is similar but normalizes by raw bytes rather than characters. This metric is particularly useful for byte-level models and allows comparison across different character encodings.

**Bits per word (BPW)** normalizes by word count and is the direct logarithmic transform of word-level perplexity: $$\text{BPW} = \log_2(\text{PPL})$$. For example, a trigram model achieving a perplexity of 247 on the Brown Corpus corresponds to approximately 7.95 bits per word, or about 1.75 bits per letter.

Because these metrics normalize by a fixed unit (character, byte, or word) rather than by the model's internal tokens, they enable fairer comparisons across models with different vocabularies and tokenization strategies. The total information content of a test set is the same regardless of how it is segmented, so cross-entropy can always be converted between different granularity levels.

### Word-level vs. subword-level perplexity

The choice of tokenization granularity significantly affects reported perplexity values, and direct comparison requires careful normalization.

**Word-level perplexity** counts each whitespace-delimited word as one prediction step. Early n-gram models and some recurrent models (such as AWD-LSTM) reported word-level perplexity on benchmarks like Penn Treebank and WikiText-103.

**Subword-level perplexity** counts each subword token (produced by BPE, WordPiece, or SentencePiece) as one prediction step. Models like GPT-2 (using BPE with ~50,000 tokens), BERT (using WordPiece with ~30,000 tokens), and LLaMA use subword tokenization. Because subword models split rare words into multiple tokens, they make more predictions per word, and each individual prediction tends to be easier. This results in lower per-token perplexity even if the model captures equivalent information.

Research has shown that tokenization differences can affect traditional perplexity measurements by up to 21.6%.[^10] To normalize across tokenization schemes, researchers can convert to bits per character or bits per byte, or use token-normalized perplexity that accounts for the total sequence length in characters rather than tokens.

### Perplexity and vocabulary size

Vocabulary size directly affects perplexity. A model with a larger vocabulary faces a harder prediction task at each step because it must distribute probability mass across more possible tokens. All else being equal, a model with a vocabulary of 50,000 tokens will tend to report higher perplexity than an otherwise identical model with a vocabulary of 10,000 tokens.

This is why perplexity comparisons should always be made between models using the same vocabulary and tokenizer. When comparing across different vocabulary sizes, bits per character or bits per byte provides a more equitable comparison, since these metrics are independent of how the text is segmented into tokens.

### Benchmark perplexity scores

The following tables show perplexity scores for notable language models on standard benchmarks. These scores illustrate the progress in language modeling over the years. Lower scores indicate better performance.

#### Penn Treebank (word-level perplexity)

| Model | Test perplexity | Year |
|---|---|---|
| Kneser-Ney 5-gram | 141.2 | 1995 |
| AWD-LSTM + dynamic eval | 51.1 | 2017 |
| Mogrifier LSTM + dynamic eval | 44.8 | 2019 |
| Mogrifier RLSTM + dynamic eval | 42.9 | 2022 |
| [GPT](/wiki/gpt)-2 (1.5B parameters) | 35.76 | 2019 |
| GPT-3 (175B parameters) | 20.50 | 2020 |

#### WikiText-103 (word-level perplexity)

| Model | Test perplexity | Year |
|---|---|---|
| LSTM (Graves, 2013) | 48.7 | 2016 |
| Transformer with tied adaptive embeddings | 20.5 | 2018 |
| Transformer-XL Large | 18.3 | 2018 |
| Compressive Transformer | 17.1 | 2019 |
| Transformer-XL + RMS dynamic eval | 16.4 | 2019 |
| Routing Transformer | 15.8 | 2020 |

#### Character-level benchmarks (bits per character)

| Model | Dataset | BPC | Year |
|---|---|---|---|
| PPM compression | enwik8 | 1.46 | 1996 |
| Transformer-XL | enwik8 | 0.99 | 2019 |

#### Additional benchmark scores

| Model | Dataset | Perplexity | Year |
|---|---|---|---|
| GPT-2 (1.5B) | WikiText-2 | 19.93 | 2019 |
| GPT-2 (1.5B) | LAMBADA | 8.63 | 2019 |
| GPT-3 (175B) | LAMBADA (zero-shot) | 3.00 | 2020 |
| GPT-3 (175B) | LAMBADA (few-shot) | 1.92 | 2020 |

These results demonstrate a clear trend: larger models, better architectures, and more [training data](/wiki/training_data) consistently reduce perplexity. The transition from n-gram models to LSTMs to Transformers brought dramatic improvements, with perplexity on WikiText-103 dropping from roughly 48.7 to below 16 over just a few years. However, perplexity scores should not be compared across different datasets, since the difficulty of prediction varies with the domain, vocabulary, and text style.

### Test set perplexity vs. training perplexity

A critical distinction in language model evaluation is between training perplexity and test set perplexity.

**Training perplexity** measures how well the model fits its training data. It decreases during training as the model learns to assign higher probability to the observed sequences. However, very low training perplexity does not necessarily indicate a good model; it may simply reflect memorization.

**Test set perplexity** (or held-out perplexity) measures how well the model generalizes to unseen data. This is the metric that matters for evaluation. A model that has memorized its training data will show a large gap between training and test perplexity, a classic sign of overfitting.

Monitoring both metrics during training is standard practice. If training perplexity continues to decrease while test perplexity plateaus or increases, the model is overfitting. Techniques such as dropout, weight decay, and early stopping are commonly used to address this gap.

### Applications

Perplexity is used in a wide range of applications beyond direct language model comparison.

**Language model comparison and selection.** Perplexity provides a quick, task-agnostic way to compare language models. Before running expensive downstream evaluations, researchers often use perplexity as a first filter to identify the most promising model variants.

**[Speech recognition](/wiki/speech_recognition).** Perplexity was originally developed to estimate the difficulty of speech recognition tasks. A language model with lower perplexity used in a speech recognizer's decoder typically leads to lower word error rates, because the model better constrains the search space of possible transcriptions. It helps estimate how difficult a recognition task will be for a given language model component in automatic speech recognition (ASR) systems.

**[Machine translation](/wiki/machine_translation).** Language model perplexity is used to evaluate the fluency component of machine translation systems. Although [BLEU](/wiki/bleu) scores are preferred for end-to-end translation evaluation, the target-side language model's perplexity helps assess output fluency.

**Topic modeling.** In probabilistic topic models like Latent Dirichlet Allocation (LDA), held-out perplexity is used to determine the optimal number of topics. Lower perplexity on held-out documents suggests the model has found a better decomposition of the corpus.

**Text quality and anomaly detection.** High perplexity on a given text sample may indicate that the text is out of domain, poorly written, or anomalous relative to the training distribution. This property has applications in data cleaning and quality filtering for training corpora. Documents with unusually high perplexity relative to a trained model may contain errors, be from a different domain, or include unusual content.

**AI-generated text detection.** Perplexity plays a central role in [AI content detection](/wiki/ai_content_detectors) systems such as GPTZero, ZeroGPT, the OpenAI AI Text Classifier, and similar classifiers. Text produced by a language model tends to have low perplexity when evaluated by a similar model, because AI-generated text follows highly predictable statistical patterns. Human-written text typically exhibits higher perplexity due to greater variability in word choice and sentence structure. However, detection based on perplexity alone has significant weaknesses: paraphrasing and editing can raise perplexity above detection thresholds, domain-specific text (such as legal or technical writing) may have naturally low perplexity, and newer models fine-tuned for output variability can evade detection.

### Limitations

Despite its widespread use, perplexity has several important limitations that researchers and practitioners should be aware of.

| Limitation | Explanation |
|---|---|
| No measure of coherence | A model can achieve low perplexity by predicting common word sequences well, while still producing incoherent or nonsensical text over longer spans. |
| No measure of factuality | Perplexity evaluates statistical fit, not whether the model's outputs are factually correct. A model might assign high probability to plausible-sounding but false statements. |
| No measure of usefulness | Low perplexity does not imply that a model is useful for downstream tasks like question answering, summarization, or instruction following. |
| Weak correlation with some downstream tasks | Tasks requiring semantic understanding or reasoning (question answering, sentiment analysis) show weak correlation with perplexity, because next-token prediction does not directly capture deeper comprehension. |
| Domain sensitivity | A model trained on news text may show excellent perplexity on news test sets but poor perplexity on medical or legal text. Cross-domain perplexity comparisons can be misleading. |
| Tokenizer dependence | Models with different tokenizers produce incomparable perplexity scores without normalization. |
| Sensitivity to small test sets | Small differences in perplexity (2 to 3 points) on small test sets may not be statistically significant and can reflect random variation rather than genuine model differences. |
| Repetitive text can score well | A model that produces repetitive, low-entropy text may achieve low perplexity despite generating poor-quality output. |
| Long-context averaging | Standard perplexity averages loss across all tokens equally, which can obscure a model's performance on tokens that genuinely require long-range context. Metrics like LongPPL have been proposed to address this by isolating tokens whose prediction leverages distant context. |

For these reasons, modern evaluation of large language models often supplements perplexity with task-specific benchmarks such as [MMLU](/wiki/mmlu), [HellaSwag](/wiki/hellaswag), TruthfulQA, and [HumanEval](/wiki/humaneval).

### Perplexity vs. downstream task performance

The relationship between perplexity and downstream task performance is nuanced and has been the subject of considerable research.

For tasks that directly involve predicting or generating text (speech recognition, machine translation, text generation), perplexity tends to correlate well with task performance. This makes sense, because these tasks depend on the same core capability that perplexity measures: accurate next-token prediction.

For tasks requiring higher-level understanding (question answering, natural language inference, commonsense reasoning), the correlation is weaker. A model can be excellent at predicting common word sequences yet struggle with reasoning tasks. Research by the XLNet authors noted that improved language model perplexity does not always lead to improvement on downstream tasks. Conversely, the RoBERTa work found that better perplexity on the masked language modeling objective did lead to better end-task accuracy on sentiment analysis and inference benchmarks.[^9]

This mixed evidence motivated the development of comprehensive benchmark suites such as GLUE, SuperGLUE, BIG-Bench, MMLU, and HumanEval, which evaluate language models on a diverse set of downstream tasks. Modern large language model evaluation typically supplements perplexity with these task-specific benchmarks, human evaluations, and safety assessments to provide a more complete picture of model capabilities.

### Use in AI Content Detection

*See also: [AI content detectors](/wiki/ai_content_detectors), [burstiness](/wiki/burstiness)*

Perplexity plays a central role in AI content detection systems such as GPTZero, ZeroGPT, and the OpenAI AI Text Classifier. These tools use perplexity as a signal to distinguish human-written text from AI-generated text.

The core idea is that text produced by a language model tends to have low perplexity when evaluated by a similar model, because AI-generated text follows highly predictable patterns. Human-written text, by contrast, tends to have higher perplexity due to greater variability in word choice, sentence structure, and stylistic decisions. When an AI detector feeds a document through a language model and the resulting perplexity is unusually low, the system flags the text as likely AI-generated.

GPTZero, one of the first widely used AI detectors, pioneered the approach of combining perplexity with [burstiness](/wiki/burstiness) (a measure of how much the writing style varies throughout a document).[^11] AI-generated text typically exhibits both low perplexity and low burstiness, because language models produce text at a consistent level of predictability. Human writing, on the other hand, tends to show bursts of complexity followed by simpler passages, resulting in higher burstiness.

The following table summarizes how perplexity and burstiness interact in AI content detection:

| Metric | Human-Written Text | AI-Generated Text |
|---|---|---|
| Perplexity | Generally higher (more surprising, varied word choices) | Generally lower (predictable, follows statistical patterns) |
| Burstiness | Higher (variable sentence length and complexity) | Lower (uniform writing style throughout) |
| Combined Signal | High perplexity + high burstiness = likely human | Low perplexity + low burstiness = likely AI |

However, perplexity-based detection has significant weaknesses:

- **Paraphrasing and editing** can raise the perplexity of AI-generated text above the detection threshold.
- **Domain-specific text** (such as legal documents or technical manuals) may have naturally low perplexity due to formulaic language, leading to false positives.
- **Multilingual text** and non-standard English dialects can produce unreliable perplexity scores.
- **Newer models** that are [fine-tuned](/wiki/fine_tuning) or prompted to write with more variability can evade perplexity-based detectors.

As a result, modern AI detection tools like GPTZero have evolved beyond relying solely on perplexity and burstiness, incorporating multilayered systems with seven or more components including neural classifiers trained on large datasets of human and AI-generated text.

### Perplexity in practice: computing with modern LLMs

Computing perplexity for modern large language models involves some practical considerations.

**Fixed-length context.** Most Transformer-based models have a fixed context window (for example, 2,048 or 4,096 tokens for GPT-2). When evaluating on text longer than the context window, researchers use a sliding window approach, scoring overlapping segments and averaging the results. The Hugging Face Transformers library provides standard implementations of this approach.[^12]

**[Softmax](/wiki/softmax) probabilities.** Perplexity is computed from the softmax output probabilities of the model. For each position in the test sequence, the model's predicted probability distribution over the vocabulary is compared to the actual next token, and the log probability of the correct token is accumulated.

**Computational cost.** Evaluating perplexity requires a full forward pass through the model for the entire test set, which can be expensive for very large models. Unlike many downstream benchmarks, perplexity evaluation does not require any fine-tuning or task-specific adaptation, making it relatively straightforward to compute.

## Perplexity AI (Company)

### Overview

[Perplexity AI](/wiki/perplexity_ai), Inc. is an American [artificial intelligence](/wiki/artificial_intelligence) company headquartered in San Francisco, California, that develops and operates an AI-powered search engine. Often referred to as an "answer engine," Perplexity processes natural language queries and returns synthesized, cited responses rather than a traditional list of blue links. CEO and co-founder Aravind Srinivas has summarized the product directly: "Perplexity is best described as an answer engine. You ask it a question, you get an answer. Except the difference is, all the answers are backed by sources."[^41] The company was founded in August 2022 and launched its consumer search product on December 7, 2022.

As of early 2026, Perplexity has approximately 45 million monthly active users, handles an estimated 180 to 220 million search queries per week (more than double early-2025 levels), and attracts roughly 170 to 240 million global website visitors per month.[^19] Following a Series E-6 funding round in early 2026, the company is valued at approximately $21 billion, with reports placing the most recent valuation at approximately $22.6 billion as of January 2026. The company's annual recurring revenue surpassed $450 million in March 2026, a roughly 50% jump within a single month that the Financial Times attributed to the rollout of its Computer agent and a shift to usage-based pricing.[^42] Headcount grew from roughly 310 employees in early 2025 to approximately 1,449 employees by April 2026.[^19]

### Founders

Perplexity AI was co-founded by four engineers with backgrounds in back-end systems, artificial intelligence, and [machine learning](/wiki/machine_learning):

| Founder | Role | Background |
|---|---|---|
| [Aravind Srinivas](/wiki/aravind_srinivas) | CEO | Ph.D. in Computer Science from UC Berkeley. Former AI researcher at [OpenAI](/wiki/openai), DeepMind, and Google Brain. B.S. and M.S. in Electrical Engineering from IIT Madras. |
| Denis Yarats | CTO | Ph.D. in Computer Science from NYU, focusing on [reinforcement learning](/wiki/reinforcement_learning) and NLP. Former AI Research Scientist at [Meta](/wiki/meta_ai) AI (FAIR). Previously worked on Bing at Microsoft and as a Staff ML Engineer at Quora. |
| Johnny Ho | Chief Strategy Officer | Former engineer at Quora. Former quantitative trader at Tower Research Capital. Competitive programming world champion. |
| Andy Konwinski | President | Ph.D. in Computer Science. Co-founder of [Databricks](/wiki/databricks). Expert in distributed systems and big data processing through work on Apache Spark. |

[Aravind Srinivas](/wiki/aravind_srinivas) has been the public face of the company, articulating the vision of replacing traditional search engines with a system that directly answers questions with cited sources, rather than requiring users to sift through links.

### How does Perplexity work?

Perplexity's search engine is built on retrieval-augmented generation (RAG), a technique that combines information retrieval with [generative AI](/wiki/generative_ai) to produce grounded, cited responses. The system follows a multi-stage pipeline:

1. **Query Intent Parsing.** When a user submits a query, a language model analyzes the intent behind the question, going beyond simple keyword matching to achieve semantic understanding of context, nuance, and the user's underlying goal.

2. **Web Retrieval.** The system searches the web and its internal index to retrieve relevant documents and web pages. Perplexity uses Vespa.ai as its core search infrastructure, which integrates vector search for semantic understanding, lexical search for precision, structured filtering, and machine-learned ranking into a single engine.[^15]

3. **Chunk-Level Extraction.** Rather than passing entire documents to the language model, the system extracts the most relevant text spans (chunks) from retrieved documents. This chunk-level retrieval improves factual accuracy, reduces context length, and minimizes compute cost.

4. **Answer Generation.** The curated context is passed to a generative large language model, which synthesizes a natural-language response based strictly on the retrieved information. A core architectural principle is that the model should not say anything that was not retrieved from sources. As Srinivas describes it, the model is instructed that "given a bunch of links and paragraphs, write a concise answer for the user, with the appropriate citation."[^41]

5. **Citation Attachment.** Inline citations are attached to the generated text, linking specific claims back to their source documents. This allows users to verify every piece of information and explore original sources.

Perplexity continuously ingests data and updates both text and vector indexes in real time without interrupting queries. Its distributed architecture balances data and computation across nodes, co-locating content, indexes, and ranking logic to eliminate bottlenecks.

### Products and Features

Perplexity offers a range of products and features that have expanded significantly since its 2022 launch.

#### Perplexity Search

The core product is the [AI search](/wiki/ai_search) engine, available at perplexity.ai and through mobile apps on iOS and Android. Users type a natural language question, and the system returns a synthesized answer with inline citations. Follow-up questions are supported in a conversational thread, allowing users to refine their queries. The free tier provides access to basic search with a limited number of Pro Search queries per day.

#### Perplexity Pro

Perplexity Pro is the paid subscription tier, priced at $20 per month (or $200 per year). Pro subscribers receive:

- Unlimited Pro Search queries (which use more compute and return deeper, multi-step research results)
- Access to multiple advanced AI models, including options optimized for coding, creative writing, [deep learning](/wiki/deep_learning) reasoning, and technical summarization
- Unlimited file uploads (up to 50 MB per file) for document analysis
- Priority compute allocation and faster response times
- Early access to new features
- A $5 monthly API credit

#### Perplexity Max

Announced in July 2025, Perplexity Max is a $200-per-month subscription plan aimed at power users.[^17] It includes unlimited access to Labs (a spreadsheet and report generation tool), early access to Comet and Computer features, and access to the most advanced model configurations including [Claude Opus 4.6](/wiki/claude_opus_4_6) and [GPT-5.2](/wiki/gpt-5.2) within Deep Research and Model Council.[^17]

#### Enterprise Pro

Perplexity Enterprise Pro, priced at $40 per user per month, targets business teams with features including centralized billing, admin controls, data privacy protections, and team collaboration tools through Spaces.

#### Comet Browser

[Comet](/wiki/perplexity_comet) is Perplexity's AI-native web browser. It launched first for macOS and Windows on July 9, 2025, expanded to Android on November 20, 2025, and arrived on iOS on March 18, 2026, making it available across all major desktop and mobile platforms by spring 2026.[^20] On April 28, 2026, Perplexity released an iPad-optimized version with support for multiple windows and Split View multitasking.[^25] The browser integrates Perplexity's search and AI capabilities directly into the browsing experience, with features including:

- A built-in AI assistant that can summarize pages, answer questions about content, and complete web-based tasks
- Voice mode for spoken queries (upgraded in March 2026 to use GPT Realtime 1.5)
- Hybrid search combining traditional web browsing with AI-powered answers
- [Deep Research](/wiki/deep_research) integration for multi-source summaries
- Support for task automation such as comparing prices across websites or summarizing emails

Max subscribers can select the model powering their browser agent, choosing from options such as [Claude Opus 4.6](/wiki/claude_opus_4_6) and [Claude Sonnet 4.6](/wiki/claude_sonnet_4_6).

**Comet Enterprise**, launched in March 2026, is a secure version for teams with granular admin controls, MDM (mobile device management) deployment support, audit logs, and integration with CrowdStrike Falcon for threat detection.

#### Comet Plus

Announced in August 2025 and broadly available in early January 2026, Comet Plus is a $5-per-month subscription add-on that shares 80% of subscription revenue with participating publishers. Perplexity opened a $42.5 million revenue pool for publishers, with payouts based on three categories: direct traffic to publisher sites from the Comet browser, citations when publisher content appears in AI-generated answers, and agent-driven use when Comet's assistant relies on publisher content to complete tasks. Comet Plus is bundled with Pro and Max subscriptions.[^26]

#### Perplexity Computer (and Personal Computer)

Announced in February 2026, **Perplexity Computer** is a multi-model agentic AI system that orchestrates tasks across approximately 20 specialized AI models and connects to more than 400 applications, including Gmail, Slack, GitHub, Notion, and Salesforce.[^27] In May 2026, Computer became available to all Pro subscribers and added integrations with Microsoft Teams, Snowflake, and Databricks, as well as the ability to publish websites and full-stack applications to persistent *.pplx.app URLs.[^28]

**Personal Computer for Mac**, announced in March 2026 and rolled out to Max subscribers in April 2026, is a locally running variant of Computer designed to run on Apple Silicon Macs. It executes on the user's own hardware (Perplexity recommends a Mac mini or similar always-on device) and operates over local files, iMessage, Apple Mail, Calendar, and Notes without uploading content to Perplexity's servers. On May 7, 2026, Perplexity launched an all-new native Mac app, and Personal Computer was opened to all Pro and Max subscribers on May 11, 2026.[^29][^30]

In March 2026 Perplexity also introduced **Skills** (reusable, user-defined task instructions for Computer) and support for the **[Model Context Protocol](/wiki/model_context_protocol) (MCP)**, letting Pro, Max, and Enterprise users connect Computer to external tools via custom MCP servers.

#### Model Council

Launched in February 2026, **Model Council** dispatches a single user query to three frontier models in parallel, typically [Claude Opus 4.6](/wiki/claude_opus_4_6), [GPT-5.2](/wiki/gpt-5.2), and [Gemini 3](/wiki/gemini_3) Pro, and produces a synthesis output that highlights agreement, disagreement, and unique insights between the models. The feature is available to Max subscribers on the web, with Pro and mobile rollout planned.[^31]

#### Deep Research

[Deep Research](/wiki/deep_research) is Perplexity's autonomous research feature that performs dozens of searches, reads hundreds of sources, and produces a comprehensive cited report. It is free with daily usage limits, with higher quotas for Pro subscribers. In early 2026, Perplexity upgraded Deep Research to run on [Claude Opus 4.5](/wiki/claude_opus_4_5) (and later [Opus 4.6](/wiki/claude_opus_4_6)) for Pro and Max users, with the ability to produce presentations, spreadsheets, dashboards, and websites directly from research outputs. Perplexity reports that Deep Research scores 21.1% on Humanity's Last Exam and 93.9% on [SimpleQA](/wiki/simpleqa).[^32]

#### Buy with Pro (Shopping) and Virtual Try-On

Launched in November 2024 for Pro users in the United States, Buy with Pro is an AI-powered shopping feature that lets users research products, compare options, and check out directly within Perplexity. Pro users receive free shipping on Buy with Pro orders. In May 2025, Perplexity partnered with PayPal to add PayPal and Venmo as checkout options within the app.

In May 2026, Perplexity introduced a **Virtual Try-On** capability that lets users create a virtual avatar and preview how apparel items would look on them before buying.[^33]

#### Spaces and Pages

Spaces is an organizational feature that allows users to create dedicated workspaces for different topics or projects, grouping related searches and conversations together. Perplexity Pages allows users to generate shareable, article-style content from their research queries, transforming AI search results into formatted, publishable pages.

#### Perplexity Finance

Perplexity Finance is a dedicated experience for stocks, cryptocurrencies, and other financial data. In May 2026, the company added live intraday price information embedded directly within answers, with indicators linking to Perplexity Finance pages.[^34] For Personal Computer, Perplexity added Personal CFO functionality supporting up to 30 connected accounts and enterprise finance connectors including Morningstar, PitchBook, FactSet, Daloopa, and Carbon Arc.

### Sonar Models and API

Perplexity develops its own family of AI models under the Sonar brand, which power both the consumer search product and the developer API.

#### Sonar Model Family

The Sonar models were introduced in February 2025, built on top of Meta's [Llama 3.3](/wiki/llama_3_3) 70B and further trained to enhance answer factuality and readability.[^13]

| Model | Context Window | Speed | Key Strength |
|---|---|---|---|
| Sonar | 127K tokens | ~1,200 tokens/sec | Fast, real-time answers for general queries |
| Sonar Pro | 200K tokens | ~144 tokens/sec | Higher accuracy; leads SimpleQA factuality benchmark (F-score 0.858) |
| Sonar Reasoning Pro | 200K tokens | Varies | Deep analytical and [reasoning](/wiki/reasoning) tasks |
| Sonar Deep Research | 200K tokens | Varies | Comprehensive multi-source research |

Sonar achieves its high throughput of approximately 1,200 tokens per second by running on [Cerebras](/wiki/cerebras) inference infrastructure.[^13] Sonar Pro, while slower, delivers meaningfully higher factual accuracy, scoring 0.858 on the [SimpleQA](/wiki/simpleqa) benchmark compared to Sonar's 0.773.[^14]

#### API Access and Pricing

The Sonar API allows developers to integrate Perplexity's web-grounded search and answer capabilities into their own applications. The API supports real-time web search, conversational answers with citations, and structured retrieval. In 2026, Perplexity introduced an **Agent API** that brokers access to third-party frontier models including GPT-5.4, NVIDIA Nemotron, [Claude Sonnet 4.6](/wiki/claude_sonnet_4_6), and Gemini 3.1 Pro Preview, while deprecating older Gemini 2.5 endpoints.

| Model | Input Price (per 1M tokens) | Output Price (per 1M tokens) | Notes |
|---|---|---|---|
| Sonar | $1.00 | $1.00 | Citation tokens no longer billed (as of 2026) |
| Sonar Pro | $3.00 | $15.00 | Citation tokens no longer billed (as of 2026) |
| Sonar Deep Research | $2.00 | $8.00 | Citation tokens: $2.00/1M; Search: $5.00/1K queries |

Additional request fees apply to Sonar, Sonar Pro, and Sonar [Reasoning](/wiki/reasoning) Pro based on search context size. Pro subscribers receive a $5 monthly API credit. The Search API, for applications needing raw web results without synthesized answers, is priced at $5 per 1,000 requests.[^21]

### How much is Perplexity AI worth?

Perplexity has raised more than $1.7 billion across multiple rounds, with its valuation climbing from roughly $150 million in 2023 to approximately $22.6 billion by January 2026.[^18][^19]

| Date | Round | Amount | Valuation | Notable Investors |
|---|---|---|---|---|
| 2023 | Series A | $26M | ~$150M | NEA, [Databricks](/wiki/databricks) |
| January 2024 | Series B | $73.6M | $520M | IVP, NEA, Jeff Bezos, [NVIDIA](/wiki/nvidia) |
| April 2024 | Series B-1 | $63M | $1.04B | Daniel Gross, Bessemer Venture Partners |
| June 2024 | Series C | Undisclosed | $3B | [SoftBank Vision Fund](/wiki/vision_fund) 2 |
| December 2024 | Series D | $500M | $9B | Accel, IVP, Jeff Bezos, [NVIDIA](/wiki/nvidia) |
| May 14, 2025 | Series E | $500M | $14B | Institutional investors |
| July 2025 | Top-up | $100M | ~$16B | Various investors |
| September 2025 | Series E extension | $200M | $20B | Institutional investors |
| December 2025 | Undisclosed | Undisclosed | ~$21B | Cristiano Ronaldo (among others) |
| January 2026 | Series E-6 | Undisclosed | ~$22.6B | Existing investors |

Notable individual investors include Amazon founder Jeff Bezos, Shopify CEO Tobias Lutke, Figma CEO Dylan Field, and professional footballer Cristiano Ronaldo. Aravind Srinivas has repeatedly emphasized the unusual dynamic in which Jeff Bezos is both a personal investor in Perplexity and the founder of Amazon, which sued the company in late 2025 over agentic shopping.

### Revenue Model and Growth

Perplexity generates revenue primarily through subscriptions. The company's reported annualized revenue grew from approximately $80 million in late 2024 to roughly $148-200 million by late 2025, and the Financial Times reported in April 2026 that annual recurring revenue had topped $450 million in March 2026 following the launch of the Computer agent and a move to usage-based pricing.[^42] The company's December 2024 internal roadmap targeted $656 million in ARR for 2026.

The company's revenue sources include:

- **Subscriptions.** Perplexity Pro ($20/month), Perplexity Max ($200/month), and Enterprise Pro ($40/user/month) form the core of the business.
- **Comet Plus.** A $5-per-month add-on (bundled with Pro and Max) that funds a $42.5 million revenue-sharing pool for publishers.
- **API access.** Developers pay per-token and per-request fees for Sonar API and Agent API usage.
- **Commerce.** Buy with Pro generates revenue through merchant partnerships.
- **Content partnerships.** Licensing deals with content providers, including a multi-year agreement with Getty Images (signed October 2025) for displaying licensed imagery within search results.
- **Distribution deals.** Partnerships with device manufacturers, including system-level integration on the Samsung Galaxy S26 announced in early 2026.

In early 2024, Perplexity experimented with AI-integrated advertising. In February 2026, the company formally transitioned to a subscription-first model by discontinuing advertising, citing a desire to preserve user trust in search results.

### Cloud Infrastructure

Perplexity operates a multi-cloud strategy. Amazon Web Services remains the company's preferred cloud provider, but in January 2026 Perplexity announced a three-year, approximately $750 million commitment with Microsoft Azure to secure GPU capacity for Deep Research, Model Council, and Agent API workloads.[^35] The Azure deal also provides access to Microsoft's Foundry service for routing queries to frontier models from Anthropic, OpenAI, and xAI. The agreement was widely interpreted as a strategic hedge given the simultaneous legal dispute with Amazon over agentic shopping.

### Distribution Partnerships

#### Samsung Galaxy S26

In February 2026, Perplexity and Samsung Electronics announced a strategic partnership integrating Perplexity at the operating-system level on the **Samsung Galaxy S26** lineup, with a dedicated "Hey Plex" wake word. Perplexity also powers search and reasoning inside Samsung's Bixby assistant and is available as an optional default search engine in Samsung Internet. The deal made Perplexity the first non-Google company to receive OS-level access on a Samsung flagship device.[^36]

#### Snap

In November 2025, Snap and Perplexity announced a multi-year, approximately $400 million deal under which Perplexity's conversational answer engine would become a default AI experience inside Snapchat. Perplexity was to pay Snap roughly $400 million across twelve months in a mix of cash and equity. On May 6, 2026, Snap disclosed in its Q1 2026 earnings report that the companies had "amicably ended the relationship in Q1." A Perplexity spokesperson said the planned feature was "not the right fit" for either company, and the integration was never broadly rolled out.[^37]

### How is Perplexity different from Google Search?

Perplexity positions itself as a direct alternative to traditional search engines, particularly Google Search. While Perplexity does not yet rival Google's overall market share, it has carved out a growing niche in the AI-powered search segment. Within the AI chatbot and search market, Perplexity competes with [ChatGPT](/wiki/chatgpt), [Microsoft Copilot](/wiki/microsoft_copilot), Google's [Gemini](/wiki/gemini)-powered AI Overviews, and You.com.

Perplexity differentiates itself from competitors through several strategic choices:

- **Direct answers with citations** rather than ranked link lists
- **Transparency** about sources, with inline references users can verify
- **No advertising** (as of February 2026), positioning itself as a user-aligned tool
- **Conversational follow-up** allowing users to refine queries iteratively
- **Multi-model flexibility** letting Pro and Max users select the best model for their task
- **Agentic capabilities** via Comet and Computer, which can take actions across web and local apps

### Publisher Controversies and Legal Challenges

Perplexity has faced significant criticism and legal action from publishers, news organizations, and platforms that allege the company's web crawling and content summarization practices constitute copyright infringement or unauthorized access.

#### Key Lawsuits

| Date | Plaintiff | Key Allegations |
|---|---|---|
| October 2024 | Forbes | Cease-and-desist; allegedly reproduced content with minimal attribution |
| October 2024 | News Corp (Wall Street Journal, New York Post) | Unauthorized scraping and reproduction of copyrighted articles |
| August 2025 | Nikkei, Asahi Shimbun | Japanese publishers filed copyright claims |
| August 2025 | Encyclopaedia Britannica, Merriam-Webster | Reference content scraped and summarized without permission |
| November 2025 | Amazon | Agentic shopping via Comet allegedly violated Amazon's terms of service |
| December 5, 2025 | The New York Times | Large-scale copying and distribution of NYT content to power commercial AI products |
| December 2025 | Chicago Tribune | Copyright infringement claims |

The **New York Times lawsuit**, filed in the U.S. District Court for the Southern District of New York on December 5, 2025, alleged that Perplexity scrapes articles from nytimes.com and obtains them from third-party databases to build a private index feeding its RAG system.[^16] The complaint cited research claiming that Perplexity used undeclared user agents, disguised its crawlers to appear as ordinary web browsers, relied on hidden IP addresses, and used third-party crawling tools to avoid detection. The NYT specifically cited Perplexity's "skip the links" tagline as an invitation for users to avoid the original publishers.[^16] On February 27, 2026, Perplexity filed a motion to dismiss the New York Times and Chicago Tribune lawsuits.[^38]

The **Amazon lawsuit**, filed in November 2025, alleged that Perplexity took steps to "conceal" its AI agents so they could continue to scrape Amazon's website without authorization. In a ruling on March 10, 2026, U.S. District Judge Maxine Chesney granted Amazon a preliminary injunction blocking Comet from using its AI agent to access password-protected sections of Amazon's site to shop on behalf of customers.[^39] The court found that while Comet may have had the user's permission, it lacked Amazon's authorization to enter logged-in areas of the site. On April 1, 2026, Perplexity filed an appeal seeking to overturn the injunction.[^39] The case is widely seen as an early bellwether for the legal status of agentic commerce.

#### Perplexity's Response

Perplexity has taken several steps to address publisher concerns:

- **Publishers' Program.** Perplexity launched a revenue-sharing program with participating outlets including Gannett, TIME, Fortune, and the Los Angeles Times.
- **Comet Plus.** In August 2025, Perplexity introduced the $5/month subscription add-on that allocates 80% of net revenue (within a $42.5 million pool) to participating publishers across three payout categories: direct traffic, citations, and agent-driven use.[^26]
- **Getty Images partnership.** In October 2025, Perplexity signed a multi-year licensing agreement with Getty Images for displaying licensed imagery within search results.

The legal outcomes of these lawsuits will likely have significant implications for how AI search engines can use copyrighted web content, for the boundaries of agentic web automation, and for the broader relationship between [generative AI](/wiki/generative_ai) companies and content publishers.

### Other Strategic Moves

#### TikTok Merger Proposal (January 2025)

On January 18, 2025, the day before a scheduled U.S. ban on TikTok was to take effect, Perplexity submitted a proposal to merge with TikTok U.S. The proposal envisioned a new holding company ("NewCo") combining Perplexity with TikTok's U.S. business, with ByteDance retaining its proprietary recommendation algorithm. The structure included a provision under which the U.S. government could own up to 50% of the merged entity after a future IPO at a minimum valuation of approximately $300 billion. CEO Aravind Srinivas publicly confirmed the bid on social media. The proposal was not accepted, and TikTok's U.S. status was ultimately resolved through a separate transaction.[^40]

### Recent Developments (2025 and 2026)

Perplexity has expanded rapidly through 2025 and into 2026, adding new products and capabilities at a fast pace.

**2025 milestones:**
- January 2025: TikTok U.S. merger proposal
- February 2025: Launch of Sonar models and developer API
- March 2025: Release of Sonar Pro with improved factuality
- May 2025: PayPal and Venmo checkout integration; $500M Series E at $14B
- July 2025: Launch of Comet browser for macOS and Windows; launch of Perplexity Max subscription ($200/month)
- August 2025: Comet Plus announced with $42.5M publisher pool
- September 2025: $200M raised at $20B valuation
- October 2025: Getty Images licensing deal
- November 2025: Comet for Android; $400M Snap partnership announced; Amazon lawsuit filed
- December 2025: New York Times and Chicago Tribune lawsuits filed; investment from Cristiano Ronaldo; valuation reaches approximately $21B

**2026 milestones:**
- January 2026: $750M, three-year commitment with Microsoft Azure for GPU infrastructure; Comet Plus broadly available; Series E-6 round at ~$22.6B
- February 2026: Launch of Model Council; launch of Perplexity Computer; Samsung Galaxy S26 partnership ("Hey Plex"); discontinuation of advertising; motion to dismiss NYT/Chicago Tribune lawsuits filed
- March 2026: Comet for iOS; Comet Enterprise; Personal Computer for Mac (waitlist); Skills and MCP support for Computer; Voice Mode upgraded to GPT Realtime 1.5; Amazon preliminary injunction granted against Comet shopping
- April 2026: Personal Computer rolled out to Max subscribers; Computer enhancements including Microsoft Teams integration; iPad multitasking update for Comet; appeal filed against Amazon injunction; Financial Times reports ARR topping $450M
- May 2026: All-new native Mac app; Personal Computer opened to all Pro and Max subscribers; Computer for Enterprise launched at Ask 2026 developer conference; GPT-5.5 deployed as default Computer orchestration model; Snowflake and Databricks connectors; Virtual Try-On shopping; live finance data embedded in answers; Snap partnership formally ended

## See also

- [AI Content Detectors](/wiki/ai_content_detectors)
- [AI search](/wiki/ai_search)
- [Aravind Srinivas](/wiki/aravind_srinivas)
- [Burstiness](/wiki/burstiness)
- [Claude Opus 4.6](/wiki/claude_opus_4_6)
- [Cross-entropy](/wiki/cross-entropy)
- [Deep Research](/wiki/deep_research)
- [Entropy](/wiki/entropy)
- [Language model](/wiki/language_model)
- [Large language model](/wiki/large_language_model)
- [Model Context Protocol](/wiki/model_context_protocol)
- [N-gram](/wiki/n-gram)
- [Natural Language Processing](/wiki/natural_language_processing)
- [Perplexity AI](/wiki/perplexity_ai)
- [Perplexity Comet](/wiki/perplexity_comet)
- [Retrieval Augmented Generation](/wiki/retrieval_augmented_generation)
- [Search Engine](/wiki/search_engine)
- [Generative AI](/wiki/generative_ai)
- [SimpleQA](/wiki/simpleqa)
- [Transformer](/wiki/transformer)
- [BLEU](/wiki/bleu)
- [Loss function](/wiki/loss_function)
- [Evaluation metrics](/wiki/evaluation_metrics)

## References

[^1]: Shannon, C. E. (1948). "A Mathematical Theory of Communication." *Bell System Technical Journal*, 27(3), 379-423.
[^2]: Jelinek, F., Mercer, R. L., Bahl, L. R., & Baker, J. K. (1977). "Perplexity: A Measure of the Difficulty of Speech Recognition Tasks." *Journal of the Acoustical Society of America*, 62(S1), S63.
[^3]: Bengio, Y., Ducharme, R., Vincent, P., & Janvin, C. (2003). "A Neural Probabilistic Language Model." *Journal of Machine Learning Research*, 3, 1137-1155.
[^4]: Radford, A., Wu, J., Child, R., Luan, D., Amodei, D., & Sutskever, I. (2019). "Language Models are Unsupervised Multitask Learners." *OpenAI Technical Report*.
[^5]: Brown, T. B., Mann, B., Ryder, N., et al. (2020). "Language Models are Few-Shot Learners." *Advances in Neural Information Processing Systems*, 33, 1877-1901.
[^6]: Dai, Z., Yang, Z., Yang, Y., Carbonell, J., Le, Q. V., & Salakhutdinov, R. (2019). "Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context." *Proceedings of ACL 2019*.
[^7]: Merity, S., Keskar, N. S., & Socher, R. (2018). "Regularizing and Optimizing LSTM Language Models." *Proceedings of ICLR 2018*.
[^8]: Melis, G., Kocisk, T., & Blunsom, P. (2020). "Mogrifier LSTM." *Proceedings of ICLR 2020*.
[^9]: Liu, Y., Ott, M., Goyal, N., et al. (2019). "RoBERTa: A Robustly Optimized BERT Pretraining Approach." *arXiv preprint arXiv:1907.11692*.
[^10]: Mielke, S. J. (2019). "Can You Compare Perplexity Across Different Segmentations?" Blog post, Johns Hopkins University.
[^11]: Tian, E. (2023). "GPTZero: Towards a Detection System for AI-Generated Text." Princeton University.
[^12]: Hugging Face. "Perplexity of fixed-length models." Transformers Documentation. https://huggingface.co/docs/transformers/perplexity
[^13]: Perplexity AI. "Meet New Sonar." Perplexity Blog, February 2025. https://www.perplexity.ai/hub/blog/meet-new-sonar
[^14]: Perplexity AI. "Introducing the Sonar Pro API." Perplexity Blog, March 2025. https://www.perplexity.ai/hub/blog/introducing-the-sonar-pro-api
[^15]: Vespa.ai. "How Perplexity uses Vespa.ai to power fast, accurate, and trusted answers for millions of users." https://vespa.ai/perplexity/
[^16]: TechCrunch. "The New York Times is suing Perplexity for copyright infringement." December 5, 2025. https://techcrunch.com/2025/12/05/the-new-york-times-is-suing-perplexity-for-copyright-infringement/
[^17]: TechCrunch. "Perplexity launches a $200 monthly subscription plan." July 2, 2025. https://techcrunch.com/2025/07/02/perplexity-launches-a-200-monthly-subscription-plan/
[^18]: TechCrunch. "Perplexity reportedly raised $200M at $20B valuation." September 10, 2025. https://techcrunch.com/2025/09/10/perplexity-reportedly-raised-200m-at-20b-valuation/
[^19]: DemandSage. "Perplexity AI Statistics 2026." https://www.demandsage.com/perplexity-ai-statistics/
[^20]: 9to5Mac. "Perplexity AI 'Comet' browser for iPhone now available." March 18, 2026. https://9to5mac.com/2026/03/18/perplexity-brings-ai-comet-browser-to-iphone/
[^21]: Perplexity AI. "Sonar API Pricing." Perplexity Documentation. https://docs.perplexity.ai/docs/getting-started/pricing
[^22]: GPTZero. "What is perplexity and burstiness for AI detection?" https://gptzero.me/news/perplexity-and-burstiness-what-is-it/
[^23]: Merity, S. et al. (2016). "Pointer Sentinel Mixture Models." arXiv:1609.07843. (WikiText benchmark)
[^24]: Fortune. "Will Perplexity kill Google? Probably not, but Aravind Srinivas's startup is shaking up the AI race." https://fortune.com/article/perplexity-ceo-aravind-srinivas-ai/
[^25]: 9to5Mac. "Perplexity just gave its Comet AI browser an upgrade for iPad users with these features." April 28, 2026. https://9to5mac.com/2026/04/28/perplexity-just-gave-its-comet-ai-browser-an-upgrade-for-ipad-users-with-these-features/
[^26]: Axios. "Perplexity says publishers will get 80% cut of new subscription product's revenue." August 26, 2025. https://www.axios.com/2025/08/26/perplexity-comet-plus-subscription
[^27]: TechCrunch. "Perplexity's new Computer is another bet that users need many AI models." February 27, 2026. https://techcrunch.com/2026/02/27/perplexitys-new-computer-is-another-bet-that-users-need-many-ai-models/
[^28]: Perplexity AI Changelog. "Improved Computer Models and Enterprise Updates - May 4, 2026." https://www.perplexity.ai/changelog/improved-computer-models-and-enterprise-updates---may-4-2026
[^29]: MacRumors. "Perplexity Launches Personal Computer for Mac, Turning a Mac mini Into an Always-On AI Agent." April 16, 2026. https://www.macrumors.com/2026/04/16/perplexity-personal-computer-for-mac/
[^30]: 9to5Mac. "Perplexity AI app introduces 'all-new native Mac experience' for Personal Computer." May 7, 2026. https://9to5mac.com/2026/05/07/perplexity-ai-app-introduces-all-new-native-mac-experience-for-personal-computer/
[^31]: Perplexity AI. "Introducing Model Council." https://www.perplexity.ai/hub/blog/introducing-model-council
[^32]: Perplexity AI. "Introducing Perplexity Deep Research." https://www.perplexity.ai/hub/blog/introducing-perplexity-deep-research
[^33]: Perplexity AI Changelog (May 2026). Virtual Try-On feature announcement.
[^34]: Perplexity AI Changelog (May 2026). Finance live price embedding in answers.
[^35]: Data Center Dynamics. "Perplexity signs $750m cloud agreement with Microsoft." January 2026. https://www.datacenterdynamics.com/en/news/perplexity-signs-750m-cloud-agreement-with-microsoft/
[^36]: Perplexity AI. "Perplexity APIs deliver powerful AI to the world's largest Android device maker." https://www.perplexity.ai/hub/blog/perplexity-apis-deliver-powerful-ai-to-the-world%E2%80%99s-largest-android-device-maker
[^37]: TechCrunch. "Snap says its $400M deal with Perplexity 'amicably ended'." May 6, 2026. https://techcrunch.com/2026/05/06/snap-says-its-400m-deal-with-perplexity-amicably-ended/
[^38]: National Today. "Perplexity AI Seeks to Dismiss Copyright Lawsuits from NY Times, Chicago Tribune." March 2, 2026. https://nationaltoday.com/us/il/chicago/news/2026/03/02/perplexity-ai-seeks-to-dismiss-copyright-lawsuits-from-ny-times-chicago-tribune/
[^39]: CNBC. "Amazon wins court order to block Perplexity's AI shopping agent." March 10, 2026. https://www.cnbc.com/2026/03/10/amazon-wins-court-order-to-block-perplexitys-ai-shopping-agent.html
[^40]: Fox Business. "Perplexity AI bids on TikTok, CEO says it's a win-win for Trump." January 2025. https://www.foxbusiness.com/technology/perplexity-ai-bids-tiktok-ceo-says-its-win-win-trump
[^41]: Fridman, L. (2024). "Aravind Srinivas: Perplexity CEO on Future of AI, Search & the Internet." Lex Fridman Podcast #434 (transcript). https://lexfridman.com/aravind-srinivas-transcript/
[^42]: Yahoo Finance / Financial Times. "Perplexity ARR tops $450M after pricing shift, FT reports." April 8, 2026. https://ca.finance.yahoo.com/news/perplexity-arr-tops-450m-pricing-132500539.html