Research & Analysis
Last reviewed
May 9, 2026
Sources
26 citations
Review status
Source-backed
Revision
v3 ยท 3,958 words
Improve this article
Add missing citations, update stale details, or suggest a clearer explanation.
Last reviewed
May 9, 2026
Sources
26 citations
Review status
Source-backed
Revision
v3 ยท 3,958 words
Add missing citations, update stale details, or suggest a clearer explanation.
Research & Analysis describes the application of artificial intelligence, and especially large language models, to the work of finding information, reading sources, summarising documents, exploring datasets, and producing analytic deliverables. This page is a gateway to the tools and concepts in that space. It covers general purpose answer engines, scientific literature assistants, document question answering systems, data analytics copilots, spreadsheet copilots, business intelligence assistants, survey and user research platforms, and vertical products for legal, financial, and patent work.
The shift from traditional keyword search to retrieval-augmented language models began in 2020 and accelerated after the launch of ChatGPT in November 2022. By 2025 the largest professional information vendors, Thomson Reuters, RELX (LexisNexis), Bloomberg, and S&P Global, had each shipped generative interfaces over their proprietary archives, and a wave of startups had built point solutions for systematic reviews, due diligence, financial modelling, and patent search. Most of these systems combine a hosted LLM with retrieval-augmented generation so that answers are grounded in citable source documents rather than the model's parametric memory.
See also: Academic Research ChatGPT Plugins, Search Engine ChatGPT Plugins, and Data Visualization ChatGPT Plugins.
General research assistants are answer engines that combine a web crawl with a generative model. They differ from a classic search engine by writing a synthesised response with inline footnotes, instead of returning a list of links.
Perplexity AI, founded in August 2022 by Aravind Srinivas, Denis Yarats, Johnny Ho, and Andy Konwinski, is the most widely used standalone product in this category. Perplexity's free tier serves a basic web answer, while Perplexity Pro routes through frontier models and adds longer follow-up threads. The company raised about $200 million in September 2025 at a $20 billion post-money valuation, with backers including Nvidia, Databricks, Jeff Bezos, and Tobias Lutke. A separate Sonar API exposes the underlying retrieval layer to developers.
ChatGPT Search is the search surface inside ChatGPT. OpenAI opened it to free users in late 2024 and added it to the default model behaviour in 2025. ChatGPT Search reads pages in real time, cites publishers, and routes long requests to the Deep Research mode that produces multi-page reports. Deep Research is benchmarked separately on systems such as Deep Research Bench and DeepResearch Bench.
Claude from Anthropic added native web search inside Claude.ai in 2025, with citations rendered alongside each paragraph. The Computer Use and tool-use APIs let third parties build their own research front ends on top of the same Claude models. Gemini from Google ships a Deep Research mode that runs longer multi-step browsing sessions and writes a report; the same mode is exposed through the Gemini Advanced subscription and inside Google Workspace.
You.com launched in late 2021 and pivoted to generative answers in 2023 with You Chat. Phind, a Y Combinator company, focuses on developer questions; it ranks Stack Overflow, GitHub README files, and official docs ahead of general web pages and adds a code interpreter that runs Python in the browser. Brave Search AI, run by Brave Software, layers an LLM summary over Brave's independent web index and emphasises private browsing.
| Tool | Vendor | Model strategy | Differentiator |
|---|---|---|---|
| Perplexity | Perplexity AI | Routes across GPT, Claude, and in-house Sonar | Standalone consumer product, Sonar API |
| ChatGPT Search | OpenAI | GPT family with Deep Research | Built into the chat product, large free tier |
| Claude with Search | Anthropic | Claude Opus, Sonnet, Haiku | Long context window, paragraph level citations |
| Gemini Deep Research | Gemini family | Tied to Workspace and Google Search index | |
| You.com | You.com Inc. | Multi-model router | Modes for research, writing, code |
| Phind | Phind Inc. | Custom and frontier models | Developer focus, in-browser code execution |
| Brave Search AI | Brave Software | Mixed | Independent web index, no third party trackers |
General assistants struggle with peer reviewed sources because most journal articles sit behind paywalls and are not in the open web crawl. A separate group of products has been built specifically for scientific literature.
Elicit, incubated at the non-profit research lab Ought and now an independent public benefit corporation, launched in 2023 with co-founders Andreas Stuhlmuller and Jungwon Byun. Elicit pulls publications from Semantic Scholar and extracts study population, intervention, and outcome fields from each paper into a comparison matrix. The 2025 release added a structured systematic review workflow that runs an extraction template across hundreds of papers in parallel.
Consensus, founded in 2021 by Christian Salem and Eric Olson, indexes more than 220 million papers and answers natural language questions with a Consensus Meter that summarises agreement across the underlying literature. Its Scholar Agent, released in 2025, is built on GPT-5 and the OpenAI Responses API. The platform reports more than 8 million users.
Semantic Scholar is the underlying corpus that powers many of the products on this page. It is a free service from the Allen Institute for AI (AI2) and has indexed roughly 220 million papers as of 2026. Its open API and the related Semantic Scholar Open Research Corpus (S2ORC) are widely used to build downstream tools.
Scite.ai introduces the concept of Smart Citations: it labels each citation as supporting, contrasting, or merely mentioning the cited paper, so that a reader can see at a glance whether a finding has been replicated. The Scite Assistant added an LLM front end in 2024 that answers questions and grounds each claim in a list of supporting and contrasting citations.
Connected Papers and ResearchRabbit visualise citation networks rather than answer questions in prose. Connected Papers builds a force-directed graph around a seed paper, useful for understanding the cluster of work most closely related to a single article. ResearchRabbit, acquired by Litmaps in 2025, supports ongoing literature collections that grow as new papers appear; it was re-released in November 2025 on a freemium plan.
| Tool | Vendor | Corpus | Primary use |
|---|---|---|---|
| Elicit | Elicit Inc. (formerly Ought) | Semantic Scholar | Systematic reviews, data extraction |
| Consensus | Consensus | 220M papers | Yes/no questions with Consensus Meter |
| Semantic Scholar | Allen Institute for AI | 220M papers | Free academic search and API |
| Scite.ai | Research Solutions | Cited references | Smart Citations and Scite Assistant |
| Connected Papers | Connected Papers Ltd. | Semantic Scholar | Single-seed citation graphs |
| ResearchRabbit | Litmaps (acquired 2025) | Semantic Scholar, PubMed | Ongoing reading lists, alerts |
A second cluster of tools answers questions over documents that the user provides, rather than the open web. The pattern is sometimes called retrieval over private corpora or RAG over uploads.
NotebookLM is Google's flagship product in this category. It launched as Project Tailwind at Google I/O 2023 and was rebranded to NotebookLM later that year. Each notebook accepts up to a few hundred sources, including PDFs, Google Docs, web URLs, and YouTube transcripts; the model answers only from those sources and includes inline citations to the underlying passages. The 2024 Audio Overview feature generates a two-host podcast summary of the notebook, and Google added interactive audio (the user can break in and ask the hosts a question) in December 2024. NotebookLM Plus, launched in early 2025, raised the document and notebook limits and added shareable team notebooks. Video Overviews, slide decks, and structured Data Tables outputs followed in 2025.
ChatGPT and Claude both support direct file uploads with question answering against the contents. ChatGPT exposes this in the standard chat window and through Custom GPTs, where a knowledge base of up to 20 files can be attached to a tailored assistant. Claude's Projects feature stores up to 200,000 tokens of reference material and reuses it across every conversation in the project.
Humata, launched in 2022, is a focused PDF question answering product used in regulated industries; it advertises GDPR compliant hosting and a team workspace that retains source citations. Scholarcy summarises long PDFs into structured flashcards that highlight study purpose, sample size, methods, and findings; it is widely used in graduate programmes and research libraries.
For work that involves running calculations on a dataset, the dominant tools are LLM-driven Python and SQL environments rather than chat surfaces.
ChatGPT's Code Interpreter, also marketed as Advanced Data Analysis and as the Python tool inside ChatGPT, runs sandboxed Python with NumPy, pandas, scikit-learn, and matplotlib. Users upload a CSV or Excel file and ask questions in plain language; the model writes code, runs it, and shows both the chart and the underlying script. The same pattern appears in Claude's Analysis tool and in Gemini's Python execution environment.
Julius AI is a standalone product that competes directly with Code Interpreter for analytics workflows. Julius accepts CSV, Excel, PDF, and database connections to Snowflake, BigQuery, and Postgres; it advertises support for files up to 32 GB, much larger than the 512 MB cap on ChatGPT uploads. The product writes Python and R, builds visualisations, runs predictive models, and produces formatted memos for stakeholders.
Hex is a notebook environment for data teams that combines SQL, Python, and a no-code report builder in a single canvas. Its Notebook Agent, released in 2025, is built on Claude Sonnet 4 and chains together SQL queries, transformations, and visualisation cells to answer a single business question. Hex publishes case studies with Anthropic, Ramp, Figma, and others.
Polymer is a self-serve business intelligence tool that turns a Google Sheet, CSV, or warehouse table into an interactive dashboard with no setup. It bundles its own AI Assistant that suggests visualisations and writes commentary on a chart's findings.
Spreadsheets remain the working surface for most analysts, and the major office suites have integrated AI directly into the cell grid.
Microsoft Copilot for Excel, part of Microsoft 365 Copilot, generates formulas, formats tables, runs what-if analyses, and writes summaries from a worksheet. It can produce charts from a natural language prompt and surface anomalies in long tables. The Excel implementation runs alongside a Python in Excel feature that lets users execute Python code on data inside a workbook.
Gemini in Google Sheets adds a side panel that drafts formulas, generates charts, and answers questions about the active sheet. It is included with the Gemini Business and Enterprise tiers of Google Workspace. The Sheets implementation reads the cell grid as structured context, so the model can refer to specific named ranges in its replies.
Equals is a query-driven spreadsheet built around natural language SQL: every cell can pull data live from a connected database without writing a query by hand. Rows is a similar product oriented to business operations teams, with prebuilt connectors for Stripe, HubSpot, Google Analytics, and Shopify. Both products advertise an LLM that converts a plain English question into the structured query under the hood.
| Tool | Vendor | Surface | Notable feature |
|---|---|---|---|
| Excel Copilot | Microsoft | Microsoft 365 | Formulas, charts, anomaly detection, Python in Excel |
| Gemini in Sheets | Google Workspace | Side panel, named range awareness | |
| Equals | Equals Inc. | Standalone web sheet | Live database queries from cells |
| Rows | Rows GmbH | Standalone web sheet | Prebuilt SaaS connectors |
The established BI vendors have shipped generative layers that sit on top of an existing semantic model.
Tableau Pulse from Salesforce announced in May 2023 and reached general availability in February 2024. Pulse watches a small set of business metrics defined by an analyst, then writes plain English summaries of how each metric is moving and which dimensions are driving the change. Pulse for Salesforce, released in March 2025, embeds the same insights inside Sales Cloud. Pulse is included with all Tableau Cloud editions.
Power BI Copilot, part of Microsoft Fabric, was generally available across Power BI from 2024. Inside a Power BI report it can author a new page from a prompt, write Data Analysis Expressions (DAX) for measures, generate narrative summaries, and answer questions across the full report rather than a single page. A standalone Copilot experience added in 2025 lets users ask questions across every report and semantic model they have access to in their tenant; a Fabric data agent can be invoked from the same surface for questions over an entire warehouse.
Looker (from Google Cloud), Qlik, ThoughtSpot, and Domo have all added comparable Copilot features, generally on the same RAG over a semantic model pattern. The point is that the LLM is constrained to the dimensions and measures the analyst has defined, which keeps the answers grounded and prevents the model from inventing fields.
Product teams use AI to read and cluster qualitative interviews and survey responses.
Dovetail is a research repository that automatically tags transcripts, clusters similar quotes into themes, and generates an executive summary at the project level. It has integrations with Zoom, Microsoft Teams, and Otter for ingesting recorded sessions.
Maze is a usability testing platform that uses AI to draft moderator scripts, score open ended responses, and surface usability friction across a study. The 2025 release added an AI Interviewer that conducts and transcribes unmoderated voice interviews at scale. Both tools are positioned for product, design, and UX teams rather than academic researchers.
Legal information is one of the most heavily invested vertical applications, because the work is high value, the source material is well structured, and law firms have a long history of paying for premium databases.
Harvey was incubated by the OpenAI Startup Fund in late 2022 and is the highest profile pure-play legal AI startup. By March 2026 the company had raised more than $1 billion across nine rounds and was valued at $11 billion in a round co-led by GIC and Sequoia Capital. Harvey is built for law firm workflows: contract review, deposition prep, due diligence, and litigation drafting. Its agents pull from firm document management systems and from public legal corpora.
Casetext launched CoCounsel in March 2023 as one of the first GPT-4 based legal assistants. Thomson Reuters acquired Casetext in August 2023 for $650 million in cash. CoCounsel 2.0, released in 2024, runs across multiple frontier models and feeds Westlaw content to the LLMs. Westlaw Precision, the flagship Thomson Reuters research product, uses RAG over the Westlaw classification system and editorial archive (more than 150 years of head notes and key numbers). Thomson Reuters reported more than 1.5 million AI Assisted Research searches across about 6,000 customers in the first year. In August 2025 Thomson Reuters launched CoCounsel Legal with Deep Research, an agentic capability that produces multi-section memos with arguments on both sides of an issue.
Lexis+ AI, launched in 2023 by RELX subsidiary LexisNexis, is the equivalent product on the LexisNexis platform. The next generation, Lexis+ with Protege, was announced in May 2026 and adds workflow automation, agentic task completion, and routing across LLMs from Anthropic, Google, and OpenAI. The platform is built on LexisNexis editorial content and Shepard's citator intelligence; customer inputs are not used to train any underlying LLM.
A second wave of niche legal AI tools has emerged for specific tasks: Eve and EvenUp for plaintiffs' personal injury work, Spellbook for transactional drafting, and LegalGPT-style assistants on top of base ChatGPT (see LegalGPT and Legal ChatGPT Plugins).
| Tool | Vendor | Status | Use case |
|---|---|---|---|
| Harvey | Harvey Inc. | $11B valuation, March 2026 | Big law workflows, due diligence |
| Casetext CoCounsel | Thomson Reuters | Acquired August 2023 | Document review, depositions |
| Westlaw Precision AI | Thomson Reuters | RAG over Westlaw archive | Statutory and case law research |
| Lexis+ with Protege | RELX (LexisNexis) | Launched May 2026 | Drafting, research, agentic workflows |
Financial research has its own set of vertical AI products focused on filings, transcripts, and structured market data.
Bloomberg has integrated AI throughout its terminal. The company published BloombergGPT in March 2023, a 50 billion parameter model trained on 363 billion tokens of Bloomberg's proprietary financial corpus and 345 billion tokens of general purpose data. The model powers natural language search, sentiment analysis on news, and conversion of plain English questions into Bloomberg Query Language (BQL). The terminal's later AI Assistant features (transcript summarisation, document Q&A, earnings comparison) sit on top of the same internal stack.
Hebbia, founded in 2020 by former Stanford PhD student George Sivulka, raised $130 million in a Series B led by Andreessen Horowitz in July 2024 at a $700 million valuation. Hebbia's Matrix product is a large-scale document RAG system used by hedge funds and private equity firms for due diligence. The company reports that 30 percent of asset managers use Hebbia and that the platform processed more than one billion pages in the year ending September 2025, partly through a partnership with FactSet.
AlphaSense, founded in 2011 by Jack Kokko and Raj Neervannan, started as a search engine over financial filings and earnings call transcripts and added a generative summarisation layer in 2023. The company is valued at $4 billion (2024) and lists 88 percent of the S&P 100 as customers. Its index spans more than 500 million documents in 37 languages, and the 2025 acquisition of Tegus expanded its expert call transcript library.
Daloopa, founded in 2019 in New York by Thomas Li, Jeremy Huang, and Daniel Chen, automates financial model building. Its agents extract structured KPIs from filings, presentations, and supplements, then write the values directly into an analyst's Excel model with cell-level audit links back to the source. The company reports coverage of 5,500+ tickers and serves both buy-side and sell-side firms.
See also AI in finance and Finance ChatGPT Plugins for adjacent products.
| Tool | Vendor | Coverage | Differentiator |
|---|---|---|---|
| BloombergGPT and Bloomberg AI Assistant | Bloomberg L.P. | Bloomberg terminal data | 50B parameter domain model, BQL generation |
| Hebbia | Hebbia Inc. | Buy-side documents | Document agents at billion-page scale |
| AlphaSense | AlphaSense Inc. | 500M+ filings and transcripts | Search plus summary, S&P 100 customer base |
| Daloopa | Daloopa Inc. | 5,500+ tickers | Auto-builds Excel models with cell-level citations |
Patent search is a specialised research domain because the corpus is large (over 200 million documents from more than 180 patent offices), the language is technical and domain-specific, and the consequences of a missed prior art reference can run to many millions of dollars.
PatSnap is the dominant AI-driven patent platform, with more than 15,000 customers worldwide. It indexes 202 million+ patents along with non-patent literature and chemical structure data, and translates documents from major filing languages into English. PatSnap shipped a domain-specific LLM in 2024 trained on its proprietary innovation data; its Novelty Search AI Agent uses RAG over claims and prior art to reduce model hallucination. The platform also integrates with FreedomToOperate and competitive landscape workflows.
Other vendors in the space include LexisNexis IP (PatentSight, TotalPatent One), Clarivate (Derwent Innovation), and Questel (Orbit Intelligence), each of which has shipped a generative answer feature on top of its existing patent corpus.
Research and analysis is one of the highest stakes domains for LLM deployment, because errors in a citation or a financial figure can survive into a published memo or filing. Several recurring failure modes are worth flagging.
Fabricated citations, sometimes called hallucinations, are the most public risk. In the 2023 Avianca litigation in the Southern District of New York, attorneys submitted a brief that cited six cases, none of which existed. The cases had been generated by ChatGPT and the lawyer had not checked them against a primary source. The judge sanctioned the firm; the case is now a standard reference for why every legal AI vendor stresses retrieval grounding and citation links back to a verifiable document. Even high quality systems still occasionally invent a section number, swap two case names, or paraphrase a holding into something that the underlying source does not actually say. Cross-checking each citation against the primary source remains a basic professional duty.
Data confidentiality is the second concern, especially in finance and law. Sending a client document to a public chat surface can violate engagement letters, attorney-client privilege, GDPR, HIPAA, or institutional confidentiality policies. Vendors targeting professional users address this with enterprise tiers that contractually exclude inputs from training, host the model in a single-tenant or BYO-cloud configuration, and pass SOC 2 Type II and ISO 27001 audits. The Lexis+ with Protege launch announcement, for example, explicitly states that customer inputs are not used to train any LLM. Buyers should confirm the relevant data processing addendum and check whether the vendor uses retention windows for prompt logs.
A third concern is reproducibility. Model outputs vary between runs and across model versions, so the same prompt can return different summaries on different days. For systematic reviews and regulated workflows, teams typically pin a model version, log every prompt and response, and treat the LLM as a draft tool rather than a final author. Many of the vertical products listed above ship audit logs and revision histories specifically to address this.
Finally, evaluation is hard. Standard benchmarks like Deep Research Bench, DeepResearch Bench, and LegalBench provide a starting point, but real research work involves judgement calls that benchmarks do not capture. Most professional teams run their own internal accuracy reviews against a known good corpus before rolling a tool out broadly.