#Introducing Parallel FindAll

Parallel's new FindAll API turns natural language queries into custom datasets from the web. It finds entities like companies, people, or locations based on your criteria, then enriches them with structured data—all with citations. FindAll Pro achieves 61% recall, 3x better than competitors.

Tags:Product Release,Benchmarks

Reading time:4 min

Today, we're announcing the newest product in our suite of**Web Agent APIs**: the**FindAll API**.

**FindAll** is the best way to create your own custom database from the web, with just a simple natural language query. It’s available now to try in the ParallelDeveloper Platform[Developer Platform]($https://platform.parallel.ai/play/monitor).

##Turn the web into your own structured dataset

**FindAll** finds any set of entities (companies, people, events, locations, houses, etc.) based on a set of match criteria. For example, with**FindAll,**you can run a natural language query like “Find all dental practices located in Ohio that have 4+ star Google reviews.”

Find all dental practicies in ohio with a 4+ star rating on google — An example of a FindAll query

This is a powerful way to discover the complete long tail of interesting entities from the web and filter them down with match criteria that are personalized to your unique use case. The result is an extensible tool that can produce high-quality datasets on demand, as opposed to buying static, stale, and generic datasets.

##How FindAll works

FindAll executes a three-stage pipeline optimized for both coverage and efficiency:

**1. Generate candidates from web data: FindAll** searches across our proprietary web index to identify potential entities matching your query. Unlike traditional search, which returns a fixed result set,**FindAll** generates candidates dynamically based on your specific criteria.

**2. Evaluate against match conditions:**Each candidate is evaluated against your match conditions using multi-hop reasoning across web sources. Only candidates which satisfy all conditions reach matched status and are included in the results. This staged approach means you only pay to process entities that actually matter.

**3. Extract Structured Enrichments:**For matched entities,**FindAll** automatically orchestrates our**Task API**[**Task API**]($https://docs.parallel.ai/task-api/task-quickstart) to extract any additional fields you've specified— from basic attributes like revenue and employee count to complex data points like the strategic initiatives a company is prioritizing.

Illustration demonstrating deep research API concepts, web search capabilities, or AI agent integration features

Every data point returned includes comprehensive verification through our**Basis framework**[**Basis framework**]($https://docs.parallel.ai/task-api/guides/access-research-basis)— citations linking to source materials, detailed reasoning for match decisions, relevant excerpts from web pages, and calibrated confidence scores. This granular attribution enables human-in-the-loop workflows for verifiability and provenance.

##State-of-the-art performance

To test the performance of**FindAll**, we created our own benchmark of 40 complex multi-criteria queries covering public companies, startups, SMBs, specialized entities, and people (e.g., executives, researchers, and professionals). Recall measures the proportion of all correct matches within the entire competitive set of successfully identified entities.

Some sample questions:

-"Find all former McKinsey & Company consultants who are currently employed in C-level or VP positions at healthcare technology startups with Series A or later funding" — combines employment history, current role level, industry focus, and funding stage.
-"Find all wedding venues in Florida with capacity between 150-300 guests that offer both indoor and outdoor ceremony options, provide in-house catering, and have availability in 2025" — combines location, capacity range, facility features, service offerings, and temporal availability.
-"Find all climate technology startups that have active pilot programs with Fortune 500 companies, raised pre-Series A funding, and focus on carbon capture or renewable energy storage" — combines industry focus, corporate partnerships, funding stage, and specific technology areas.

****

**FindAll Pro** achieves state-of-the-art results with 61% recall, ~3X higher than OpenAI Deep Research, Anthropic Deep Research, and Exa. Higher recall means that Parallel**FindAll**finds more correct matches for a given query.** FindAll****Base** also achieves 30% recall while being the lowest cost on the market, making it the most cost-effective yet performant option.

WISER-FindAll

COST (CPM)

RECALL (%)

Loading chart...

CPM: USD per 1000 requests. Cost is shown on a Log scale.

Parallel

Others

BrowseComp benchmark analysis: CPM: USD per 1000 requests. Cost is shown on a Log scale. . Evaluation shows Parallel's enterprise deep research API for AI agents achieving up to 48% accuracy, outperforming GPT-4 browsing (1%), Claude search (6%), Exa (14%), and Perplexity (8%). Enterprise-grade structured deep research performance across Cost (CPM) and Recall (%). State-of-the-art enterprise deep research API with structured data extraction built for ChatGPT deep research and complex multi-hop AI agent workflows.

###Benchmark

This benchmark, created by Parallel, contains 40 complex multi-criteria queries covering public companies, startups, SMBs, specialized entities, and people (e.g., executives, researchers, professionals).

###Methodology

To measure recall we take the number of correct matches / total entities in the ground truth dataset. The ground truth dataset is created by taking the union of all correct matches across the competitor set. Cost is calculated as the average cost to find 1000 correct matches.

###Testing dates

Nov 13th-17th, 2025

###Parallel-FindAll

| Series   | Model                   | Cost (CPM) | Recall (%) || -------- | ----------------------- | ---------- | ---------- || Parallel | FindAll Base            | 60         | 30.3       || Parallel | FindAll Core            | 230        | 52.5       || Parallel | FindAll Pro             | 1430       | 61.3       || Others   | OpenAI Deep Research    | 250        | 21         || Others   | Anthropic Deep Research | 1000       | 15.3       || Others   | Exa                     | 110        | 19.2       |

CPM: USD per 1000 requests. Cost is shown on a Log scale.

###Benchmark

###Methodology

###Testing dates

Nov 13th-17th, 2025

FindAll can be used to find a broad set of entities across a range of criteria. There are many powerful and diverse use cases we’ve seen:

-**Finding sales leads that match your ICP**: “Find all F500 companies with a senior AI leader that joined the company in the last 6 months”
-**Finding acquisition targets as a hedge fund**: "Find all residential roofing companies in Charlotte, NC with 10-50 employees"
-**Finding public companies to invest in**: "Find all S&P 500 companies that cited tariffs as a key risk in their latest 10-K"
-**Finding competitors to keep track of**: "Find all productivity tools targeting remote teams that launched in the last year"
-**Creating market maps**: "Find all AI infrastructure providers that raised Series B in the last 6 months"
-**Finding potential suppliers and factories**: "Find all semiconductor equipment manufacturers with facilities in Southeast Asia."
-**Researching regulatory environments:**“Find all environmental lawsuits in the United States where a court ruling was reached in 2025”

##Get started creating entire datasets from the web

The**FindAll** API is available today. Get started with ourDeveloper Platform[Developer Platform]($https://platform.parallel.ai/) or dive into thedocumentation[documentation]($https://docs.parallel.ai/findall-api/findall-quickstart).

###Create a FindAll run

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
import requestsurl ="https://api.parallel.ai/v1beta/findall/runs"payload = {"objective":"<string>","entity_type":"<string>","match_conditions": [        {"name":"<string>","description":"Company must have SOC2 Type II certification (not Type I). Look for evidence in: trust centers, security/compliance pages, audit reports, or press releases specifically mentioning 'SOC2 Type II'. If no explicit SOC2 Type II mention is found, consider requirement not satisfied."        }    ],"generator":"base","match_limit":123}headers = {"x-api-key":"<api-key>","Content-Type":"application/json"}response = requests.post(url, json=payload, headers=headers)print(response.json())```import requests
 
url = "https://api.parallel.ai/v1beta/findall/runs"
 
payload = {
    "objective": "<string>",
    "entity_type": "<string>",
    "match_conditions": [
        {
            "name": "<string>",
            "description": "Company must have SOC2 Type II certification (not Type I). Look for evidence in: trust centers, security/compliance pages, audit reports, or press releases specifically mentioning 'SOC2 Type II'. If no explicit SOC2 Type II mention is found, consider requirement not satisfied."
        }
    ],
    "generator": "base",
    "match_limit": 123
}
headers = {
    "x-api-key": "<api-key>",
    "Content-Type": "application/json"
}
 
response = requests.post(url, json=payload, headers=headers)
 
print(response.json())
```

##About Parallel Web Systems

Parallel develops critical web search infrastructure for AI. Our suite of web search and agent APIs is built on a rapidly growing proprietary index of the global internet. These solutions transform human tasks that previously took days and weeks into agentic tasks that now take seconds and minutes.

Fortune 100 and 500 companies use Parallel’s web intelligence APIs in insurance, finance, and retail, as well as AI-first businesses like Clay, Starbridge, and Sourcegraph.

By Parallel

November 18, 2025

##Related Posts39

-[How Amp’s coding agents build better software with Parallel Search](https://parallel.ai/blog/case-study-amp)

Tags:Case Study

Reading time:3 min

Movatterモバイル変換

#Introducing Parallel FindAll

##Turn the web into your own structured dataset

##How FindAll works

##State-of-the-art performance

###**Benchmark**

###**Methodology**

###**Testing dates**

###Parallel-FindAll

###**Benchmark**

###**Methodology**

###**Testing dates**

FindAll can be used to find a broad set of entities across a range of criteria. There are many powerful and diverse use cases we’ve seen:

##Get started creating entire datasets from the web

##About Parallel Web Systems

##Related Posts39

-[How Amp’s coding agents build better software with Parallel Search](https://parallel.ai/blog/case-study-amp)

-[Latency improvements on the Parallel Task API](https://parallel.ai/blog/task-api-latency)

-[Introducing Parallel Extract](https://parallel.ai/blog/introducing-parallel-extract)

-[Introducing Parallel Monitor](https://parallel.ai/blog/monitor-api)

-[Parallel raises $100M Series A to build web infrastructure for agents](https://parallel.ai/blog/series-a)

-[How Macroscope reduced code review false positives with Parallel](https://parallel.ai/blog/case-study-macroscope)

-[Introducing Parallel Search: the highest accuracy web search API engineered for AI](https://parallel.ai/blog/introducing-parallel-search)

-[Parallel processors set new price-performance standard on SealQA benchmark](https://parallel.ai/blog/benchmarks-task-api-sealqa)

-[Introducing LLMTEXT, an open source toolkit for the llms.txt standard](https://parallel.ai/blog/LLMTEXT-for-llmstxt)

-[How Starbridge powers public sector GTM with state-of-the-art web research](https://parallel.ai/blog/case-study-starbridge)

-[Building a market research platform with Parallel Deep Research](https://parallel.ai/blog/cookbook-market-research-platform-with-parallel)

-[How Lindy brings state-of-the-art web research to automation flows](https://parallel.ai/blog/case-study-lindy)

-[Introducing the Parallel Task MCP Server](https://parallel.ai/blog/parallel-task-mcp-server)

-[Introducing the Core2x Processor for improved compute control on the Task API](https://parallel.ai/blog/core2x-processor)

-[How Day AI merges private and public data for business intelligence](https://parallel.ai/blog/case-study-day-ai)

-[Full Basis framework for all Task API Processors](https://parallel.ai/blog/full-basis-framework-for-task-api)

-[Building a real-time streaming task manager with Parallel](https://parallel.ai/blog/cookbook-sse-task-manager-with-parallel)

-[How Gumloop built a new AI automation framework with web intelligence as a core node](https://parallel.ai/blog/case-study-gumloop)

-[Introducing the TypeScript SDK](https://parallel.ai/blog/typescript-sdk)

-[Building a serverless competitive intelligence platform with MCP + Task API](https://parallel.ai/blog/cookbook-competitor-research-with-reddit-mcp)

-[Introducing Parallel Deep Research reports](https://parallel.ai/blog/deep-research-reports)

-[A new pareto-frontier for Deep Research price-performance](https://parallel.ai/blog/deep-research-benchmarks)

-[Building a Full-Stack Search Agent with Parallel and Cerebras](https://parallel.ai/blog/cookbook-search-agent)

-[Webhooks for the Parallel Task API](https://parallel.ai/blog/webhooks)

-[Introducing Parallel: Web Search Infrastructure for AIs](https://parallel.ai/blog/introducing-parallel)

-[Introducing SSE for Task Runs](https://parallel.ai/blog/sse-for-tasks)

-[A new line of advanced processors: Ultra2x, Ultra4x, and Ultra8x](https://parallel.ai/blog/new-advanced-processors)

-[Introducing Auto Mode for the Parallel Task API](https://parallel.ai/blog/task-api-auto-mode)

-[A state-of-the-art search API purpose-built for agents](https://parallel.ai/blog/search-api-benchmark)

-[Parallel Search MCP Server in Devin](https://parallel.ai/blog/parallel-search-mcp-in-devin)

-[Introducing Tool Calling via MCP Servers](https://parallel.ai/blog/mcp-tool-calling)

-[Introducing the Parallel Search MCP Server](https://parallel.ai/blog/search-mcp-server)

-[Introducing Source Policy](https://parallel.ai/blog/source-policy)

-[The Parallel Task Group API](https://parallel.ai/blog/task-group-api)

-[State of the Art Deep Research APIs](https://parallel.ai/blog/deep-research)

-[Introducing the Parallel Search API](https://parallel.ai/blog/parallel-search-api)

-[Introducing the Parallel Chat API](https://parallel.ai/blog/chat-api)

-[Introducing Basis with Calibrated Confidences](https://parallel.ai/blog/introducing-basis-with-calibrated-confidences)

-[Introducing the Parallel Task API](https://parallel.ai/blog/parallel-task-api)

Info

###Benchmark

###Methodology

###Testing dates

###Benchmark

###Methodology

###Testing dates