Perplexity Scraper Track brand mentions, analyze rankings, and gain competitor intelligence from Perplexity. Get started in minutes.


oxylabs/perplexity-scraper


The Perplexity Scraper by Oxylabs allows developers to send prompts to Perplexity and automatically collect both AI-generated responses and structured metadata. Besides raw HTML, it can return results as parsed JSON, a PNG screenshot of the page, captured XHR/Fetch requests, or Markdown output.

You can use Oxylabs’ Web Scraper API with Perplexity for AI content auditing, research tracking, and analyzing SEO performance. It handles dynamic AI-generated content, supports real-time SERP extraction, and integrates with Oxylabs' global proxy infrastructure, so you don't need to manage proxies or browsers, or worry about anti-bot systems.

How it works

The Perplexity scraper handles the rendering, parsing, and delivery of results in any requested format. You need to provide your prompt, credentials, and a few optional parameters, as shown below.

Request sample (Python)

    import json
    import requests

    # API parameters.
    payload = {
        'source': 'perplexity',
        'prompt': 'top 3 smartphones in 2025, compare pricing across US marketplaces',
        'geo_location': 'United States',
        'parse': True
    }

    # Get a response.
    response = requests.post(
        'https://realtime.oxylabs.io/v1/queries',
        auth=('USERNAME', 'PASSWORD'),
        json=payload
    )

    # Print response to stdout.
    print(response.json())

    # Save response to a JSON file.
    with open('response.json', 'w') as file:
        json.dump(response.json(), file, indent=2)

More request examples in different programming languages are available here.

Note: By default, all requests to Perplexity use JavaScript rendering. Make sure to set a sufficient timeout (e.g. 180s) when using the Realtime integration method.
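As a rough sketch of the note above, the request can be wrapped in a small helper that always passes an explicit timeout to `requests.post`. The helper name `send_prompt` and its optional `session` argument are illustrative assumptions, not part of the Oxylabs API; only the endpoint, the payload keys, and the 180-second figure come from this README.

```python
import requests

REALTIME_URL = "https://realtime.oxylabs.io/v1/queries"


def send_prompt(prompt, username, password, timeout=180, session=None):
    """POST a Perplexity prompt to the Realtime endpoint and return the JSON body.

    `send_prompt` and `session` are hypothetical names for this sketch.
    `timeout` is in seconds; 180 follows the recommendation in the note above.
    """
    payload = {"source": "perplexity", "prompt": prompt, "parse": True}
    # Use the caller's session if one is given, otherwise the requests module itself.
    http = session or requests
    response = http.post(
        REALTIME_URL,
        auth=(username, password),
        json=payload,
        timeout=timeout,
    )
    # Raise an exception for 4xx/5xx answers instead of parsing an error body.
    response.raise_for_status()
    return response.json()
```

Without a `timeout` argument, `requests` waits indefinitely, so passing one explicitly keeps a stalled job from blocking the caller.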

Request parameters

| Parameter | Description | Default value |
| --- | --- | --- |
| source* | Sets the Perplexity scraper. | perplexity |
| prompt* | The prompt or question to submit to Perplexity. | |
| parse | Returns parsed data when set to true. | true |
| geo_location | Specify a country to send the prompt from. More info. | |
| callback_url | URL to your callback endpoint. More info. | |

* Mandatory parameters
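The parameter table can be mirrored in a small payload builder that enforces the two mandatory fields and adds optional ones only when they are set. This is a minimal sketch: `build_payload` is a hypothetical helper, and rejecting an empty prompt on the client side is an assumption of this example rather than documented API behavior.

```python
def build_payload(prompt, geo_location=None, parse=True, callback_url=None):
    """Build a request payload from the parameters listed in the table above.

    `build_payload` is an illustrative helper, not an Oxylabs function.
    """
    # `prompt` is mandatory; refusing blank prompts here is this sketch's choice.
    if not prompt or not prompt.strip():
        raise ValueError("prompt is mandatory and must not be empty")

    # `source` is mandatory and fixed to the Perplexity scraper.
    payload = {"source": "perplexity", "prompt": prompt, "parse": parse}

    # Optional parameters are only sent when the caller provides them.
    if geo_location is not None:
        payload["geo_location"] = geo_location
    if callback_url is not None:
        payload["callback_url"] = callback_url
    return payload
```

For example, `build_payload("best laptops", geo_location="United States")` produces a dictionary that can be passed as the `json` argument of the request sample above.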


Output samples

Web Scraper API returns either an HTML document or a JSON object of Perplexity scraper output, which contains structured data from the results page.

HTML example:

[Image: HTML example]

Structured JSON output snippet:

    {
        "results": [
            {
                "content": {
                    "url": "https://www.perplexity.ai/search/top-3-smartphones-in-2025-comp-wvA0dso7TgW3NpgF8Jd8tg",
                    "model": "turbo",
                    "top_images": ["url + title"],
                    "top_sources": ["url + title + source"],
                    "prompt_query": "top 3 smartphones in 2025, compare pricing across US marketplaces",
                    "answer_results": ["answer in JSON"],
                    "displayed_tabs": [
                        "search",
                        "images",
                        "sources"
                    ],
                    "related_queries": [
                        "How do the prices of the top 3 smartphones compare across US marketplaces",
                        "What features make the Galaxy S25 Ultra stand out as the best in 2025",
                        "Why is the Pixel 9a considered a top budget option despite its lower price",
                        "How does the iPhone 16 Pro Max's pricing differ from Samsung and Google models",
                        "What factors should I consider when choosing among these top smartphones in 2025"
                    ],
                    "answer_results_md": ["answer in Markdown"],
                    "parse_status_code": 12000
                },
                "created_at": "2025-07-16 12:14:32",
                "updated_at": "2025-07-16 12:15:28",
                "page": 1,
                "url": "https://www.perplexity.ai/search/top-3-smartphones-in-2025-comp-wvA0dso7TgW3NpgF8Jd8tg",
                "job_id": "7351222707934990337",
                "is_render_forced": false,
                "status_code": 200,
                "parser_type": "perplexity",
                "parser_preset": null
            }
        ]
    }

You can find the full output example file here in this repository.

Alternatively, you can extract the data in the Markdown format for easier data integration workflows involving AI tools.
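One way to get Markdown out of a parsed response is to read the `answer_results_md` field shown in the JSON snippet. The sketch below assumes the response shape from that snippet; `answer_markdown` is a hypothetical helper, and it may not correspond to the dedicated Markdown output mode the sentence above refers to. Because the snippet shows the field as a list while the field table describes it as a string, the helper accepts both.

```python
def answer_markdown(response):
    """Return the Markdown answer from a parsed response, or "" if absent.

    Illustrative helper; assumes the structure of the JSON snippet above.
    """
    results = response.get("results") or []
    if not results:
        return ""

    content = results[0].get("content")
    # Assumption: unparsed output puts a non-dict value (e.g. HTML text) here.
    if not isinstance(content, dict):
        return ""

    markdown = content.get("answer_results_md", "")
    # The snippet shows a list of parts; the field table says string.
    if isinstance(markdown, list):
        return "\n\n".join(str(part) for part in markdown)
    return markdown or ""
```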

JSON output structure

Structured Perplexity scraper output includes fields such as url, model, answer_results, and more. The table below breaks down the page elements we parse, along with descriptions, data types, and relevant metadata.

Note: The number of items and fields for a specific result type may vary depending on the submitted prompt.

| Field | Description | Type |
| --- | --- | --- |
| url | The URL of Perplexity's conversation. | string |
| page | Page number. | integer |
| content | An object containing parsed Perplexity page data. | object |
| model | Perplexity model used to generate the answer. | string |
| prompt_query | The original prompt submitted to Perplexity. | string |
| displayed_tabs | Tabs displayed in Perplexity's interface (e.g., shopping, images). | list |
| answer_results | The complete Perplexity response containing text or nested content. | list/string |
| answer_results_md | The entire answer rendered in Markdown format. | string |
| related_queries | A list of queries related to the main prompt. | list |
| top_images | A list of top images with their titles and URLs. | array |
| top_sources | A list of top cited sources with their titles, sources, and URLs. | array |
| inline_products | A list of inline products with titles, prices, links, and other metadata. | array |
| additional_results.hotels_results | A list of hotels with titles, URLs, addresses, and other hotel details. | array |
| additional_results.places_results | A list of places with titles, URLs, coordinates, and other metadata. | array |
| additional_results.videos_results | A list of videos with thumbnails, titles, URLs, and sources. | array |
| additional_results.shopping_results | A list of shopping items with titles, prices, URLs, and other product metadata. | array |
| additional_results.sources_results | A list of cited sources with their titles and URLs. | array |
| additional_results.images_results | A list of related images with titles, image URLs, and source page URLs. | array |
| parse_status_code | Status code of the parsing operation. | integer |
| created_at | The timestamp when the scraping job was created. | timestamp |
| updated_at | The timestamp when the scraping job was finished. | timestamp |
| job_id | The ID of the scraping job. | string |
| geo_location | Proxy location from which the prompt was submitted. | string |
| status_code | The status code of the scraping job. More info. | integer |
| parser_type | The type of the parser used for breaking down the HTML content. | string |
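Since the number of fields can vary per prompt, it may help to copy the ones you rely on into a typed record with safe defaults. The following is a minimal sketch under stated assumptions: `ResultSummary` and `summarize` are hypothetical names, and treating `status_code` 200 together with `parse_status_code` 12000 as success is inferred only from the sample output, not from a documented rule.

```python
from dataclasses import dataclass, field


@dataclass
class ResultSummary:
    """A small subset of the documented fields (illustrative, not exhaustive)."""
    job_id: str
    status_code: int
    parse_status_code: int
    model: str
    prompt_query: str
    related_queries: list = field(default_factory=list)

    @property
    def ok(self):
        # Assumption: the values seen in the sample output indicate success.
        return self.status_code == 200 and self.parse_status_code == 12000


def summarize(result):
    """Build a ResultSummary from one item of the "results" list."""
    content = result.get("content")
    # Fall back to an empty dict when content is missing or not parsed.
    if not isinstance(content, dict):
        content = {}
    return ResultSummary(
        job_id=str(result.get("job_id") or ""),
        status_code=int(result.get("status_code") or 0),
        parse_status_code=int(content.get("parse_status_code") or 0),
        model=content.get("model") or "",
        prompt_query=content.get("prompt_query") or "",
        related_queries=list(content.get("related_queries") or []),
    )
```

Calling `summarize(response["results"][0])` on the sample output would give a record whose `ok` property is true; a missing or unparsed `content` yields defaults instead of a `KeyError`.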

Additional results and inline products

Along with the main AI response, the Perplexity scraper can return extra data under additional_results, such as:

  • images_results
  • sources_results
  • shopping_results
  • videos_results
  • places_results
  • hotels_results

These arrays are extracted from the tabs on the original results page and are included only if relevant content is available:

[Image: Perplexity tabs]

Moreover, the inline_products array contains products that are directly embedded in the response:

[Image: Embedded results]
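Because these arrays appear only when relevant content exists, code that reads them should not assume any key is present. Here is a hedged sketch: `collect_extras` is a hypothetical helper, and it assumes `additional_results` and `inline_products` sit inside the `content` object, as the field table suggests.

```python
ADDITIONAL_KEYS = (
    "images_results",
    "sources_results",
    "shopping_results",
    "videos_results",
    "places_results",
    "hotels_results",
)


def collect_extras(content):
    """Return only the non-empty extra arrays found in a parsed `content` dict.

    Illustrative helper; key names come from the list above.
    """
    additional = content.get("additional_results") or {}
    # Keep a key only if it exists and holds at least one item.
    extras = {key: additional[key] for key in ADDITIONAL_KEYS if additional.get(key)}
    if content.get("inline_products"):
        extras["inline_products"] = content["inline_products"]
    return extras
```

The result maps each available array name to its items, so `len(extras.get("shopping_results", []))` is safe even when the shopping tab was not shown.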

Practical Perplexity scraper use cases

  1. AI content auditing: Compare quality, consistency, and reliability of Perplexity-generated responses.
  2. Research tracking: Monitor how Perplexity summarizes or interprets information across time.
  3. SEO performance comparison: Track your brand mentions and content rankings to optimize your visibility strategies.
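For the brand-mention use case, a simple starting point is to count how often each brand name appears in the answer text. This is only a sketch of one possible approach: `count_brand_mentions` is a hypothetical helper, and case-insensitive whole-word matching is an assumption that may need tuning for brand names ending in punctuation.

```python
import re
from collections import Counter


def count_brand_mentions(answer_text, brands):
    """Count case-insensitive whole-word occurrences of each brand name."""
    counts = Counter()
    for brand in brands:
        # re.escape keeps characters such as "." or "+" in a name literal.
        pattern = r"\b" + re.escape(brand) + r"\b"
        counts[brand] = len(re.findall(pattern, answer_text, flags=re.IGNORECASE))
    return counts
```

The text passed in could be the Markdown answer from `answer_results_md`; running the same prompt over time and storing the counts gives a basic visibility trend.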

Why choose Oxylabs?

  • Superior success rates: Experience the most reliable scraping even on high-profile and dynamic AI-driven sources.
  • Maintenance-free: Our API handles all the infrastructure, from proxy management to IP rotation and anti-bot systems.
  • Dedicated support: Get expert help whenever needed, from integration to debugging.

FAQ

Is scraping Perplexity AI allowed?

Perplexity does not provide a public API for all its features, so scraping falls into a gray area depending on its Terms of Service. We recommend reviewing their policies carefully and ensuring compliance. Oxylabs provides the technical capability, but it’s up to you to use it responsibly and in line with applicable regulations.

Does the scraper return only JSON?

No, the Perplexity scraper can return multiple formats depending on your needs. The scraper can return results as raw HTML, structured JSON, Markdown output, website PNG, or capture XHR/Fetch requests.

What’s the recommended timeout for real-time requests?

Since Perplexity responses are dynamically generated, requests can take longer than standard web scraping. We recommend setting a timeout of at least 180 seconds when using the Realtime integration method to avoid incomplete results. For larger or more complex prompts, consider asynchronous methods like Push-Pull.
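An asynchronous flow generally means submitting a job and then checking on it until it finishes. The polling half can be sketched independently of the HTTP details: `wait_for_job` and `get_status` below are hypothetical names, `get_status` stands for whatever job-status request the Push-Pull documentation specifies, and the status strings "done" and "faulted" are assumptions that should be checked against that documentation.

```python
import time


def wait_for_job(get_status, interval=5.0, max_wait=600.0, sleep=time.sleep):
    """Poll `get_status()` until the job finishes or `max_wait` seconds pass.

    `get_status` is a caller-supplied callable returning a status string.
    The terminal values "done" and "faulted" are assumptions of this sketch.
    """
    waited = 0.0
    while True:
        status = get_status()
        if status in ("done", "faulted"):
            return status
        if waited >= max_wait:
            raise TimeoutError(f"job still '{status}' after {waited:.0f}s")
        # Pause between checks so the status endpoint is not hammered.
        sleep(interval)
        waited += interval
```

Injecting `sleep` keeps the helper easy to test; in real use the default `time.sleep` applies. If you pass `callback_url` in the request, polling may not be needed at all.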

Learn more

For a deeper dive into available parameters, advanced integrations, and additional examples, check out the Perplexity Scraper documentation.

Contact us

If you have questions or need support, reach out to us at hello@oxylabs.io or through our live chat.
