Your Search FoundationSupercharged.

Models

Contact

Our Customers

Newsroom

Featured

Tech blog

A Practical Guide to Implementing DeepSearch/DeepResearch

February 25, 2025 • 19 minutes read

Tech blog

Snippet Selection and URL Ranking in DeepSearch/DeepResearch

March 12, 2025 • 11 minutes read

Tech blog

Long-Context Embedding Models are Blind Beyond 4K Tokens

March 07, 2025 • 14 minutes read

RSS

For Better Search

Our frontier models form the search foundation for high-quality enterprise search and RAG systems.

Start instantly—no credit card or registration needed!

We are SOC 2 Type 1 & 2 compliant with the American Institute of Certified Public Accountants (AICPA).

User.jina.ai to read a URL and fetch its content

Uses.jina.ai to search the web and get SERP

Parameters

The target URL to fetch content from

Add API Key for Higher Rate Limit

Enter your Jina API key to access a higher rate limit. For latest rate limit information, please refer to the table below.

Learn more

Browser Engine (Quality/Speed)

Choose the browser engine for fetching the webpage content. This affects the quality, speed, completeness, accessibility of the content.

Default

Content Format

You can control the level of detail in the response to prevent over-filtering. The default pipeline is optimized for most websites and LLM input.

Default

JSON Response

The response will be in JSON format, containing the URL, title, content, and timestamp (if available). In Search mode, it returns a list of five entries, each following the described JSON structure.

Timeout

Maximum page load wait time, use this if you find default browser engine is too slow on simple webpage.

Token Budget

Limits the maximum number of tokens used for this request. Exceeding this limit will cause the request to fail.

Use ReaderLM-v2

Experimental

Uses ReaderLM-v2 for HTML to Markdown conversion, to deliver high-quality results for websites with complex structures and contents. Costs 3x tokens!

Learn more

CSS Selector: Only

List of CSS selectors to target specific page elements.

body

.class

#id

CSS Selector: Wait-For

CSS selectors to wait for before returning results.

body

.class

#id

CSS Selector: Excluding

CSS selectors for elements to remove (headers, footers, etc.).

header

.class

#id

Remove All Images

Remove all images from the response.

Gather All Links At the End

A "Buttons & Links" section will be created at the end. This helps the downstream LLMs or web agents navigating the page or take further actions.

None

Gather All Images At the End

An "Images" section will be created at the end. This gives the downstream LLMs an overview of all visuals on the page, which may improve reasoning.

None

Viewport Config

POST

Sets browser viewport dimensions for responsive rendering.

Learn more

Forward Cookie

Our API server can forward your custom cookie settings when accessing the URL, which is useful for pages requiring extra authentication. Note that requests with cookies will not be cached.

Learn more

<cookie-name>=<cookie-value>

<cookie-name-1>=<cookie-value>; domain=<cookie-1-domain>

Image Caption

Captions all images at the specified URL, adding 'Image [idx]: [caption]' as an alt tag for those without one. This allows downstream LLMs to interact with the images in activities such as reasoning and summarizing.

Use a Proxy Server

Our API server can utilize your proxy to access URLs, which is helpful for pages accessible only through specific proxies.

Learn more

Use a Country-Specific Proxy Server

Set country code for location-based proxy server. Use 'auto' for optimal selection or 'none' to disable.

Bypass Cached Content

Our API caches URL contents for a certain amount of time. Set it to true to ignore the cached result and fetch the content from the URL directly.

Do Not Cache & Track!

When enabled, the requested URL won't be cached and tracked on our server.

Github Flavored Markdown

Opt in/out features from GFM (Github Flavored Markdown).

Enabled

Stream Mode

Stream mode is beneficial for large target pages, allowing more time for the page to fully render. If standard mode results in incomplete content, consider using Stream mode.

Learn more

Customize Browser Locale

Control the browser locale to render the page. Lots of websites serve different content based on the locale.

Learn more

Strictly comply robots policy

Define bot User-Agent to check against robots.txt before fetching content.

iframe Extraction

Processes content from all embedded iframes in the DOM tree.

Shadow DOM Extraction

Extracts content from all Shadow DOM roots in the document.

Follow Redirect

Choose whether to resolve to the final destination URL after following all redirects. Enable to follow the full redirect chain.

Local PDF/HTML file

POST

Use Reader on your local PDF and HTML file by uploading them. Only support pdf and html files. For HTML, please also specify a reference URL for better parsing related CSS/JS scripts.

Pre-run JavaScript

POST

Executes preprocessing JS code (inline string or remote URL).

Learn more

Heading Style

Sets markdown heading format (passed to Turndown).

Number Sign Headings

Horizontal Rule Style

Defines markdown horizontal rule format (passed to Turndown).

Bullet Point Style

Sets bullet list marker character (passed to Turndown).

Emphasis Style

Defines markdown emphasis delimiter (passed to Turndown).

Strong Emphasis Style

Sets markdown strong emphasis delimiter (passed to Turndown).

Link Style

Determines markdown link format (passed to Turndown).

Inline

EU Compliance

All infrastructure and data processing operations reside entirely within EU jurisdiction.

Request

GET

Bash

Language

curl https://r.jina.ai/https://example.com

API key

Available tokens

This is your unique key. Store it securely!

Our Publications

Understand how our frontier search models were trained from scratch, check out our latest publications. Meet our team at EMNLP, SIGIR, ICLR, NeurIPS, and ICML!