lukasschwab/arxiv.pyPublic

NotificationsYou must be signed in to change notification settings
Fork147
Star1.4k

Python wrapper for the arXiv API

License

MIT license

1.4k stars 147 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 154 Commits
.github		.github
arxiv		arxiv
tests		tests
.gitignore		.gitignore
.python-version		.python-version
LICENSE.txt		LICENSE.txt
Makefile		Makefile
README.md		README.md
requirements.txt		requirements.txt
ruff.toml		ruff.toml
setup.cfg		setup.cfg
setup.py		setup.py

Repository files navigation

arxiv.py

Python wrapper forthe arXiv API.

arXiv is a project by the Cornell University Library that provides open access to 1,000,000+ articles in Physics, Mathematics, Computer Science, Quantitative Biology, Quantitative Finance, and Statistics.

Usage

Installation

$ pip install arxiv

In your Python script, include the line

importarxiv

Examples

Fetching results

importarxiv# Construct the default API client.client=arxiv.Client()# Search for the 10 most recent articles matching the keyword "quantum."search=arxiv.Search(query="quantum",max_results=10,sort_by=arxiv.SortCriterion.SubmittedDate)results=client.results(search)# `results` is a generator; you can iterate over its elements one by one...forrinclient.results(search):print(r.title)# ...or exhaust it into a list. Careful: this is slow for large results sets.all_results=list(results)print([r.titleforrinall_results])# For advanced query syntax documentation, see the arXiv API User Manual:# https://arxiv.org/help/api/user-manual#query_detailssearch=arxiv.Search(query="au:del_maestro AND ti:checkerboard")first_result=next(client.results(search))print(first_result)# Search for the paper with ID "1605.08386v1"search_by_id=arxiv.Search(id_list=["1605.08386v1"])# Reuse client to fetch the paper, then print its title.first_result=next(client.results(search_by_id))print(first_result.title)

Fetching results with a custom client

importarxivbig_slow_client=arxiv.Client(page_size=1000,delay_seconds=10.0,num_retries=5)# Prints 1000 titles before needing to make another request.forresultinbig_slow_client.results(arxiv.Search(query="quantum")):print(result.title)

Logging

To inspect this package's network behavior and API logic, configure aDEBUG-level logger.

>>>import logging, arxiv>>> logging.basicConfig(level=logging.DEBUG)>>> client= arxiv.Client()>>> paper=next(client.results(arxiv.Search(id_list=["1605.08386v1"])))INFO:arxiv.arxiv:Requesting 100 results at offset 0INFO:arxiv.arxiv:Requesting page (first: False, try: 0): https://export.arxiv.org/api/query?search_query=&id_list=1605.08386v1&sortBy=relevance&sortOrder=descending&start=0&max_results=100DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): export.arxiv.org:443DEBUG:urllib3.connectionpool:https://export.arxiv.org:443 "GET /api/query?search_query=&id_list=1605.08386v1&sortBy=relevance&sortOrder=descending&start=0&max_results=100&user-agent=arxiv.py%2F1.4.8 HTTP/1.1" 200 979

Types

Client

AClient specifies a reusable strategy for fetching results from arXiv's API. For most use cases the default client should suffice.

Clients configurations specify pagination and retry logic.Reusing a client allows successive API calls to use the same connection pool and ensures they abide by the rate limit you set.

Search

ASearch specifies a search of arXiv's database. UseClient.results to get a generator yieldingResults.

Result

TheResult objects yielded byClient.results include metadata about each paper and helper methods for downloading their content.

The meaning of the underlying raw data is documented in thearXiv API User Manual: Details of Atom Results Returned.

Result also exposes helper methods for downloading papers:Result.download_pdf andResult.download_source.

About

Python wrapper for the arXiv API

Releases30

2.3.1 Latest

Nov 13, 2025

+ 29 releases

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

arxiv.py

Usage

Installation

Examples

Fetching results

Fetching results with a custom client

Logging

Types

Client

Search

Result

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases30

Packages

Used by14.3k

Contributors21

Uh oh!

Languages

Movatterモバイル変換

License

lukasschwab/arxiv.py

Folders and files

Latest commit

History

Repository files navigation

arxiv.py

Usage

Installation

Examples

Fetching results

Fetching results with a custom client

Logging

Types

Client

Search

Result

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases30

Packages0

Used by14.3k

Contributors21

Uh oh!

Languages

Packages