A screenshot of the arXiv taken in 1994,[9] using the browserNCSA Mosaic. At the time,HTML forms were a new technology.arXiv's yearly submission rate growth over 30 years since its beginning with topics labeled by the standard abbreviations used on arxiv.org[10]
arXiv was made possible by the compactTeX file format, which allowed scientific papers to be easily transmitted over theInternet and renderedclient-side.[11] Around 1990,Joanne Cohn began emailingphysics preprints to colleagues as TeX files, but the number of papers being sent soon filled mailboxes to capacity.[12]Paul Ginsparg recognized the need for central storage, and in August 1991 he created a centralrepository mailbox stored at theLos Alamos National Laboratory (LANL) that could be accessed from any computer.[13] Additional modes of access were soon added:FTP in 1991,Gopher in 1992, and theWorld Wide Web in 1993.[5][14] The terme-print was quickly adopted to describe the articles.
It began as a physics archive, called theLANL preprint archive, but soon expanded to include astronomy, mathematics, computer science, quantitative biology and, most recently, statistics. Its originaldomain name was xxx.lanl.gov. Due to LANL's lack of interest in the rapidly expanding technology, in 2001 Ginsparg changed institutions toCornell University and changed the name of the repository to arXiv.org.[15]Ginsparg brainstormed the new name with his wife; the domain "archive" was already claimed, so "chi" was replaced with "X" standing in as theGreek letter chi and the "e" dropped for symmetry around the "X".[16]
arXiv was an early adopter and promoter ofpreprints.[17] Its success in sharing preprints was one of the precipitating factors that led to the later movement inscientific publishing known asopen access.[17]Mathematicians and scientists regularly upload their papers to arXiv.org for worldwide access[18] and sometimes for reviews before they are published inpeer-reviewedjournals. Ginsparg was awarded aMacArthur Fellowship in 2002 for his establishment of arXiv.[19] The annual budget for arXiv was approximately $826,000 for 2013 to 2017, funded jointly by Cornell University Library, theSimons Foundation (in both gift andchallenge grant forms) and annual fee income from member institutions.[20] This model arose in 2010, when Cornell sought to broaden the financial funding of the project by asking institutions to make annual voluntary contributions based on the amount of download usage by each institution. Each member institution pledges a five-year funding commitment to support arXiv. Based on institutional usage ranking, the annual fees are set in four tiers from $1,000 to $4,400. Cornell's goal is to raise at least $504,000 per year through membership fees generated by approximately 220 institutions.[21]
In September 2011, Cornell University Library took overall administrative and financial responsibility for arXiv's operation and development. Ginsparg was quoted in theChronicle of Higher Education as joking that it "was supposed to be athree-hour tour, not a life sentence".[22] However, Ginsparg remains on the arXiv's Scientific Advisory Board and its Physics Advisory Committee.[23][24]
In January 2022, arXiv began assigningDOIs to articles, in collaboration withDataCite.[25]
arch-ive/YYMMNNN for older papers, e.g.hep-th/9901001.
Different versions of the same paper are specified by a version number at the end. For example,1709.08980v1. If no version number is specified, the default is the latest version.
arXiv uses a category system. Each paper is tagged with one or more categories. Some categories have two layers. For example,q-fin.TR is the "Trading and Market Microstructure" category within "quantitative finance". Other categories have one layer. For example,hep-ex is "high energy physics experiments".
Although arXiv is notpeer reviewed, a collection of moderators for each area review thesubmissions; they may recategorize any that are deemed off-topic,[26] or reject submissions that are not scientific papers, or sometimes for undisclosed reasons.[27] The lists of moderators for many sections of arXiv are publicly available,[28] but moderators for most of the physics sections remain unlisted.
Additionally, an "endorsement" system was introduced in 2004 as part of an effort to ensure content is relevant and of interest to current research in the specified disciplines.[29] Under the system, for categories that use it, an author must be endorsed by an established arXiv author before being allowed to submit papers to those categories. Endorsers are not asked to review the paper for errors but to check whether the paper is appropriate for the intended subject area.[26] New authors from recognized academic institutions generally receive automatic endorsement, which in practice means that they do not need to deal with the endorsement system at all. However, the endorsement system has attracted criticism for allegedly restricting scientific inquiry.[30][31]
A majority of thee-prints are also submitted tojournals for publication, but some work, including some very influential papers, remain purely as e-prints and are never published in a peer-reviewed journal. A well-known example of the latter is an outline of a proof ofThurston's geometrization conjecture, including thePoincaré conjecture as a particular case, uploaded byGrigori Perelman in November 2002.[32] Perelman appears content to forgo the traditional peer-reviewed journal process, stating: "If anybody is interested in my way of solving the problem, it's all there [on the arXiv] – let them go and read about it".[33] Despite this non-traditional method of publication, other mathematicians recognized this work by offering theFields Medal andClay Mathematics Millennium Prizes to Perelman, both of which he refused.[34]
While arXiv does contain some dubious e-prints, such as those claiming to refute famous theorems or proving famous conjectures such asFermat's Last Theorem using only high-school mathematics, a 2002 article which appeared inNotices of the American Mathematical Society described those as "surprisingly rare".[35] arXiv generally re-classifies these works, e.g. in "General mathematics", rather than deleting them;[36] however, some authors have voiced concern over the lack of transparency in the arXiv screening process.[27]
It has been reported that 14,000 preprints have been withdrawn at arXiv, most commonly due to "crucial errors".[37] A lesser number of the withdrawals were due to the preprint being subsumed by another publication. The report itself was posted at arXiv December, 2024.
Papers can be submitted in any of several formats, includingLaTeX, andPDF printed from aword processor other than TeX or LaTeX. Thesubmission is rejected by the arXiv software if generating the finalPDF file fails, if any image file is too large, or if the total size of the submission is too large. arXiv now allows one to store and modify an incomplete submission, and only finalize the submission when ready. The time stamp on the article is set when the submission is finalized.
A screenshot of viewing a paper's abstract on arxiv.org in 2021
The standard access route is through the arXiv.org website, which is publicly accessible and does not require an account. Other interfaces and access routes have also been created by other un-associated organisations.
Metadata for arXiv is made available throughOAI-PMH, the standard foropen access repositories.[38] Content is therefore indexed in all major consumers of such data, such asBASE,CORE andUnpaywall. As of 2020, the Unpaywall dump links over 500,000 arxiv URLs as theopen access version of a work found inCrossRef data from the publishers, making arXiv a top 10 global host ofgreen open access.
Finally, researchers can select sub-fields and receive daily e-mailings orRSS feeds of all submissions in them.
Some are copyright to the publisher, but the author has the right to distribute them and has given arXiv a non-exclusive irrevocable license to distribute them.
Most are copyright to the author, and arXiv has only a non-exclusive irrevocable license to distribute them.
^Greechie, Richard; Pulmannova, Sylvia; Svozil, Karl (July 2005), "Preface to the Proceedings of Quantum Structures 2002",International Journal of Theoretical Physics,44 (7):691–692,Bibcode:2005IJTP...44..691G,doi:10.1007/s10773-005-7053-z,S2CID121442106,The new endorsement system may contribute to an effective barrier, a digital divide