
Anna's Archive

From Wikipedia, the free encyclopedia
Shadow library search engine

Anna's Archive
[Image: Anna's Archive homepage (January 15, 2025)]
Founder(s): Anna Archivist, Pirate Library Mirror
Commercial: No
Registration: Optional
Launched: November 2022

Anna's Archive is an open-source search engine for shadow libraries that was launched by the pseudonymous Anna shortly after law enforcement efforts to shut down Z-Library in 2022. The site aggregates records from Z-Library, Sci-Hub, and Library Genesis (LibGen), among other sources. It calls itself "the largest truly open library in human history",[† 1] and has said it aims to "catalog all the books in existence" and "track humanity's progress toward making all these books easily available in digital form". It claims not to be liable for downloads of copyrighted works, since the site indexes metadata but does not directly host any files, instead linking to third-party downloads. It has nonetheless faced government blocks and legal action from copyright holders and publishing trade associations for engaging in large-scale copyright infringement.

Origins


Anna's Archive emerged out of the Pirate Library Mirror (PiLiMi) project, an anonymous effort to mirror shadow libraries that completed a full copy of Z-Library in September 2022. PiLiMi acknowledged that it "deliberately violated the copyright law in most countries".[1][2] The project's initial focus was on preservation rather than on making its data searchable.[3] Days after US law enforcement seized several Z-Library domains and arrested its alleged operators in November 2022, PiLiMi member Anna (also known as Anna Archivist) launched Anna's Archive, which initially displayed results from Z-Library and LibGen.[1][2][4][5]

Website and operations


Anna's Archive has been variously described as a search engine,[4] a metasearch engine,[1] and a shadow library itself.[2] The site does not itself host any files (which it claims makes it nonliable for downloads of copyrighted works), but it links to third-party downloads provided by anonymous partners.[† 1][6][7] It also offers downloads through the IPFS protocol.[a][1][8] Its source code is dedicated to the public domain under the CC0 license.[† 3] It operates three mirror sites under different top-level domains, currently .li, .se, and .org.[† 1]

The site's "source libraries" include LibGen, Sci-Hub, Z-Library, the Internet Archive (including "Borrowing Unavailable" items), DuXiu, MagzDB, Nexus/STC, and HathiTrust; Open Library, WorldCat, and Google Books are listed as metadata-only sources.[† 4] Some of these datasets are already publicly accessible, while others are scraped or otherwise privately acquired for distribution.[† 4][9] They are then released in bulk[b] with torrent files so as to make them resilient to website takedowns.[† 1] As of July 2025, Anna's Archive includes 52,875,045 books and 98,598,895 papers,[† 1][failed verification] and its unified list of torrents totals roughly 1.1 petabytes in size.[† 6]

A 2025 study comparing the coverage of conventional library databases to various alternatives (including scholarly search engines, other web-based databases, academic social networks, and piracy sites) found that Anna's Archive had among the most comprehensive full-text coverage, but criticized it for having an unintuitive interface.[10] In March 2025, it averaged over 650,000 daily downloads, roughly 10 times the estimated distribution of the New York Public Library.[11]
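As a rough check on the scale of the download comparison, the two figures quoted above can be combined in a few lines of Python. This is only a back-of-the-envelope sketch: the 650,000 figure is a lower bound ("over"), the factor of 10 is approximate ("roughly"), and the function names and the 365-day year are assumptions made for illustration, not anything stated in the cited study.

```python
def implied_baseline(daily_downloads: int, ratio: int) -> int:
    """Daily figure implied for the comparison library, given 'ratio times' more downloads."""
    return daily_downloads // ratio


def annualized(daily_downloads: int, days: int = 365) -> int:
    """Scale a daily average to a year (assumes the daily average holds all year)."""
    return daily_downloads * days


DAILY_DOWNLOADS = 650_000  # "over 650,000" in the article, treated here as a floor
RATIO = 10                 # "roughly 10 times", an approximation

print(implied_baseline(DAILY_DOWNLOADS, RATIO))  # 65000
print(annualized(DAILY_DOWNLOADS))               # 237250000
```

Under these assumptions, the comparison implies a baseline on the order of 65,000 items per day for the New York Public Library, and more than 237 million downloads per year for Anna's Archive if the March 2025 average held.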

Finances


High-speed downloads on Anna's Archive are only available to users with a paid membership, while nonmembers must use slower options with browser verification to prevent abuse by bots. It describes itself as a nonprofit, claiming that membership fees and donations are mostly spent on server infrastructure and that none are personally used by the site's operators.[† 1] It awards memberships and monetary "bounties" to some volunteer contributors.[† 7]

Anna's Archive offers high-speed access to its full collection via SFTP to groups training large language models (LLMs) in exchange for large contributions of money or data.[12] It said it provided such access to about 30 companies as of January 2025, primarily based in China, including both LLM companies and data brokers.[13][14] DeepSeek's VL model was trained on data from the site.[15] Some lawyers have criticized claims that this constitutes fair use under US copyright law, citing precedent for the importance of market harm.[14]

Motivation


Anna's Archive is a non-profit project with two goals:

1. Preservation: Backing up all knowledge and culture of humanity.

2. Access: Making this knowledge and culture available to anyone in the world.

Anna's Archive, FAQ[† 1]

Anna's Archive has said its objectives are to "catalog all the books in existence" and "track humanity's progress toward making all these books easily available in digital form".[4] It has been described as both continuing and greatly extending the ambitions of earlier shadow libraries with its vision of a "universal library" that preserves as many books as possible. It has been interpreted as part of an ascendant "culture of mistrust towards corporations, institutions, governments, and laws... that perhaps began with the financial collapse of 2008 and the Occupy Wall Street movements" which saw the rise of decentralizing technologies.[11]

Anna has justified their opposition to copyright on ethical grounds, stating that they "believe that preserving and hosting these files is morally right"[11] and that they and other shadow librarians believe that "information wants to be free".[16] They have suggested that copyright law must be reformed as a matter of national security, proposing that Western countries make legal carveouts for text and data mining so as to remain ahead in the AI arms race.[13]

Anna cites programmer and information activist Aaron Swartz as inspiring the project's collection of metadata.[† 1] The site recommends Swartz's writings as well as Stephen Witt's How Music Got Free and Michele Boldrin and David K. Levine's Against Intellectual Monopoly, which criticize existing copyright law and have been associated with the copyleft movement.[11]

Site blocks and legal issues

[Map: countries currently blocking Anna's Archive]

United States


Since 2023, Anna's Archive domains have appeared in the annual Notorious Markets List of the Office of the United States Trade Representative, which highlights digital and physical markets allegedly involved in large-scale intellectual property infringement. These reports describe the site as related to Sci-Hub and LibGen.[17][18][19] In response to a request for comment by the Office on its 2023 List, the Association of American Publishers identified Anna's Archive as an infringing site, and analyzed its cryptocurrency wallets to find that it had received over $29,000 in funds as of July 2023.[20][21]

In response to a March 2024 lawsuit accusing Nvidia of training LLMs on data from a shadow library,[22] the company disputed the characterization of Anna's Archive and other repositories as "shadow libraries", despite Anna's own use of the term.[23][24][relevant?]

OCLC lawsuit


In October 2023, Anna's Archive was reported to have scraped the entirety of WorldCat, the world's largest bibliographic database, and made its proprietary data freely available, which Anna described as "a major milestone in mapping out all the books in the world".[9] OCLC, WorldCat's maintainer, responded by suing the site in an Ohio federal court in January 2024, claiming the scrape was achieved through cyberattacks on its servers.[6] It sought over $5 million in total damages and an injunction to stop Anna's Archive from scraping or sharing its data.[25] OCLC clarified that although its internal systems were not breached, it believes the site's actions legally constitute hacking.[26] The only named defendant denied any involvement with the scrape or Anna's Archive.[27] Technology writer Glyn Moody criticized the suit as "costly and pointless", saying it went against OCLC's stated mission of making information accessible.[28]

In July 2024, in the wake of the suit, the .org mirror of Anna's Archive was replaced with a new .gs mirror to avoid falling under US jurisdiction; however, soon afterward, the .gs domain was suspended and the mirror reverted to the original .org domain.[25][29]

In March 2025, the court deferred judgement on aspects of the case to the Supreme Court of Ohio over concerns about its legal novelty, denying both a motion for default judgement from OCLC and a motion to dismiss from the named defendant.[30] In April, OCLC reached an agreement with the named defendant to drop her from the case, focusing instead on obtaining judgement against the site itself.[31]

Meta lawsuit


In February 2025, internal emails unsealed in a California lawsuit accusing Meta of training its AI models on copyrighted works revealed that the company had downloaded over 81 terabytes of data through Anna's Archive torrents, in addition to data previously downloaded from LibGen. The plaintiffs in the case, a group of authors including Richard Kadrey, Sarah Silverman, and Christopher Golden, alleged that CEO Mark Zuckerberg personally authorized the use of shadow libraries. The company had argued that its use of copyrighted data in AI training constituted fair use.[32][33][34]

In June 2025, the court partially ruled in favor of Meta, finding that the training was "highly transformative" and therefore fair use. Vince Chhabria, the judge in the case, emphasized that the ruling did not mean that Meta's actions were in fact legitimate, but said that the plaintiffs failed to develop strong arguments. He identified "market dilution" as a convincing argument for financial harm that the plaintiffs did not pursue: the idea that "by training generative AI models with copyrighted works, companies are creating something that often will dramatically undermine the market for those works".[35][36][37]

Italy


In January 2024, Italy's national communications agency ordered major internet service providers (ISPs) in the country to block Anna's Archive due to a copyright complaint by the Italian Publishers Association.[38] An investigation by the Digital Services Directorate confirmed the presence of copyrighted works on the site and found that some of its servers were likely owned by a Ukrainian hosting provider, but failed to uncover the identity of its operators.[2]

Netherlands


In March 2024, the Rotterdam District Court ordered major ISPs in the Netherlands to block Anna's Archive and LibGen due to a request by advocacy group BREIN. The order was "dynamic", meaning that if the blocked sites changed domains or IP addresses in the future, ISPs would be obligated to update their blocks.[39][40][41][42]

United Kingdom


In December 2024, the UK Publishers Association won an order from the High Court of Justice requiring major ISPs to block Anna's Archive and other copyright-infringing sites, extending a list of sites blocked since 2015 under section 97A of the Copyright, Designs and Patents Act. The Association said it identified over one million records of copyrighted books and journal articles on Anna's Archive domains.[43][44]

Belgium


In July 2025, a group of organizations representing Belgian authors and copyright holders – including the Association of Belgian Publishers (ADEB), the Civil Society of Multimedia Authors (La Scam), the Cooperative for the Perception and Compensation of Belgian Publishers (Copiebel), Librius, the Educational and Scientific Publishers Group (GEWU), the General Publishers Group (GAU), and the Flemish Authors' Association (VAV) – successfully petitioned the Commercial Court to issue judgement against five alleged piracy sites: Anna's Archive, LibGen, Sci-Hub, Z-Library, and OceanofPDF. The judge ordered FPS Economy's anti-piracy service to block the sites in the interim. In the event of noncompliance, the sites face fines of up to 500,000 euros.[45][46][47][48]

Germany


On October 11, 2025, TorrentFreak reported that major ISPs in Germany had blocked access to the main domains of Anna's Archive. The blockade was initiated by the Clearing Body for Copyright on the Internet (CUII), a coalition of rightsholders and ISPs that coordinates voluntary site blocking measures.[49]

Other issues


Anna's Archive was among Google Search's ten most reported domains for DMCA takedown by June 2024.[50] By November 2025, Google had removed 749 million Anna's Archive URLs from its search results, representing 5 percent of all takedown requests sent to the search engine since 2012. These requests came from over 1,000 authors and publishers.[7] It has been one of the most targeted sites of Dutch anti-piracy service Link-Busters, which sends takedown requests to Google and other search engines on behalf of major publishers.[51][52][53]
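The two takedown figures above imply an overall total that the text does not state directly. A short Python sketch makes the arithmetic explicit. Both inputs are the rounded figures reported in the article, the helper name is hypothetical, and the result should be read as an order-of-magnitude estimate rather than a reported statistic.

```python
def implied_total(part: int, percent: int) -> int:
    """Total implied when 'part' is stated to be 'percent' percent of the whole."""
    return part * 100 // percent


REMOVED_URLS = 749_000_000  # Anna's Archive URLs removed, as reported (rounded)
SHARE_PERCENT = 5           # stated share of all takedown requests since 2012 (rounded)

print(implied_total(REMOVED_URLS, SHARE_PERCENT))  # 14980000000
```

On these rounded inputs, the total number of takedown requests sent to Google since 2012 would be roughly 15 billion.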

In January 2025, the messaging app Telegram suspended the Anna's Archive channel for copyright infringement, despite the operators reportedly taking precautions to avoid infringing posts on the app. Z-Library's Telegram channel was suspended the same week, and neither was alerted of the action. The removals were speculated to be linked to legal action by an Indian court.[54]

Notes

  a. According to Anna's personal blog, they no longer host IPFS themselves because they believe it is not yet suitable for their purposes.[† 2]
  b. According to a post on Anna's blog, the project's data is standardized under the custom Anna's Archive Containers format to allow for incremental releases.[† 5]

References

  1. Van der Sar, Ernesto (April 16, 2024). "'Anna's Archive' Opens the Door to Z-Library and Other Pirate Libraries". TorrentFreak. Retrieved 2024-08-19.
  2. Maxwell, Andy (January 4, 2024). "Silenzio! 'Anna's Archive' Shadow Library Blocked Following Publishers' Complaint". TorrentFreak. Retrieved 2024-12-29.
  3. Booth, Callum (July 4, 2022). "The Pirate Library Mirror wants to preserve all human knowledge... illegally". TNW. Retrieved 2024-10-19.
  4. Manos, Leda (November 22, 2022). "Free Z-Library E-Book Download Search Engine 'Anna's Archive' Launches Amid Arrests". LA Weekly. Retrieved 2024-12-29.
  5. Newson, Georgie (December 14, 2022). "In the Shadow Library". LRB Blog. Retrieved 2025-01-22.
  6. Van der Sar, Ernesto (February 7, 2024). "Lawsuit Accuses Anna's Archive of Hacking WorldCat, Stealing 2.2 TB Data". TorrentFreak. Retrieved 2024-12-30.
  7. Binder, Matt (November 5, 2025). "Google reportedly blocks 749 million Anna's Archive URLs". Mashable. Retrieved 2025-11-07.
  8. Son, Jihun; Kim, Gyubin; Jung, Hyunwoo; Bang, Jewan; Park, Jungheum (October 1, 2023). "IF-DSS: A forensic investigation framework for decentralized storage services". Forensic Science International: Digital Investigation. 46 301611. doi:10.1016/j.fsidi.2023.301611. ISSN 2666-2817.
  9. Van der Sar, Ernesto (October 3, 2023). "Anna's Archive Scraped WorldCat to Help Preserve 'All' Books in the World". TorrentFreak. Retrieved 2024-08-19.
  10. Walters, William H. (July 8, 2025). "Comparing conventional and alternative mechanisms of discovering and accessing the scientific literature". Proceedings of the National Academy of Sciences. 122 (27) e2503051122. Bibcode:2025PNAS..12203051W. doi:10.1073/pnas.2503051122. PMC 12260569. PMID 40591597.
  11. Kozina, Elizaveta; Toson, Christian (March 2025). "Anna, the Universal Library". La Rivista di Engramma (222): 243–259. doi:10.25432/1826-901X/2025.222.0032.
  12. Van der Sar, Ernesto (January 31, 2025). "Pirate Libraries Are Forbidden Fruit for AI Companies. But at What Cost?". TorrentFreak. Retrieved 2025-02-01.
  13. Van der Sar, Ernesto (February 1, 2025). "Anna's Archive Urges AI Copyright Overhaul to Protect National Security". TorrentFreak. Retrieved 2025-02-02.
  14. Poeuymirou, Margaux; Pritt, Maxwell (June 16, 2025). "Why AI's use of shadow libraries should alarm us all". Daily Journal. Retrieved 2025-10-08.
  15. Lu, Haoyu; Liu, Wen; Zhang, Bo; Wang, Bingxuan; Dong, Kai; Liu, Bo; Sun, Jingxiang; Ren, Tongzheng; Li, Zhuoshu (March 11, 2024), DeepSeek-VL: Towards Real-World Vision-Language Understanding, arXiv:2403.05525
  16. Woodcock, Claire (November 30, 2022). "'Shadow Libraries' Are Moving Their Pirated Books to The Dark Web After Fed Crackdowns". VICE. Retrieved 2025-04-15.
  17. Maxwell, Andy (January 31, 2024). "World's Most Notorious Pirate Sites Listed in New USTR Report". TorrentFreak. Retrieved 2025-01-17.
  18. "2023 Review of Notorious Markets for Counterfeiting and Piracy" (PDF). United States Trade Representative. January 30, 2024. Retrieved 2025-01-23.
  19. "2024 Review of Notorious Markets for Counterfeiting and Piracy" (PDF). United States Trade Representative. January 8, 2025. Retrieved 2025-01-23.
  20. Van der Sar, Ernesto (October 13, 2023). "Pirate Sites Exploit 'Interplanetary File System' Gateways, Publishers Warn". TorrentFreak. Archived from the original on 2025-09-10. Retrieved 2025-01-17.
  21. "Comment from Association of American Publishers". Regulations.gov. October 9, 2023. Archived from the original on 2025-09-10. Retrieved 2025-01-17.
  22. Belanger, Ashley (March 11, 2024). "Nvidia sued over AI training data as copyright clashes continue". Ars Technica. Archived from the original on 2025-09-10. Retrieved 2025-01-18.
  23. Belanger, Ashley (May 28, 2024). "Nvidia denies pirate e-book sites are 'shadow libraries' to shut down lawsuit". Ars Technica. Archived from the original on 2025-09-10. Retrieved 2025-01-18.
  24. Van der Sar, Ernesto (May 27, 2024). "NVIDIA Denies Copyright Infringement Claims in Authors' AI Lawsuit". TorrentFreak. Archived from the original on 2025-09-10. Retrieved 2025-01-18.
  25. Van der Sar, Ernesto (July 8, 2024). "Anna's Archive Faces Millions in Damages and a Permanent Injunction". TorrentFreak. Retrieved 2024-12-30.
  26. Price, Gary (February 7, 2024). "Report: 'Lawsuit Accuses Anna's Archive of Hacking WorldCat, Stealing 2.2 TB Data'". Library Journal infoDOCKET. Retrieved 2025-01-20.
  27. Van der Sar, Ernesto. "Key Defendant in Anna's Archive Lawsuit Denies Any Involvement With the Site". TorrentFreak. Retrieved 2024-08-19.
  28. Moody, Glyn (August 21, 2024). "OCLC says 'what is known must be shared', but sues Anna's Archive to stop it sharing knowledge". Walled Culture. Retrieved 2025-01-19.
  29. Van der Sar, Ernesto (July 18, 2024). "Anna's Archive Loses .GS Domain Name But Remains Resilient". TorrentFreak. Retrieved 2024-12-29.
  30. Van der Sar, Ernesto (March 31, 2025). "Anna's Archive Scraping: Court Defers Key Questions to State Supreme Court". TorrentFreak. Retrieved 2025-03-30.
  31. Van der Sar, Ernesto (April 17, 2025). "Alleged Anna's Archive Operator Dropped from U.S. 'Scraping' Lawsuit". TorrentFreak. Retrieved 2025-04-18.
  32. Belanger, Ashley (February 6, 2025). "'Torrenting from a corporate laptop doesn't feel right': Meta emails unsealed". Ars Technica. Retrieved 2025-02-09.
  33. Van der Sar, Ernesto (February 6, 2025). "Meta Torrented over 81 TB of Data Through Anna's Archive, Despite Few Seeders". TorrentFreak. Retrieved 2025-02-09.
  34. Pontefract, Dan. "Authors Challenge Meta's Use Of Their Books For Training AI". Forbes. Retrieved 2025-03-27.
  35. "AG on Meta AI Ruling: Meta Gets a Technical Win, but the Law Favors Authors". Authors Guild. June 26, 2025. Retrieved 2025-07-19.
  36. Nawotka, Ed (June 26, 2025). "Meta Wins AI Copyright Case, But Judge Writes Roadmap for Authors' Revenge". Publishers Weekly. Retrieved 2025-07-19.
  37. Simons, Matt (June 25, 2025). "Judge reluctantly sides with Meta in Sarah Silverman, authors' AI copyright suit". Courthouse News Service. Retrieved 2025-07-19.
  38. Stefanello, Viola (January 12, 2024). "Che fine ha fatto il movimento per il libero accesso alle pubblicazioni accademiche" [What happened to the movement for open access to academic publications?]. Il Post (in Italian). Retrieved 2025-01-19.
  39. Van der Sar, Ernesto (March 23, 2024). "Dutch Court Orders ISP to Block 'Anna's Archive' and 'LibGen'". TorrentFreak. Retrieved 2024-12-29.
  40. "Blokkering shadow libraries bevolen" [Blocking shadow libraries ordered]. BREIN (in Dutch). March 21, 2024. Retrieved 2025-01-17.
  41. "BREIN wil blokkering shadow libraries" [BREIN wants to block shadow libraries]. ICT Magazine (in Dutch). April 4, 2024. Retrieved 2025-01-18.
  42. "Succesvolle toepassing Convenant Blokkeren Websites voor Library Genesis en Anna's Archive" [Successful application of Covenant Blocking Websites for Library Genesis and Anna's Archive]. Recht.nl (in Dutch). April 26, 2024. Retrieved 2025-01-18.
  43. Battersby, Matilda (December 20, 2024). "Publishers Association wins High Court bid ordering internet service providers to block pirate websites". The Bookseller. Retrieved 2025-01-21.
  44. Joynson, Jasmine (December 20, 2024). "Authors and Publishers Win High Court Support in Fight Against Infringement". Publishers Association. Retrieved 2025-01-21.
  45. Bouhadjera, Hocine (July 17, 2025). "LibGen, Z-Library, Anna's Archive... condamnés en Belgique pour piratage de livres". ActuaLitté (in French). Retrieved 2025-07-18.
  46. Sacré, Jean-François (July 17, 2025). "Des associations belges d'éditeurs et de droits d'auteur font condamner des sites pirates". l'Echo (in French). Retrieved 2025-07-18.
  47. Sacré, Jean-François (July 1, 2025). "Associations d'éditeurs et sociétés de gestion de droits s'attaquent à 5 sites pirates". l'Echo (in French). Retrieved 2025-07-18.
  48. Alléaume, Kaelig (July 18, 2025). "La justice belge condamne les sites LibGen, Z-Library et Anna's Archive pour piratage de livres". Archimag (in French). Retrieved 2025-07-19.
  49. Van der Sar, Ernesto (October 11, 2025). "German Pirate Site Blockades Target Anna's Archive, FitGirl and RPG Only". TorrentFreak. Retrieved 2025-10-21.
  50. Van der Sar, Ernesto (June 22, 2024). "Google Search Processed a Billion DMCA Takedowns in Four Months". TorrentFreak. Retrieved 2025-01-18.
  51. Van der Sar, Ernesto (May 31, 2024). "Link-Busters Flagged Over 56 Million 'Pirate' URLs to Google in a Week". TorrentFreak. Retrieved 2025-01-18.
  52. Van der Sar, Ernesto (July 29, 2024). "Link-Busters Sent a Billion DMCA Takedown Requests to Google Search". TorrentFreak. Retrieved 2025-01-18.
  53. Van der Sar, Ernesto (January 17, 2025). "More Than Half of All Google Search Takedowns Now Come from Link-Busters". TorrentFreak. Retrieved 2025-01-18.
  54. Van der Sar, Ernesto (January 15, 2025). "Telegram Shuts Down Z-Library & Anna's Archive Channels Over Copyright Infringement". TorrentFreak. Retrieved 2025-01-16.

Primary sources

  1. "Frequently Asked Questions (FAQ)". Anna's Archive. Retrieved 2025-07-30.
  2. "Putting 5,998,794 books on IPFS". Anna's Blog. November 19, 2022. Retrieved 2025-01-15.
  3. "AnnaArchivist / annas-archive". GitLab. Retrieved 2025-01-23.
  4. "Datasets". Anna's Archive. Retrieved 2025-01-15.
  5. "Anna's Archive Containers (AAC): standardizing releases from the world's largest shadow library". Anna's Blog. August 15, 2023. Retrieved 2025-01-17.
  6. "Torrents". Anna's Archive. Retrieved 2025-05-30.
  7. "Volunteering & Bounties". Anna's Archive. Retrieved 2025-01-18.

External links

Wikimedia Commons has media related to Anna's Archive.
Retrieved from "https://en.wikipedia.org/w/index.php?title=Anna%27s_Archive&oldid=1322908306"