Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

archive.today

From Wikipedia, the free encyclopedia
Web archive
For a guide to using archive.today within Wikipedia, seeHelp:Using archive.today.

archive.today
Screenshot of the archive.today home page
Type of site
Web archiving
Available inMultilingual
URL
RegistrationNo
LaunchedMay 16, 2012; 13 years ago (2012-05-16)[2]

archive.today (formerlyarchive.is) is aweb archiving website that savessnapshots on demand. It has support forJavaScript-heavy sites such asGoogle Maps andTwitter.[3] Archive.today records two snapshots: one replicates the original webpage including any functional live links; the other is ascreenshot of the page.[4]

History

[edit]

Archive.today was founded in 2012. The site originally branded itself as archive.today, but changed the primarymirror to archive.is in May 2015.[5] It began to deprecate the archive.is domain in favor of other mirrors in January 2019.[6]

In 2021, archive.today had saved about 500 million pages.[7]

Features

[edit]
This sectionrelies excessively onreferences toprimary sources. Please improve this section by addingsecondary or tertiary sources.(July 2022) (Learn how and when to remove this message)

Archive.today can capture individual pages in response to explicit user requests.[8][9][10] Since its beginning, it has supportedcrawling pages withURLs containing the now-deprecatedhash-bang fragment (#!).[11]

Archive.today records only text and images, excludingXML,RTF,spreadsheet (xls orods) and othernon-static content. However, videos for certain sites, likeTwitter, are saved.[12] It keeps track of the history of snapshots saved, requesting confirmation before adding a new snapshot of an already saved page.[13][14]

Pages are captured at a browser width of 1,024 pixels.CSS is converted toinline CSS, removingresponsive web design and selectors such as:hover and:active. Content generated usingJavaScript during the crawling process appears in a frozen state.[15]HTML class names are preserved inside theold-classattribute.Whentext is selected, a JavaScript applet generates aURL fragment seen in the browser'saddress bar that automatically highlights that portion of the text when visited again.

Web pages can beduplicated from archive.today toweb.archive.org assecond-level backup, but archive.today does not save its snapshots inWARC format. The reverse—from web.archive.org to archive.today—is also possible,[16] but the copy usually takes more time than a direct capture. Historically, website owners had the option to opt out ofWayback Machine through the use of therobots exclusion standard (robots.txt), and these exclusions were also applied retroactively.[17] Archive.today does not obey robots.txt because it acts "as a direct agent of the human user."[10] As of 2019, theWayback Machine also no longer obeys robots.txt.

The research toolbar enables advanced keywords operators, using* as thewildcard character. A couple ofquotation marks address the search to an exact sequence of keywords present in the title or in the body of the webpage, whereas theinsite operator restricts it to a specific Internet domain.[18]

Once a web page is archived, it cannot be deleted directly by any Internet user.[19]

Removing advertisements, popups or expanding links from archived pages is possible by asking the owner to do it on his blog.[20]

While saving adynamic list, archive.today search box shows only a result that links the previous and the following section of the list (e.g. 20 links for page).[21] The other web pages saved are filtered, and sometimes may be found by one of their occurrences.[13][clarification needed]

The search feature is backed by Google CustomSearch. If it delivers no results, archive.today attempts to utilizeYandex Search.[22]

While saving a page, a list of URLs for individual page elements and their content sizes,HTTP statuses andMIME types is shown. This list can only be viewed during the crawling process.[citation needed]

Users can download archived pages as a ZIP file, except pages archived since 29 November 2019,[update][23] when archive.today changed their browser engine fromPhantomJS toChromium (non-headless).[24]

In July 2013, Archive.today began supporting theAPI of theMemento Project.[25][26]

Worldwide availability

[edit]

Australia and New Zealand

[edit]
See also:Internet censorship in Australia andInternet censorship in New Zealand

In March 2019, the site was blocked for six months by several internet providers inAustralia andNew Zealand in the aftermath of theChristchurch mosque shootings in an attempt to limit distribution of the footage of the attack.[27][28]

China

[edit]
See also:Internet censorship in China

According toGreatFire.org, archive.today has been blocked in mainland China since March 2016,[update][29] archive.li since September 2017,[update][30] archive.fo since July 2018,[update][31] as well as archive.ph since December 2019.[update][32]

Finland

[edit]

On 21 July 2015, the operators blocked access to the service from allFinnishIP addresses, stating on Twitter that they did this in order to avoid escalating a dispute they allegedly had with the Finnish government.[33]

Russia

[edit]
See also:Internet censorship in Russia

In 2016, the Russian communications agencyRoskomnadzor began blocking access to archive.is from Russia.[34][35]

Cloudflare DNS availability

[edit]

Since May 2018[36][37]Cloudflare's1.1.1.1DNS service would not resolve archive.today's web addresses, making it inaccessible to users of the Cloudflare DNS service. Both organizations claimed the other was responsible for the issue. Cloudflare staff stated that the problem was on archive.today's DNS infrastructure, as itsauthoritative nameservers return invalid records when Cloudflare's network systems made requests to archive.today. archive.today countered that the issue was due to Cloudflare requests not being compliant with DNS standards, as Cloudflare does not sendEDNS Client Subnet information in its DNS requests.[38][39]

See also

[edit]

References

[edit]
  1. ^@archiveis (30 October 2019)."a current list of all tor domains and clear net domains" (Tweet) – viaTwitter.
  2. ^"When did the Archive-is site originally launch?".Archive.today Blog. 18 February 2014.Archived from the original on 20 March 2021. Retrieved10 April 2021 – viaTumblr.
  3. ^Brinkmann, Martin (22 April 2015)."Create publicly available web page archives with Archive.is".Ghacks.Archived from the original on 12 April 2019. Retrieved13 June 2015.
  4. ^Brunelle, Justin F.; Kelly, Mat; Weigle, Michele C.; Nelson, Michael L. (25 January 2015)."The impact of JavaScript on archivability"(PDF).International Journal on Digital Libraries.17 (2):95–117.doi:10.1007/s00799-015-0140-8.S2CID 8433375.Archived(PDF) from the original on 27 May 2019.
  5. ^"Why did you change the URL back from archive-today to archive-is?".Archive.is Blog. 3 May 2015.Archived from the original on 1 June 2015. Retrieved6 January 2019.
  6. ^@archiveis (4 January 2019)."Please do not use archive.IS mirror for linking, use others mirrors [.TODAY .FO .LI .VN .MD .PH]. .IS might stop working soon" (Tweet).Archived from the original on 6 January 2019 – viaTwitter.
  7. ^Patokallio, Jani (5 August 2023)."archive.today: On the trail of the mysterious guerrilla archivist of the Internet".Gyrovague.Archived from the original on 13 August 2023. Retrieved1 January 2024.
  8. ^Dascalescu, Dan (18 February 2013)."Web page archiving".Dan Dascalescu's Wiki. Archived fromthe original on 22 September 2013. Retrieved3 October 2013.
  9. ^Koebler, Jason (29 October 2014)."Dear GamerGate: Please Stop Stealing Our Shit".Motherboard.Archived from the original on 27 May 2019. Retrieved22 March 2017.There is no way for a website to protect itself from having an Archive.today user mirror the site.
  10. ^ab"Archive.today FAQ".archive.today. Retrieved15 February 2019.
  11. ^"Home page of Archive.is in 2013". Archived fromthe original on 12 January 2013.
  12. ^"Archive.today blog".Archived from the original on 7 September 2021.
  13. ^abArchiving Websites with the Archive.is, 15 April 2016,archived from the original on 27 January 2022, retrieved27 January 2022
  14. ^"Example snapshot history on archive.is".
  15. ^JavaScript-generated loading animation ofDailymotion videoappearing in a frozen state
  16. ^"Example: Page saved from Web Archive to Archive.is" (in Spanish). Archived fromthe original on 24 March 2019. Retrieved23 October 2019.
  17. ^"FAQs - Some sites are not available because of Robots.txt or other exclusions. What does that mean?".Internet Archive. Archived fromthe original on 15 April 2011.
  18. ^For example, the string insite: https://en.wikipedia.org "World Cup" returns the"World+Cup"/ related snapshots
  19. ^"Some Frequently Asked Question".Archive.today Blog. 24 January 2013.Archived from the original on 26 September 2013. Retrieved12 November 2018 – viaTumblr.
  20. ^"Example user request on the Archive.is blog".Archive.is blog.Archived from the original on 29 April 2022. Retrieved7 April 2022.
  21. ^Example of dynamic list:"au:"thomas aquinas"".WorldCat.Archived from the original on 23 March 2019. Retrieved15 December 2018.
  22. ^"Just realized that I can search for keywords in the search bar for archive today, was this a recently added feature?".Archive.is. 18 January 2022.Archived from the original on 27 January 2022. Retrieved27 January 2022.
  23. ^"The "download zip" button has been giving a "Not found" error for quite some time".Archive.is blog. 17 July 2020.Archived from the original on 3 October 2020.
  24. ^"What scraper or headless browser are you using? it works so well".Archive.is blog. 20 May 2020.Archived from the original on 21 May 2020. Retrieved14 February 2025.
  25. ^Nelson, Michael L. (9 July 2013)."Archive.is Supports Memento".Research and Teaching Updates. Web Science and Digital Libraries Research Group atOld Dominion University.Archived from the original on 27 July 2013. Retrieved17 September 2013.
  26. ^"archive.is".Memento Protocol Information. Memento Development Group. Archived fromthe original on 15 September 2013. Retrieved17 September 2013.
  27. ^"ISPs in AU and NZ start censoring the internet without legal precedent".Private Internet Access. 19 March 2019.Archived from the original on 28 April 2023. Retrieved20 March 2019.
  28. ^"New Zealand ISPs Say They're Blocking Sites That Fail To Remove Christchurch Shooting Video".Gizmodo Australia. 19 March 2019. Archived fromthe original on 18 May 2019. Retrieved20 March 2019.
  29. ^"archive.is is 100% blocked in China".GreatFire Analyzer. 12 August 2018.Archived from the original on 12 August 2018.
  30. ^"archive.li is 100% blocked in China".Great Fire Analyzer. 12 August 2018.Archived from the original on 12 August 2018.
  31. ^"archive.fo is 100% blocked in China".Great Fire Analyzer. 12 August 2018.Archived from the original on 12 August 2018.
  32. ^"archive.ph is 100% blocked in China".en.greatfire.org.Archived from the original on 29 April 2022. Retrieved7 April 2022.
  33. ^Lapintie, Lassi (22 July 2015)."Suomalaisilta estettiin haktivistien suosimalla verkkosivulla käynti" [Finns' access to website used by hacktivists blocked].Iltalehti (in Finnish).Archived from the original on 27 May 2019. Retrieved4 March 2016.
  34. ^Elistratov, Vladimir (29 January 2016)."Roskomnadzor zablokiroval servis archive.is, khranyashchiy kopii veb-saytov"Роскомнадзор заблокировал сервис archive.is, хранящий копии веб-сайтов.TJournal (in Russian).Archived from the original on 30 August 2017. Retrieved30 January 2016.
  35. ^Cushing, Tim (4 February 2016)."Russia Blocks Another Archive Site Because It Might Contain Old Pages About Drugs".Techdirt.Archived from the original on 23 March 2019. Retrieved26 February 2016.
  36. ^"Archive.is – Error 1001".Cloudflare Community. 15 May 2018.Archived from the original on 2 December 2021. Retrieved2 December 2021.
  37. ^"Archive.today & related sites failing again".Cloudflare Community. 3 March 2024.Archived from the original on 3 April 2024. Retrieved20 March 2024.
  38. ^@archiveis (16 July 2018)."'Having to do' is not so direct here. Absence of EDNS and massive mismatch (not only on AS/Country, but even on the continent level) of where DNS and related HTTP requests come from causes so many troubles so I consider EDNS-less requests from Cloudflare as invalid" (Tweet).Archived from the original on 2 August 2023 – viaTwitter.
  39. ^"Comment by Matthew Prince on Hacker News".Hacker News. 4 May 2019. Archived fromthe original on 13 May 2022. Retrieved4 October 2021.

External links

[edit]
Wikimedia Commons has media related toarchive.today.
Concepts
Techniques
By type
Organizations
Lists
Search engines
News
File storage and file sharing
Email and
instant messaging
Social media and forums
Darknet markets
Archives
Activism
Operating systems
Government
Pornography
Other
Retrieved from "https://en.wikipedia.org/w/index.php?title=Archive.today&oldid=1314308878"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp