Archive.today records simultaneously two different 'snapshots' of a web-page. One is "Webpage" which includes any functional live links that are in the original. The other is "Screenshot" which provides a static and non-interactive visualization of the representation.
Archive.today can capture individual pages in response to explicit user requests.
Since its beginning, Archive.Today supports crawling
pages with URLs
containing a now-deprecated hash-bang fragment
Archive.today records only text and images, excluding XML
) and other non-static
content. However, videos for certain sites, like Twitter
, are saved.
It keeps track of the history of snapshots saved, returning to the user a request for confirmation before adding a new snapshot of an already saved Internet address.
Web pages cannot be duplicated
as second-level backup
, as archive.is places an exclusion for Wayback Machine and does not save its snapshots in WARC format. The reverse—from web.archive.org
but the copy usually takes more time than a direct capture. Some web sites get deleted from Internet Archive
's listings retroactively or blocked from being saved due to their robots.txt
file, but Archive.today does not use this.
The research toolbar enables advanced keywords operators, using *
as the wildcard character
. A couple of quotation marks
address the search to an exact sequence of keywords present in the title or in the body of the webpage, whereas the insite
operator restricts it to a specific Internet domain.
Once a web page is archived, it cannot be deleted directly by any Internet user.
While saving a dynamic list
, archive.today searchbox shows only a result that links the previous and the following section of the list (e.g. 20 links for page).
The other web pages saved are filtered, and sometimes may be found by one of their occurrences.
The search feature is backed by Google CustomSearch. If it delivers no results, archive.is attempts to utilize Yandex Search
If a page has already been archived, archive.is asks the user to confirm archiving a new revision, instead of immediately archiving it.
While loading a page, a list of URLs to individual page elements among their content sizes, HTTP statuses
and MIME types
is shown. This list can only be viewed during the crawling process.
One can download archived pages as a ZIP file, except pages archived since 29 November 2019, when Archive.Today changed their browser engine from PhantomJS
Archive.today was founded in 2012. The site originally branded itself as archive.today, but in May 2015, changed the primary mirror to archive.is.
In January 2019, it began to deprecate the archive.is domain in favor of the archive.today mirror.
In March 2019, the site was blocked for six months by several Australian
internet providers in the aftermath of the Christchurch mosque shootings
in an attempt to limit distribution of the footage of the attack.
It is still blocked in Australia as of July, 2021.
According to GreatFire
.org, archive.today has been blocked in China since March 2016,
archive.li since September 2017,
and archive.fo since July 2018.
On 21 July 2015, the operators blocked access to the service from all Finnish IP addresses
, stating on Twitter
that they did this in order to avoid escalating a dispute they allegedly had with the Finnish government.
, only HTTP access is possible; HTTPS connections are blocked.
- ^ Archive.today page with ads at the Wayback Machine (archived 2021-03-07)
- ^ Archive.is blog — When did the Archive-is site originally launch? at archive.today (archived 20 March 2021)
- ^ Archive.is — Викиреальность at archive.today (archived 29 April 2021)
- ^ Brinkmann, Martin (22 April 2015). "Create publicly available web page archives with Archive.is". Ghacks. Archived from the original on 12 April 2019. Retrieved 13 June 2015.
- ^ Dascalescu, Dan (18 February 2013). "Web page archiving – Dan Dascalescu's Wiki (review)". Wiki.dandascalescu.com. Archived from the original on 22 September 2013. Retrieved 3 October 2013.
- ^ Koebler, Jason (29 October 2014). "Dear GamerGate: Please Stop Stealing Our Shit". Motherboard. Archived from the original on 27 May 2019. Retrieved 22 March 2017. There is no way for a website to protect itself from having an Archive.today user mirror the site.
- ^ "archive.is/faq". archive.is. Retrieved 15 February 2019.
- ^ "Home page of Archive.is in 2013". Archived from the original on 12 January 2013. It can save pages from Web 2.0 sites even with hashbang URLs, for example http://twitter.com/#!/medvedevrussia
- ^ "Archive.today blog".
- ^ "Example snapshot history on archive.is".
- ^ "Example: Page saved from Web Archive to Archive.is". Archived from the original on 20 May 2013. Retrieved 23 October 2019.
- ^ "Archive.today FAQ".
- ^ For example, the string insite: https://en.wikipedia.org "World Cup" returns the "World+Cup"/ related snapshots
- ^ "Some Frequently Asked Question". archive.is blog. 24 January 2013. Archived from the original on 26 September 2013. Retrieved 12 November 2018.
- ^ "Example of dynamic list retrieved by Worldcat".
- ^ "Archive.is blog". 17 July 2020. Archived from the original on 3 October 2020.
- ^ Nelson, Michael L. (9 July 2013). "Archive.is Supports Memento". Research and Teaching Updates. Web Science and Digital Libraries Research Group at Old Dominion University. Archived from the original on 27 July 2013. Retrieved 17 September 2013.
- ^ "archive.is". Memento Protocol Information. Memento Development Group. Archived from the original on 15 September 2013. Retrieved 17 September 2013.
- ^ "Why did you change the URL back from archive-today to archive-is?". Archive.is Blog. 3 May 2015. Archived from the original on 1 June 2015. Retrieved 6 January 2019.
- ^ @archiveis (4 January 2019). "Please do not use archive.IS mirror for linking, use others mirrors [.TODAY .FO .LI .VN .MD .PH]. .IS might stop working soon" (Tweet). Archived from the original on 6 January 2019 – via Twitter.
- ^ "ISPs in AU and NZ start censoring the internet without legal precedent". Private Internet Access. 19 March 2019. Retrieved 20 March 2019.
- ^ "New Zealand ISPs Say They're Blocking Sites That Fail To Remove Christchurch Shooting Video". Gizmodo Australia. 19 March 2019. Archived from the original on 18 May 2019. Retrieved 20 March 2019.
- ^ "archive.is is 100% blocked in China". GreatFire Analyzer. 12 August 2018. Archived from the original on 12 August 2018.
- ^ "archive.li is 100% blocked in China". Great Fire Analyzer. 12 August 2018. Archived from the original on 12 August 2018.
- ^ "archive.fo is 100% blocked in China". Great Fire Analyzer. 12 August 2018. Archived from the original on 12 August 2018.
- ^ Lapintie, Lassi (22 July 2015). "Suomalaisilta estettiin haktivistien suosimalla verkkosivulla käynti" [Finns' access to website used by hacktivists blocked]. Iltalehti (in Finnish). Archived from the original on 27 May 2019. Retrieved 4 March 2016.
- ^ Elistratov, Vladimir (29 January 2016). "Roskomnadzor zablokiroval servis archive.is, khranyashchiy kopii veb-saytov" Роскомнадзор заблокировал сервис archive.is, хранящий копии веб-сайтов. TJournal (in Russian). Archived from the original on 30 August 2017. Retrieved 30 January 2016.
- ^ Cushing, Tim (4 February 2016). "Russia Blocks Another Archive Site Because It Might Contain Old Pages About Drugs". Techdirt. Archived from the original on 23 March 2019. Retrieved 26 February 2016.
Last edited on 20 September 2021, at 20:13
Content is available under CC BY-SA 3.0
unless otherwise noted.