Nova Resource:Dumps/Archive.org: Difference between revisions

From Wikitech
Content deleted Content added
Hydriz (talk | contribs)
Creating new page
 
Hydriz (talk | contribs)
+cat
Line 10: Line 10:
# Main database dumps
# Main database dumps
# Full media tarballs (at least for those <10GB)
# Full media tarballs (at least for those <10GB)

[[Category:Dumps]]

Revision as of 07:59, 14 November 2012

Archive.org refers to the Internet Archive, which is a library of stuff, mainly scanned books, but can contain almost anything that is of free content.

We are currently working on moving the public datasets to the Archive for preservation, although right now its mainly being handled by volunteers (specifically Hydriz and Nemo).

Archiving from Labs

There is a project on Wikimedia Labs called "Dumps" that is dedicated to running the archiving processes by volunteers. Currently, the datasets that are being archived are:

  1. Adds/Changes dumps (source)
  2. Incremental media tarballs (source)

Currently not running but is being planned:

  1. Main database dumps
  2. Full media tarballs (at least for those <10GB)