Help:Toolforge/Dumps: Difference between revisions

From Wikitech
Content deleted Content added
→‎Dumps in general: wordsmithing links
→‎Older dumps: add a link to list of mirrors for downloading older dumps
Line 16: Line 16:
== Older dumps ==
== Older dumps ==


* Can be manually downloaded from the [https://dumps.wikimedia.org/ Wikimedia downloads] server.
* Can be manually downloaded from the [https://dumps.wikimedia.org/ Wikimedia downloads] server, or from [[m:Mirroring_Wikimedia_project_XML_dumps#Current_Mirrors|mirrors]] which may have better bandwidth.
* <code>/public/dumps/pagecounts-raw</code> contains some years of the [https://dumps.wikimedia.org/other/pagecounts-ez/ pagecount/projectcount data].
* <code>/public/dumps/pagecounts-raw</code> contains some years of the [https://dumps.wikimedia.org/other/pagecounts-ez/ pagecount/projectcount data].



Revision as of 19:55, 24 September 2019

Help check the accuracy of the information on this page: https://phabricator.wikimedia.org/T233664

Overview

This page contains information about dumps and Toolforge

Dumps generated by Wikimedia projects

Toolforge has access to a directory storing the dumps generated by Wikimedia projects: public Wikimedia datasets.

Recent dumps

The most recent two dumps can be found in:

/public/dumps/public

This directory is read-only, but you can copy the files to your tool's home directory if necessary. Ideally you can find (or build!) a library that can be used to read data from the dumps without decompressing them. See meta:Data dumps/Other tools for some examples.

Older dumps

Dumps in general

Communication and support

Support and administration of the WMCS resources is provided by the Wikimedia Foundation Cloud Services team and Wikimedia movement volunteers. Please reach out with questions and join the conversation:

Discuss and receive general support
Stay aware of critical changes and plans
Track work tasks and report bugs

Use a subproject of the #Cloud-Services Phabricator project to track confirmed bug reports and feature requests about the Cloud Services infrastructure itself

Read stories and WMCS blog posts

Read the Cloud Services Blog (for the broader Wikimedia movement, see the Wikimedia Technical Blog)