Help:Cloud Services introduction Poster-format overview
Wikimedia Cloud Services (
WMCS) provides tools, services, and support for technical collaborators who want to contribute in Wikimedia software projects. Use Cloud Services to host your software tools for the
Wikimedia movement, without charge.
Quarry and
PAWS empower
technically curious to advanced users to query wiki replicas and create scripts, tutorials, and data visualizations to analyze and improve Wikimedia projects.
Toolforge is for
intermediate to advanced users working on tools, bots, webservices that support Wikimedia projects.
Infrastructure as a service
Cloud VPS is for
advanced users who need to administer their own servers for Wikimedia operations and software development.
Toolforge is one of the projects hosted by Wikimedia Cloud VPS. It is a shared hosting (platform as a service) environment for volunteers to develop and run
tools,
continuous bots, web services, scheduled jobs, and data analysis.
To use Toolforge you will need some programming knowledge, an understanding of Unix command line, and version control via
Gerrit and Git.
Users of the Toolforge project create so-called "tool" accounts (technically service groups). These accounts allow one or more users to collaborate to manage the software source code, configuration, and jobs for that tool or bot.
The Toolforge administrators manage a pool of virtual servers which provide a shared project hosting environment that can be used by Toolforge users. These resources include
web servers,
databases and
other data storage, and a
distributed job processing system. These services provide a reliable and scalable hosting environment for volunteers to develop and operate their tools and bots.
Cloud VPS (Virtual Private Server) is a
cloud computing environment powered by
OpenStack. It offers collaboratively owned collections of virtual private servers. You can use this infrastructure to create and maintain open source software projects that help the
Wikimedia movement.
The environment includes access to a variety of data services. Cloud VPS allows developers and system administrators to try out improvements to Wikimedia infrastructure (including MediaWiki), power research and analytics, and host projects that are not viable in the Toolforge environment.
Cloud VPS is for the advanced users to get involved in Wikimedia operations and software development. Cloud VPS contains
many projects, each of which uses one or more instances.
Cloud VPS instances go through a request and approval processes. Instances are not permanent and are reviewed periodically for potential deletion/removal. Cloud VPS instances are resource intensive. Before requesting, explore whether Toolforge or another service will adequately meet your needs.
How is Cloud VPS organized? Cloud VPS is divided into projects. Each project has separate members and administrators who can create and maintain virtual machines ("instances") for use by that project. Each project can have own its own access policies, DNS records, etc.
What is a Cloud VPS project? A project is a unit of privilege separation inside the Cloud VPS environment. Each project has separate management of membership, virtual machines, HTTPS proxies, firewall rules, etc. Examples of projects include
Toolforge and the
Beta Cluster.
Cloud VPS is a virtualization cluster and hosts various virtual machines (called instances) using
OpenStack Compute. This is slightly different from your normal servers that you ssh to (i.e. Toolserver), as virtual machines do not exist physically, but reside inside a much bigger machine called the host machine. More details about the physical setup of Cloud VPS can be found under
Portal:Cloud VPS/Infrastructure.
What is the difference between Cloud VPS and Toolforge? Cloud VPS is an Infrastructure as a service (IaaS) solution. It provides virtual machines, storage, firewall, and HTTPS proxy resources to projects. The members of each individual project are responsible for managing applications, data, runtime, middleware, and operating systems themselves.
Data Services are a collection of products which provide access to copies of Wikimedia's production wiki databases (with private information redacted) and access to
Wikimedia data dumps. Use data services to create replicas of the production databases and other data for analysis and experimentation.
There are also services to interact with data in a web browser: Quarry and PAWS.
Quarry is a public querying interface for
Wiki Replicas, a set of live replica
SQL databases of public Wikimedia wikis. Quarry is designed to make running queries against Wiki Replicas easy. Quarry also allows researchers to share and review each other's queries.
Quarry queries are run by individual users. They can be saved and published and forked by other users.
To use Quarry you need only a Wikimedia login and a web browsers. A basic understanding of SQL is recommended. Learn about
SQL queries.
PAWS is a Jupyter notebook installation hosted by Wikimedia. PAWS notebooks can be used for creating tutorials, running live code, creating data visualizations, running bots using Pywikibot, and more.
PAWS notebooks are maintained by a single user. They can be downloaded and forked by other users.
To use PAWS you need only a Wikimedia login and a web browser. Knowledge of Python is helpful, but not required.
Which service is right for you? Activity / Needs | Quarry (DaaS) | PAWS (DaaS) | Toolforge (PaaS) | Cloud VPS (IaaS) |
Browser based | ✔ | ✔ | | |
Terminal based | | | ✔ | ✔ |
Write queries against replica databases | ✔ | ✔ | ✔ | |
Run database dumps | | ✔ | ✔ | |
Write and run bots | | ✔ | ✔ | |
Run web services | | | ✔ | |
Build tools to improve Wikimedia projects | | | ✔ | |
Schedule or run continuous jobs | | | ✔ | |
Administer your own virtual server | | | | ✔ |
Need your own subdomain | | | ✔ | ✔ |
Write documentation and create tutorials | | ✔ | | |
Work with co-maintainers and co-admins | | | ✔ | ✔ |
User knowledge | curious—advanced | curious—advanced | intermediate—advanced | advanced |
Service concept | Data as a service | Data as a service | Platform as a service | Infrastructure as a service |
Communication and support Discuss and receive general support
Receive mail announcements about critical changes
Learn about major near-term plans
Read news and stories about Wikimedia Cloud Services
From 2011 until early 2017, Wikimedia Cloud Services was known as
Wikimedia Labs. However, the term
Labs was used for
several different things.
Last edited on 19 May 2022, at 18:27
Wikitech
Content is available under CC BY-SA 3.0 unless otherwise noted.