Help:Toolforge/Elasticsearch: Difference between revisions

From Wikitech

Revision as of 15:11, 22 September 2020

Overview

This page contains information about local Elasticsearch services in Toolforge. For information about the replica of the Wikimedia CirrusSearch Elasticsearch indices available from Toolforge and Cloud VPS instances, see CloudElastic.

About Elasticsearch

Elasticsearch is a full-text search system built on Apache Lucene. It can be used to index and search data stored as JSON documents.

Elasticsearch is the technology used to power Wikimedia's CirrusSearch system.

Elasticsearch for Toolforge

An Elasticsearch version 7 cluster is available to all tools at http://elasticsearch.svc.tools.eqiad1.wikimedia.cloud. It listens on port 80 rather than the default Elasticsearch port (9200).

This Elasticsearch cluster is a shared resource. Every document indexed in it can be read anonymously from within Toolforge. Creating indexes and storing or updating documents requires write access, which is granted per tool as a username and password.

Read-only access

The Elasticsearch servers allow anyone to read any of the indexes that they contain. This access is limited to other hosts in the Toolforge project (e.g. the OGE job grid, Kubernetes containers, and the bastion servers).

The Elasticsearch service is available on port 80 at http://elasticsearch.svc.tools.eqiad1.wikimedia.cloud

Note: The default Elasticsearch port (9200) is not used.
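
As a minimal sketch of read-only access, the Python below builds a search URL against the service address given above and sends an unauthenticated GET request. The index name and query text in the usage note are hypothetical, and the sketch assumes the standard Elasticsearch URI search endpoint (`_search` with a `q` parameter). It only works from hosts inside Toolforge.

```python
import json
import urllib.parse
import urllib.request

# Service address from this page; plain http:// implies port 80, not 9200.
ES_BASE = "http://elasticsearch.svc.tools.eqiad1.wikimedia.cloud"


def search_url(index, query, size=10):
    """Build a URI-search URL for one index. Reads need no credentials."""
    params = urllib.parse.urlencode({"q": query, "size": size})
    return f"{ES_BASE}/{urllib.parse.quote(index, safe='')}/_search?{params}"


def search(index, query, size=10, timeout=10):
    """Send the GET request and return the decoded JSON response body."""
    url = search_url(index, query, size)
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return json.load(resp)
```

For example, `search("mytool_pages", "lucene")` would query a hypothetical index named `mytool_pages`; the matching documents are listed under `hits.hits` in the returned JSON.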

Write access

Elasticsearch does not offer multi-tenant access control in its open source version.

PUT, POST, or DELETE requests sent to the Elasticsearch servers require HTTP Basic Authentication using a username and password specific to each tool.
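
The following sketch shows one way to attach HTTP Basic Authentication to a write request using only the Python standard library. The index name, document ID, and credentials are placeholders, and the `/{index}/_doc/{id}` path assumes the standard Elasticsearch 7 document API. The function builds the request without sending it.

```python
import base64
import json
import urllib.request

# Service address from this page (port 80).
ES_BASE = "http://elasticsearch.svc.tools.eqiad1.wikimedia.cloud"


def basic_auth_header(username, password):
    """Return the value of an HTTP Basic Authorization header."""
    raw = f"{username}:{password}".encode("utf-8")
    return "Basic " + base64.b64encode(raw).decode("ascii")


def index_request(index, doc_id, document, username, password):
    """Build (but do not send) a PUT request that stores one JSON document."""
    return urllib.request.Request(
        url=f"{ES_BASE}/{index}/_doc/{doc_id}",
        data=json.dumps(document).encode("utf-8"),
        method="PUT",
        headers={
            "Authorization": basic_auth_header(username, password),
            "Content-Type": "application/json",
        },
    )
```

Passing the returned object to `urllib.request.urlopen` from a Toolforge host would send the request. Without the `Authorization` header, the servers are expected to reject the write.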

To request write access, file a Phabricator task.

Once credentials have been created, they are placed in /data/project/$TOOL/.elasticsearch.ini. Access requests are currently processed manually and may take a few days to be fulfilled.
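
A tool could load its credentials from that file along these lines. This page does not document the layout of the file, so the section name `elasticsearch` and the keys `user` and `password` used below are assumptions; check the file that was actually created and adjust the names to match.

```python
import configparser
import os


def load_credentials(tool, root="/data/project",
                     section="elasticsearch",   # assumed section name
                     user_key="user",           # assumed key name
                     password_key="password"):  # assumed key name
    """Read a (username, password) pair from the tool's .elasticsearch.ini."""
    path = os.path.join(root, tool, ".elasticsearch.ini")
    # Disable interpolation so a literal '%' in a password is read verbatim.
    parser = configparser.ConfigParser(interpolation=None)
    if not parser.read(path):
        raise FileNotFoundError(path)
    values = parser[section]
    return values[user_key], values[password_key]
```

The returned pair can then be supplied as the HTTP Basic Authentication username and password on PUT, POST, or DELETE requests.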


Communication and support

Support and administration of the WMCS resources are provided by the Wikimedia Foundation Cloud Services team and Wikimedia movement volunteers. Please reach out with questions and join the conversation:

Discuss and receive general support
Stay aware of critical changes and plans
Track work tasks and report bugs

Use a subproject of the #Cloud-Services Phabricator project to track confirmed bug reports and feature requests about the Cloud Services infrastructure itself

Read stories and WMCS blog posts

Read the Cloud Services Blog (for the broader Wikimedia movement, see the Wikimedia Technical Blog)