Movatterモバイル変換


[0]ホーム

URL:


Jump to content
MediaWiki
Search

Wikimedia Site Reliability Engineering

From mediawiki.org
Translate this page
Languages:
Site Reliability Engineering
Group:Technology
Team members:In teams:
Collaboration Services
Lukasz Sobanski
Daniel Zahn,Jelto Wodstrcil,Arnold Okoth
Data Center Operations
Willy Pao
Rob Halsell, Chris Johnson, Papaul Tshibamba, Jenn Hancock
Data Persistence
Kwaku Addo Ofori
Manuel Arostegui,Jaime Crespo,Matthew Vernon,Amir Sarabadani,Eric Evans,Federico Ceratto
Infrastructure Foundations
Joanna Boruń
Luca Toscano,Riccardo Coccioli,Chris Danis, Cathal Mooney, Moritz Mühlenhoff, Arzhel Younsi,Jesse Hathaway,Simon Lyngshede
Observability
Filippo Giunchedi, Keith Herron,Cole White,Andrea Denisse Gómez-Martínez,Tiziano Fogli
Service Operations
Alexandros Kosiaris
Giuseppe Lavagetto,Reuven Lazarus,Effie Mouzeli,Janis Meybohm,Clément Goubert,Blake Jensen,Matthieu Lec'hvien,Kavitha Appakayala
Traffic
Kwaku Addo Ofori
Brandon Black,Brett Cornwall, Valentin Gutierrez,Sukhbir Singh,Fabrizio Furnari
Backlog:#sre
Management:Mark Bergsma

TheSite Reliability Engineering team, orSRE for short, is the team responsible for developing and maintaining Wikimedia's production infrastructure. Previously known as Technical Operations, they are in charge of making sure all Wikimedia's sites and services used by the public (including MediaWiki and all associated services) run reliably, securely, and with high performance.

Notify us of emergencies withKlaxon.

#wikimedia-sreconnect

Additional documentation related to our infrastructure and the team's work can be found onWikitech.

The team's structure

Collaboration Services

We are responsible for building and maintaining the infrastructure aspects of the source code management, CI and CD, task and ticket management systems as well as hosting non-MediaWiki websites and other collaboration services.

Data Center Operations

The Data Center Operations team is responsible for all of Wikimedia’s data center deployments and logistics as well as maintaining our presence in locations across the world. They perform on-site work and maintain the full 5-year life cycle (specs, purchasing, physical install, break/fix and decommissioning) for all hardware.

#wikimedia-dcopsconnect

Infrastructure Foundations

The team focuses on building and maintaining our base platform (“metal cloud”) that forms the foundations upon which nearly everything else in our infrastructure builds upon. On top of our bare metal deployments, their responsibilities include (but are not limited to) configuration management systems, infrastructure automation, orchestration tooling, infrastructure security and network operations.

#wikimedia-sre-foundationsconnect

Observability

The Observability team, or "o11y" for short, works across SRE and Technology to provide teams with diagnostic tools, platforms, and insights into how systems and services perform. It leverages technologies such as Grafana, Kibana/Logstash, OpenSearch, Prometheus, AlertManager and more.

#wikimedia-observabilityconnect

Traffic

The Traffic team is responsible for the critical first layer of high-traffic infrastructure which now spans much of the globe, including our TLS termination and caching layers (ATS, Varnish), load balancing, DNS and our own network.

#wikimedia-trafficconnect

Data Persistence

The Data Persistence team focuses on Wikimedia’s persistent data storage and retrieval systems, including (No)SQL databases, (distributed) object storage, file storage and backup systems.

#wikimedia-data-persistenceconnect

Service Operations

The Service Operations team takes care of public and “user-visible” services in close collaboration with both the Technology and Product teams. This includes our MediaWiki platform, the SOA service infrastructure based on Kubernetes, as well as community and developer-facing services like Gitlab, Gerrit, Phabricator and VRTS.

#wikimedia-serviceopsconnect

Contacting the team

If you need to get in touch with the team, there are detailed instructions onwikitech:SRE Team requests.

Retrieved from "https://www.mediawiki.org/w/index.php?title=Wikimedia_Site_Reliability_Engineering&oldid=8030309"
Category:

[8]ページ先頭

©2009-2025 Movatter.jp