reliability
Here are 370 public repositories matching this topic...
Language:All
Sort:Most stars
A powerful flow control component enabling reliability, resilience and monitoring for microservices. (面向云原生微服务的高可用流控防护组件)
- Updated
Oct 24, 2024 - Java
A curated list of Site Reliability and Production Engineering resources.
- Updated
Jun 10, 2024
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
- Updated
Feb 22, 2025 - JavaScript
The most reliable AI agent framework that supports MCP.
- Updated
Jul 18, 2025 - Python
Compilation of public failure/horror stories related to Kubernetes
- Updated
Aug 23, 2020 - HTML
It's just fascinating. How is modern software designed? 🤔 Some design-level considerations for scalability, maintainability eventual consistency, availability & reliability. 👨💻 Interview Prep. 👨💻
- Updated
Feb 21, 2024
Hands on labs and code to help you learn, measure, and build using architectural best practices.
- Updated
Jul 17, 2025 - Python
Chaos Engineering Toolkit & Orchestration for Developers
- Updated
Jul 20, 2024 - Python
A free book about developing secure and robust systems software.
- Updated
Jul 6, 2025 - Rust
A curated list of Site Reliability and Production Engineering Tools
- Updated
Apr 4, 2025
Sample implementations for cloud design patterns found in the Azure Architecture Center.
- Updated
Jul 14, 2025 - C#
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
- Updated
May 21, 2025
An active monitoring software to detect failures before your customers do.
- Updated
Jul 18, 2025 - Go
A hosted disposable email telegram bot; Extremely privacy friendly; Proudly hosted for community.
- Updated
Sep 10, 2024 - Java
An open source Valkey client library that supports Valkey, and Redis open source 6.2, 7.0 and 7.2. Valkey GLIDE is designed for reliability, optimized performance, and high-availability, for Valkey and Redis OSS based applications. GLIDE is a multi language client library, written in Rust with programming language bindings, such as Java and Python
- Updated
Jul 18, 2025 - Java
An always-on framework that performs end-to-end functional network testing for reachability, latency, and packet loss
- Updated
Apr 8, 2024 - Go
A framework for rapid development of reliable asynchronous software.
- Updated
Nov 4, 2020 - C#
An Open-Source Collection of 230+ Flash Cards to Help You Succeed in Your System Design Interview and More 💯
- Updated
Oct 6, 2024
Chaos and resiliency testing tool for Kubernetes with a focus on improving performance under failure conditions. A CNCF sandbox project.
- Updated
Jul 16, 2025 - Python
Improve this page
Add a description, image, and links to thereliability topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thereliability topic, visit your repo's landing page and select "manage topics."