reliability
Here are 710 public repositories matching this topic...
Language:All
Sort:Most stars
A powerful flow control component enabling reliability, resilience and monitoring for microservices. (面向云原生微服务的高可用流控防护组件)
- Updated
Jan 26, 2026 - Java
A curated list of Site Reliability and Production Engineering resources.
- Updated
Aug 28, 2025
A curated collection of publicly available resources on how technology and tech-savvy organizations around the world practice Site Reliability Engineering (SRE)
- Updated
Nov 17, 2025 - JavaScript
Agent Framework For Fintech and Banks
- Updated
Feb 19, 2026 - Python
Compilation of public failure/horror stories related to Kubernetes
- Updated
Aug 23, 2020 - HTML
It's just fascinating. How is modern software designed? 🤔 Some design-level considerations for scalability, maintainability eventual consistency, availability & reliability. 👨💻 Interview Prep. 👨💻
- Updated
Feb 21, 2024
Hands on labs and code to help you learn, measure, and build using architectural best practices.
- Updated
Jan 14, 2026 - Python
Chaos Engineering Toolkit & Orchestration for Developers
- Updated
Jul 20, 2024 - Python
A curated list of Site Reliability and Production Engineering Tools
- Updated
Feb 9, 2026
A free book about developing secure and robust systems software.
- Updated
Jul 6, 2025 - Rust
Sample implementations for cloud design patterns found in the Azure Architecture Center.
- Updated
Feb 17, 2026 - C#
Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models
- Updated
May 21, 2025
An open source Valkey client library that supports Valkey, and Redis open source 6.2, 7.0 and 7.2. Valkey GLIDE is designed for reliability, optimized performance, and high-availability, for Valkey and Redis OSS based applications. GLIDE is a multi language client library, written in Rust with programming language bindings, such as Java and Python
- Updated
Feb 20, 2026 - Java
An active monitoring software to detect failures before your customers do.
- Updated
Feb 20, 2026 - Go
A hosted disposable email telegram bot; Extremely privacy friendly; Proudly hosted for community.
- Updated
Feb 15, 2026 - Java
Chaos and resiliency testing tool for Kubernetes with a focus on improving performance under failure conditions. A CNCF sandbox project.
- Updated
Feb 19, 2026 - Python
An always-on framework that performs end-to-end functional network testing for reachability, latency, and packet loss
- Updated
Apr 8, 2024 - Go
A framework for rapid development of reliable asynchronous software.
- Updated
Nov 4, 2020 - C#
An Open-Source Collection of 230+ Flash Cards to Help You Succeed in Your System Design Interview and More 💯
- Updated
Oct 6, 2024
Improve this page
Add a description, image, and links to thereliability topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thereliability topic, visit your repo's landing page and select "manage topics."