Topic
Cloud / SRE
Operational notes on reliability, incident response, observability, and production infrastructure.
This topic collects field notes on running systems in production: reliability reviews, incident response patterns, observability choices, and operational tradeoffs.
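Recommended reading path
Troubleshooting an iTerm SSH Profile That Suddenly Cannot Log In
A short troubleshooting note on an SSH profile that fails with "too many authentication failures".
Migrating an AWS S3 File Gateway Without Breaking Users' Drive Letter Mappings
A practical pattern for replacing an old S3 File Gateway, rebuilding its cache, and migrating Windows users' drive letter mappings with SSM State Manager.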
Projects
There are no public projects under this topic yet.