Step-by-step playbooks for when things go sideways. Interactive checklists with copy-paste commands.
Diagnose and resolve a full disk on Linux. Covers finding large files, cleaning logs, package caches, and reclaiming space from deleted-but-open files.
Identify and resolve processes causing high CPU utilization on Linux. Covers real-time monitoring, runaway process detection, and safe mitigation options.
Diagnose memory pressure and OOM killer events on Linux. Covers identifying memory-hungry processes, leak detection, cache management, and service recovery.
Diagnose and fix SSH connection failures. Covers network connectivity, sshd configuration, firewall rules, fail2ban, and daemon troubleshooting.
Diagnose and fix a systemd service that fails to start or keeps crashing. Covers unit file inspection, dependency resolution, port conflicts, and manual testing.
Diagnose DNS resolution failures on Linux systems and Kubernetes clusters. Covers resolver config, upstream DNS, systemd-resolved, CoreDNS, and DNSSEC issues.
Diagnose high memory usage on Linux. Covers understanding free vs available memory, finding memory-hungry processes, detecting leaks, managing swap, and cache behavior.
Interactively diagnose and fix a full disk on Linux. Branches into /var issues, Docker cleanup, log rotation, inode exhaustion, and deleted-but-open files.
Diagnose high CPU or memory usage on Linux. Branches into process identification, OOM killer analysis, memory leaks, CPU steal, and swap management with targeted fix commands.