McGarrah Technical Blog

SSH Key-Based Access to a Proxmox Cluster

5 min read
Typing passwords every time you SSH into a Proxmox node gets old fast — especially with a six-node cluster. Here's how to set up SSH key authentication and an SSH config so you can just type 'ssh harlan' and be in.

Hybrid Ceph Storage: SSD WAL/DB Acceleration with USB Drive Data

16 min read
Running Ceph on USB drives sounds crazy until you put the WAL and DB on an SSD. Here's how separating metadata onto a Crucial MX500 transformed my 15-OSD homelab cluster from sluggish to surprisingly capable — at a fraction of all-SSD costs.

When ZFS and Ceph Problems Collide: Diagnosing Overlapping Failures on Proxmox

9 min read
A routine ZFS scrub alert on harlan turned into a multi-hour debugging session when a hostid mismatch fix collided with a pre-existing Ceph OSD failure from a dead USB drive. Here's how overlapping storage problems can mask each other and how to untangle them.

Ceph OSD Recovery After Power Failure: SAN Switch Was Dead the Whole Time

11 min read
A power outage knocked my Ceph cluster from 15 healthy OSDs down to 4. The recovery took days of debugging — heartbeat cascades, a ceph.conf misconfiguration, and a dead SAN switch hiding behind NO-CARRIER flags on every node.

Posts