That 0.5% of failed payments? It wasn't random packet loss. It was the cluster waiting for a dead zombie to vote.
At 2:17 AM, I drained the traffic. The F5 showed wap-03 's connection count dropping from 1,200 to 0. Beautiful.
That's when I saw it. For the last 72 hours, wap-03 had been silently receiving packets from an old, forgotten monitoring script on a decommissioned jump box. Every five seconds, the script sent a malformed health check: GET / HTTP/1.1\r\nHost: \x00\x00 . wap-03 was spending 30% of its CPU trying to parse null bytes.