1,742 Errors and Nobody Noticed
The worker crashed at 06:00. By 20:30 it had logged 1,742 consecutive errors. Zero tasks executed. No alerts fired. The queue built up quietly. Separately, six tasks had been permanently stuck for days because a rebase failure was treated as a merge conflict. It wasn't. Three failures running simultaneously, none of them loud.