Runbook: Textfile Stale
Alert name: TextfileStale
A Prometheus textfile collector .prom file on indri has not been updated for over 1 hour, indicating the metrics exporter script has stopped running.
Affected Textfiles
| File | LaunchAgent | What it monitors |
|---|---|---|
borgmatic.prom | mcquack.eblume.borgmatic | Backup status |
zot.prom | mcquack.eblume.zot | Container registry |
minikube.prom | mcquack.minikube-metrics | Minikube cluster status |
jellyfin.prom | mcquack.eblume.jellyfin-metrics | Media server |
Diagnostic Steps
-
Check which file is stale — the
filelabel in the alert tells you. Verify on indri:ssh indri 'ls -la /opt/homebrew/var/node_exporter/textfile/' -
Check if the LaunchAgent is running:
ssh indri 'launchctl list | grep mcquack' -
Check LaunchAgent logs (plist defines stdout/stderr paths):
ssh indri 'cat ~/Library/Logs/mcquack/<agent-name>.log' -
Try running the exporter manually:
ssh indri 'cat ~/Library/LaunchAgents/mcquack.<agent>.plist' # Find the ProgramArguments, run them manually
Common Causes
- LaunchAgent not loaded —
launchctl load ~/Library/LaunchAgents/mcquack.<agent>.plist - Script error — the exporter script crashed; check logs
- Permissions — the textfile directory is not writable
- Indri reboot — some LaunchAgents may not auto-start
Related
- alloy — Collects textfile metrics via
prometheus.exporter.unix - deploy-infra-alerting — Alerting pipeline overview