Changelog

All notable changes to BlumeOps are documented in this file.

The format is based on Keep a Changelog.

[v1.15.1] - 2026-03-28

Features

  • Add Tor Snowflake proxy on ringtail as a systemd service to support anti-censorship efforts.
  • Add offsite backup for immich photo library to BorgBase, running daily at 4 AM from indri via sifaka SMB mount.
  • Add QArt Tuner — a Go tool that generates QR codes whose data modules form a recognizable image, with an interactive web UI for parameter tuning. Based on the QArt technique by Russ Cox. Lives in utils/qart/.

Infrastructure

  • Migrate Forgejo from Homebrew to source build with mcquack LaunchAgent, matching the pattern used by zot, caddy, and alloy. Upgrades to v14.0.3 (7 security fixes including PKCE bypass and OAuth scope bypass).
  • Add borgmatic pg_dump backups for authentik and immich databases. Authentik uses the existing blumeops-pg cluster on port 5432. Immich requires a new borgmatic role on the immich-pg cluster, a Tailscale service, and Caddy L4 proxy on port 5433.
  • Upgrade External Secrets Operator from v1.3.2 to v2.2.0 and migrate from Helm chart to static kustomize manifests.
  • Add post-deploy maintenance docs and generation pruning task for ringtail.
  • Fix Immich Helm values: resource limits and probe timeouts were silently ignored due to wrong value keys. Resources now actually apply to pods, and liveness/readiness probe timeouts increased from 1s to 5s to prevent kubelet from killing pods during ML inference.
  • Reduce PodNotReady alert lookback window from 5m to 60s to clear faster after rollouts.
  • Tighten ArgoCDAppOutOfSync alert: reduce pending duration from 30m to 5m and lookback window from 5m to 1m so alerts clear faster after sync.
  • Update ringtail flake inputs (nixpkgs, home-manager).
  • Upgrade Homepage dashboard from v1.10.1 to v1.11.0
  • Upgrade nvidia-device-plugin from v0.18.2 to v0.19.0

Documentation

  • Review and fix CV service doc (correct URL, forge domain, container tag link) and add private forge repo review guidance to review-services process.
  • Review tailscale-setup tutorial: fix macOS install steps, add --accept-routes tip, correct tag name, add ACL apply instructions, add [[tailscale-operator]] cross-reference.

Miscellaneous

  • Add preserve/* branch prefix exclusion to branch-cleanup task; document Pyroscope profiling work and blockers in observability reference.

[v1.15.0] - 2026-03-24

Features

  • Deploy Prowler CIS scanner as a weekly CronJob on minikube-indri, with reports written to sifaka NFS share.
  • Add Grafana “Alerts” dashboard showing currently firing alerts and recent state changes.
  • Add IaC scanning via Prowler IaC provider (Saturday 2am, Dockerfiles and K8s manifests).
  • Add container image vulnerability scanning via Prowler image provider (Saturday 3am, all blumeops/* images).

Bug Fixes

  • Fix authentik worker OOMKill by setting AUTHENTIK_WORKER_CONCURRENCY=2 (was defaulting to 16 based on CPU count).
  • Remove group: "" from tailscale-operator ignoreDifferences — ArgoCD normalizes away the empty string, causing permanent OutOfSync on the apps app.

Infrastructure

  • Decommission JobSync service — removed ArgoCD app, k8s manifests, container build, Caddy proxy, Homepage entry, docs, and forge mirror. Replaced by datasette-based job tracking (coming soon).
  • Localize authentik-redis container: replace upstream redis:7-alpine with nix-built image from nixpkgs (Redis 8.2.3). Introduces attached service pattern with parent field in service-versions.yaml and version assertion in default.nix to prevent silent version drift.
  • Unified Dockerfile and Nix container build workflows into a single workflow that auto-classifies containers by build type and routes to the correct runner (k8s for Dockerfile, nix-container-builder for Nix). Removed nettest container (outgrown). Nix builds now require an explicit version = "..." declaration — no implicit nixpkgs fallback.
  • Monthly tooling dependency update: bump prek hooks (trufflehog 3.94.0, ruff 0.15.7, shfmt 3.13.0), Fly.io images (nginx 1.29.6, Alloy 1.14.1), actions/checkout v4.3.1→v6.0.2, tighten mise task Python lower bounds (rich 14, typer 0.24, httpx 0.28.1, pyyaml 6.0.2), and bump ansible-lint/ansible-core floors.
  • Upgrade ntfy v2.17.0 → v2.19.2 (adds experimental PostgreSQL support, read replicas, web push fixes)
  • Revert Tailscale operator to v1.94.2 (v1.96.3 images not yet published); keep Fly proxy tailscale wait improvement
  • Add RuntimeDefault seccomp profiles to all managed deployments, statefulsets, and cronjobs.
  • Upgrade Frigate from 0.17.0-rc2 to 0.17.1 (security fixes, bugfixes). Add motion retention tier (365 days), reduce continuous retention from 180 to 30 days.

Documentation

  • Review and fix ArgoCD config tutorial: correct sync policy example, fix typo, add missing cross-references and frontmatter.
  • Review and update 12 reference docs: fix stale image references to point at kustomization manifests instead of hardcoded tags, correct Prometheus scrape target, expand external-secrets stub, add cross-references between backup/disaster-recovery docs, and remove misleading .ts.net URLs from Quick Reference tables.

[v1.14.3] - 2026-03-22

Features

  • Deploy infrastructure alerting pipeline using Grafana Unified Alerting with ntfy push notifications. 7 alert rules with runbooks covering service health, pod readiness, PostgreSQL, textfile freshness, Frigate cameras, and ArgoCD sync status. services-check now queries the alerting API for covered checks.

Bug Fixes

  • Fix Frigate NVR crash by re-adding required mqtt config section (disabled) after Mosquitto removal.
  • Fix borgmatic backup failure: use correct kubectl context (minikube) on indri for Mealie SQLite dump hook

Infrastructure

  • Localize Grafana Alloy container image with dual Dockerfile + Nix builds from forge mirror
  • Upgrade Prometheus from v3.9.1 to v3.10.0 (distroless variants, PromQL fill operators, performance improvements)
  • Bump Frigate recording retention (180d continuous, 30d detections, 730d alerts) and add camera-fps health check to services-check.
  • Improve Frigate health checks in services-check: per-camera FPS validation and NFS storage accessibility check.
  • Increase data retention: Prometheus 15d → 10y, Loki 31d → 365d (PVC sizes unchanged; minikube hostpath doesn’t enforce limits)
  • Standardize OCI labels across all container Dockerfiles with consistent title, description, version, source, and vendor metadata.

Documentation

  • Review and correct Tailscale reference doc: fix ACL path, add missing device tags (ringtail, per-service tags, ci-gateway, flyio-proxy), correct access matrix (PyPI→DevPI, homelab grants), add SSH homelab→homelab rule, document auto approvers, add last-reviewed frontmatter.

AI Assistance

  • Add four Claude Code subagents: infra-health (background health monitor), doc-reviewer (persistent-memory doc review), change-classifier (C0/C1/C2 triage), and mikado-navigator (C2 chain state advisor).

Miscellaneous

  • Standardized USAGE pragmas and typer CLI parsing across all mise tasks: added missing #USAGE directive to mikado-branch-invariant-check, converted pr-comments and op-backup from raw sys.argv to typer for consistency with all other uv python scripts.

[v1.14.2] - 2026-03-17

Features

  • Deploy Mealie recipe manager on minikube-indri for meal planning and prep automation.
  • Add UnPoller deployment to monitor UniFi network metrics via Prometheus

Bug Fixes

  • Fix Caddy v2.11 breaking change: preserve original Host header for HTTPS upstreams.
  • Fix plan-a-meal random recipe queries — add required paginationSeed parameter

Infrastructure

  • Externalize Tailscale operator manifest to forge mirror, removing 495 KB vendored file from the repo.
  • Externalize TeslaMate Grafana dashboards to forge mirror, removing 713 KB of ConfigMaps from the repo.
  • Upgrade Caddy from v2.10.2 to v2.11.2 (7 CVE fixes), create caddy-l4 forge mirror, migrate all ~/code/3rd clones on indri to HTTPS forge.ops.eblu.me remotes.
  • Upgrade borgmatic from 2.0.13 to 2.1.3 on indri (improved borg warning handling, memory/performance improvements)

Documentation

  • Add git last-modified subsort to docs-review script, so ties in review date are broken by least recently updated first.
  • Review jellyfin (10.11.6, current) and automounter (1.11.0) services; add missing frigate share to automounter docs.

[v1.14.1] - 2026-03-14

Features

  • Add docs-preview mise task: builds docs with Dagger and serves them locally in the production quartz container, opening the browser directly to the specified card. Also adds visual preview hints to the docs-review checklist and the review-documentation how-to.

Infrastructure

  • Add jobsync to services-check and homepage dashboard; mark as reviewed at v1.1.4
  • Bump Grafana Alloy to v1.14.0 across all deployments (indri, alloy-k8s, alloy-ringtail, alloy-tracing-ringtail)
  • Upgrade zot container registry from v2.1.13 to v2.1.15 (CVE-2025-30204, open redirect fix). Fix trivy CVE DB downloads by adding /usr/local/bin to LaunchAgent PATH.
  • Remove Mosquitto (MQTT broker) — unused since frigate-notify switched to webapi polling. Deleted ArgoCD app, k8s manifests, namespace, and updated all docs.

Documentation

  • Add how-to card for running the 1Password backup (mise run op-backup), with bidirectional links to restore procedure and service reference.

[v1.14.0] - 2026-03-09

Features

  • Deploy JobSync to ringtail k3s — nix-built container, Tailscale Ingress, Caddy route at jobsync.ops.eblu.me, Ollama integration for AI features.

Bug Fixes

  • Fix 1Password Connect logs showing as errors in Grafana by normalizing numeric log levels (1-5) to standard strings (error/warn/info/debug/trace) in the Alloy log processing pipeline.
  • Fix mikado-branch-invariant-check false positive: close commits without preceding impl commits are valid (e.g., operational tasks with no code changes).

Infrastructure

  • Disable Quartz SPA mode and remove robots.txt crawler exclusions to fix the Facebook crawler spider trap. Remove hand-curated category index files in favor of Quartz auto-generated folder pages.

Documentation

  • Add JobSync reference card, update ringtail workloads table, document observability via Loki, and wire RAPIDAPI_KEY through ExternalSecret for job search automation.
  • Relax wiki-link constraints: allow path-based links for disambiguation, drop global filename uniqueness requirement, remove docs-check-filenames and docs-check-index hooks.

[v1.13.3] - 2026-03-06

Infrastructure

  • Upgrade Dagger engine and CLI from v0.20.0 to v0.20.1.

Documentation

  • Add how-to guide for upgrading Dagger, documenting the correct phase ordering to avoid chicken-and-egg CI failures.

[v1.13.2] - 2026-03-06

Infrastructure

  • Replace nginx spider-trap 404 guards with robots.txt disallowing /explorer/ to prevent crawler-induced infinite URL trees.

[v1.13.1] - 2026-03-06

Infrastructure

  • Add :kustomized sentinel tag to all manifest image references overridden by kustomize, making it clear the real tag lives in kustomization.yaml.
  • Add nginx spider-trap guards to docs.eblu.me Quartz container — blocks recursive crawler paths at /tags/ depth >1 and global depth ≥5.

[v1.13.0] - 2026-03-05

Features

  • Add Authentik OIDC login for ArgoCD — eblume (admins group) gets admin access via SSO while local admin password remains as break-glass.
  • Expose Forgejo publicly at forge.eblu.me via Fly.io reverse proxy with rate limiting, fail2ban, and security hardening.
  • Deploy Ollama LLM server on ringtail with GPU acceleration and declarative model management
  • Add distributed tracing via Grafana Tempo and Beyla eBPF auto-instrumentation. Tempo runs on minikube-indri for trace storage, while a privileged Alloy DaemonSet on ringtail uses Beyla to instrument HTTP services (Frigate, ntfy, Ollama, Immich) without code changes. Grafana gets trace-to-log and trace-to-metrics correlation.
  • Add fly.io nginx proxy observability and application logs to Forgejo dashboard; rename from “Forgejo Repository Health” to “Forgejo”.

Bug Fixes

  • Add per-torrent rate metrics using Transmission’s native rate_download/rate_upload fields. Dashboard panels were querying cumulative byte gauges (torrent size) instead of actual transfer rates.
  • Fix Frigate database loss on pod restart by pointing database path to persistent /db volume
  • Fix runner-job-image Dagger version mismatch: bump from 0.19.11 to 0.20.0 to match upgraded Dagger module.

Infrastructure

  • Home-build grafana-sidecar container image, replacing upstream quay.io/kiwigrid/k8s-sidecar for supply chain control.
  • Add HA (2 replicas + PDB) for CV and Docs services for zero-downtime deploys.
  • Build Loki container image locally instead of pulling from upstream
  • Replace unmaintained metalmatze/transmission-exporter sidecar with homegrown Python exporter using prometheus_client and transmission-rpc. Same metric names, so Grafana dashboards work unchanged.
  • Upgrade Transmission from 4.0.6-r4 to 4.1.1-r1 (Alpine edge community repo)
  • Bump Frigate memory limit from 2Gi to 3Gi to prevent OOMKills under steady-state ONNX + CUDA workload.
  • Add Gandi bookmark to homepage dashboard
  • Allow implicit octals in yamllint and use 0755 directly in k8s manifests instead of decimal or disable-line comments.
  • Upgrade Dagger engine and CLI from v0.19.11 to v0.20.0
  • Upgrade TeslaMate from v2.2.0 to v3.0.0 (dark mode, BRIN index optimization, Elixir 1.19.5, trixie-slim runtime)
  • Add OOMKilled Containers stat panel and Container Restarts timeseries to the Kubernetes Clusters dashboard for persistent OOMKill visibility.
  • Add pre-commit hook to prevent changelog fragments from being placed in subdirectories.
  • Bump kiwix-serve from 3.8.1 to 3.8.2

Documentation

  • Clarify that changelog fragments apply to all change levels (C0, C1, C2), not just C2.
  • Add reference card for the Ollama LLM inference service.
  • Clarify that all mikado frontmatter is removed during chain finalization; clean up stale frontmatter from closed chains; fix ai-docs exit code after plans directory retirement.
  • Retire docs plans directory: deleted completed/abandoned plans, converted migrate-forgejo-from-brew to a mikado chain root card, removed plans references from tutorials and how-to index.
  • Review and fix upgrade-grafana doc: correct image tag reference to kustomization.yaml, add sidecar cross-reference, update stale service-versions notes.
  • Use towncrier orphan fragment naming (+slug.<type>.md) for C0 changes to avoid main.* collisions.

[v1.12.1] - 2026-03-02

Features

  • Mikado branch invariant hook now rejects impl commits that modify Mikado card files (docs with requires:, status:, or branch: mikado/ frontmatter).

Infrastructure

  • Switch git hooks from pre-commit to prek, a faster Rust-native drop-in replacement. Adds built-in checks for case conflicts, private key detection, and executable shebangs. Configuration migrated from .pre-commit-config.yaml to prek.toml.

Documentation

  • Review build-authentik-from-source Mikado chain: fix go-server-derivation path errors, remove stale DRF fork content from mirror doc, add last-reviewed to all cards.

[v1.12.0] - 2026-03-01

Bug Fixes

  • Fix authentik 2026.2.0 startup crash caused by Django migration ordering bug (FieldError: Cannot resolve keyword 'group_id'). Patch ensures authentik_core/0056 runs before authentik_rbac/0010.

Infrastructure

  • Upgrade authentik from 2025.10.1 to 2026.2.0, building core services from source via custom Nix derivations rather than using nixpkgs directly (nixpkgs still provides satellite dependencies like Python, Go, and system libraries). Four components (API client generation, Python backend, web UI, Go server) assembled into a single container image with full supply chain control via forge mirrors.
  • Sync Frigate zone coordinates from live API to manifest (driveway_entrance, driveway)
  • Pin blumeops-pg to PostgreSQL 18.3 (from floating :18 tag at 18.1)

Documentation

  • Review and update authentik-api-client-generation doc: remove stale patch note, fix test-build.nix section, add last-reviewed date.
  • Review all three forgejo-runner Mikado chain docs: stamp last-reviewed, add cross-links, fix configmap.yamlconfig.yaml reference.
  • Review build-grafana-container docs; fix stale grafana.md reference card (Helm → Kustomize).

[v1.11.5] - 2026-02-26

Features

  • Add authenticated GitHub mirror sync with PAT rotation tooling (mirror-update-pats, mirror-create auth support, how-to doc).
  • Add Transmission Grafana dashboard with metrics exporter sidecar for monitoring upload/download speeds, transfer volumes, and per-torrent breakdowns.

Bug Fixes

  • Fix Frigate dashboard “Detection Events Rate” panel showing no data — corrected metric name to frigate_camera_events_total and label to camera.
  • Filter car and bird detections from Frigate driveway zone to stop repeated alerts on parked cars at night

Infrastructure

  • Port CloudNative-PG operator from Helm chart to direct upstream release manifest via forge mirror.
  • Add multi-cluster Kubernetes observability: deploy kube-state-metrics and Alloy on ringtail (k3s), add cluster label to all metrics/logs, replace single-cluster dashboards with multi-cluster Kubernetes dashboard and dedicated Ringtail dashboard with GPU monitoring.
  • Add explicit ExternalSecret defaults for SSA sync parity with ArgoCD v3.3
  • Upgrade ArgoCD from v3.2.6 to v3.3.2 with Server-Side Apply enabled

AI Assistance

  • Bake default bat options into ai-docs mise task so agents no longer need verbose flags at session start.
  • docs-review task now prints the file path instead of the file content, so the LLM reads it directly.

[v1.11.4] - 2026-02-25

Features

  • Add mirror-create mise task for creating upstream mirrors in the mirrors/ Forgejo org

Bug Fixes

  • Fix Grafana OAuth role mapping: INI parser was stripping quotes from role_attribute_path = 'Admin', causing all Authentik users to get Viewer role instead of Admin. Now uses group-based mapping from the admins Authentik group.
  • Fix TeslaMate dashboards showing “No Data”: Grafana 12.x’s grafana-postgresql-datasource plugin requires the database name in jsonData, not just the top-level database field.

Infrastructure

  • Move image tags to kustomize images: transformer across 22 services and replace hand-written ConfigMaps with configMapGenerator: in 12 services, enabling content-hash-based automatic rollouts on config changes.
  • Migrate upstream mirror repos from eblume/ to mirrors/ Forgejo organization
  • Port Prometheus to local container build (3-stage: Node UI, Go binaries, Alpine runtime) for supply chain control via Zot registry.
  • Fix ArgoCD app definitions and credential template to use mirrors/ org after forge mirror migration; bump immich v2.5.2 → v2.5.6.
  • Document AirPlay cross-VLAN firewall rules for Samsung Frame TV (established/related, AirPlay ports, dynamic reverse) and fix rule ordering in segment-home-network plan.
  • Update image tags for all 6 mirror-migrated containers (homepage, navidrome, ntfy, miniflux, prometheus, teslamate)
  • Switch prometheus, teslamate, and miniflux container builds to forge mirrors; create miniflux mirror

Documentation

  • Document squash-merge container tag provenance issue and post-merge workflow for updating manifests to main-SHA tags.
  • Add mise-tasks reference card with categorized task inventory; include in ai-docs context
  • Review 3 how-to docs: stamp provision-authentik-database and use-pypi-proxy, fix wrong policy path and misleading —yes in update-tailscale-acls

[v1.11.3] - 2026-02-23

Features

  • Upgrade Grafana from 11.4.0 to 12.3.3 with home-built container image and Kustomize manifests, replacing the Helm chart deployment.

Bug Fixes

  • Fix Dagger pipelines hanging when called from mise tasks in interactive terminals. Added --progress=plain to all dagger call invocations to prevent SIGTTOU from stopping the process when mise’s child process group is not the terminal foreground group.
  • Fix Grafana TeslaMate dashboards not appearing in a folder — enabled foldersFromFilesStructure so the sidecar’s grafana_folder annotation is respected.
  • Container build workflows now checkout the dispatch ref when building from feature branches, fixing “No Dockerfile — skipping” errors for containers not yet on main.

Infrastructure

  • Fix Frigate Prometheus scrape target to route via Caddy (nvr.ops.eblu.me) after migration to ringtail, and rebuild Grafana dashboard with updated Frigate 0.17 metrics (GPU usage, temperature, skipped FPS, detection events).
  • Update tooling dependencies: pre-commit hooks (trufflehog, ruff, shellcheck, prettier, actionlint), Fly.io Dockerfile (pin nginx 1.28.2-alpine, alloy v1.13.1), and normalize mise task Python lower bounds.
  • Rename containers/forgejo-runner to containers/runner-job-image to distinguish the CI job execution image from the Forgejo runner daemon, fixing a version-check false positive.

Documentation

  • Review deploy-authentik card: rewrite as reproducible process guide, remove stale version info and future work section, mark plan as completed.
  • Formalize C0/C1/C2 change classification: C0 allows direct-to-main commits, C1 adds docs-first workflow with branch deployment, C2 introduces the Mikado Branch Invariant for strict commit ordering on multi-phase changes. Add C2 conventions: C2(<chain>): plan/impl/close/finalize commit messages, mikado/<chain-stem> branch naming, and branch: frontmatter on goal cards. New tooling: docs-mikado --resume for cold-start session pickup and mikado-branch-invariant-check pre-commit hook.
  • Replace Grafana Helm upgrade plan with C2 Mikado chain for upgrading to 12.x with kustomize and home-built containers.

AI Assistance

  • Improved Mikado C2 process: end-of-cycle session prompts, rigorous reset discipline with documented git patterns, and --resume now shows PR number and stash hints.

[v1.11.2] - 2026-02-22

Features

  • Add branch-cleanup mise task and scheduled Forgejo workflow to delete merged branches locally and on the Forgejo remote. Detects squash-merged PRs via the Forgejo API. The workflow runs approximately every 10 days with a configurable age cutoff (default 30 days).
  • Add Forgejo repository health metrics collector and Grafana dashboard with CI/CD, release, and language tracking across all repos.
  • Switch Frigate object detection from YOLO-NAS-S (320x320) to YOLOv9-c (640x640) with CUDA Graphs support, and add frigate-export-model Dagger pipeline + mise task for reproducible model exports.

Infrastructure

  • Simplify service-versions.yaml type taxonomy to argocd | ansible | nixos; add nix-container-builder entry; backfill forgejo and forgejo-runner versions
  • Prepare forgejo-runner v12 upgrade: review config compatibility, add workflow schema validation via Dagger, wire pre-commit hook
  • Upgrade k8s forgejo-runner daemon from v6.3.1 to v12.7.0

Documentation

  • Add Mikado chain for upgrading k8s forgejo-runner from v6.3.1 to v12.x

[v1.11.1] - 2026-02-22

Infrastructure

  • Use Zot registry logo instead of Docker logo on homepage dashboard

[v1.11.0] - 2026-02-22

Features

  • Add agent change process (C0/C1/C2) documentation and docs-mikado tool for Mikado method dependency chain resolution. Rename zk-docs task to ai-docs.
  • Deploy Authentik identity provider on ringtail k3s cluster, replacing Dex as the SSO provider. Includes Nix-built container, CNPG database, Redis, and Caddy routing at authentik.ops.eblu.me.
  • Integrate Forgejo with Authentik OIDC for single sign-on with group-based admin propagation. Enforce TOTP MFA on Authentik authentication flow.
  • Add Authentik SSO to Jellyfin with admin group mapping
  • Container builds now trigger automatically on merge to main (path-based) and use commit-SHA-based image tags (vX.Y.Z-<sha>) for full traceability. The container-tag-and-release task is replaced by container-build-and-release which dispatches workflows via the Forgejo API. Added pre-commit hook to keep container versions in sync with service-versions.yaml.
  • Register Zot as an OIDC client in Authentik via blueprint, with artifact-workloads group, zot-ci service account, and OIDC credentials template for Ansible deployment.
  • Enable OIDC + API key authentication on zot registry with three-tier access control (anonymous read, CI create, admin full). Wire both CI push paths (Dagger and Nix/skopeo) with registry credentials via Forgejo Actions secrets. Allow anonymous Prometheus metrics scraping via accessControl.metrics.users.

Bug Fixes

  • Fix frigate-notify notification pipeline: switch to webapi polling, enable dedup, drop events without snapshots, use hi-res snapshots

Infrastructure

  • Add Mikado prereq for commit-based container tagging scheme to harden-zot-registry chain
  • Convert deploy-authentik plan to C2 Mikado chain entry point.
  • Add flake-update Dagger pipeline for updating ringtail NixOS flake inputs.
  • Upgrade frigate-notify from v0.3.5 to v0.5.4

Documentation

  • Add deployment plan for Authentik identity provider to replace Dex

[v1.10.0] - 2026-02-19

Features

  • Deploy Dex OIDC identity provider on ringtail with Grafana as first SSO client.
  • Added Nix container build for nettest, validating the full nix-container-builder pipeline on ringtail. One git tag now triggers both Dockerfile and Nix workflows — each skips if its build file is absent. Rewrote container-tag-and-release as a typer CLI with —dry-run support. Added container policy.json and registries.conf to ringtail for skopeo.
  • Add NixOS configuration for ringtail (gaming/compute workstation with RTX 4080). Includes declarative disk partitioning via disko, NVIDIA drivers, sway/Wayland desktop, Steam, Tailscale, and Ansible-driven provisioning.
  • Add screen lock, idle timeout, and sleep prevention to ringtail: swaylock locks after 15min, display powers off after 60min, machine never suspends.
  • Systemd Forgejo Actions runner on ringtail (nix-container-builder label) for building containers with nix build and pushing via skopeo. K3s cluster retained for future workloads. 1Password Connect + External Secrets Operator available for k8s secret management.

Bug Fixes

  • Cap detect FPS to 2 and sync motion masks/zones from live config
  • Fix zk-docs task to use new path for troubleshooting doc after how-to reorg.
  • Inhibit swayidle lock screen when a fullscreen window is active on ringtail, preventing screen lock during gamepad-only gaming sessions.
  • Make 1Password secret tasks in ringtail playbook idempotent by checking kubectl apply output instead of always reporting changed.

Infrastructure

  • Port Frigate NVR to ringtail k3s with RTX 4080 GPU acceleration (TensorRT/ONNX), replacing the ZMQ-based Apple Silicon detector on indri.
  • Replace Homepage Helm chart (jameswynn/homepage v2.1.0, pinned at app v1.2.0) with plain kustomize manifests and a custom Dockerfile built from upstream v1.10.1. Gives full version control and matches the pattern used by other blumeops services.
  • Port ntfy to a locally built container image from forge mirror source.
  • Port Mosquitto (MQTT) and ntfy to ringtail k3s; retire Apple Silicon Detector from indri.
  • Ringtail post-install: NixOS config (sway with Catppuccin Macchiato theme, fish, 1Password, Steam, LibreWolf, Bluetooth audio, chezmoi, dev tools, nix-ld), Dagger flake-lock pipeline, improved provision-ringtail workflow, services-check integration, and reference documentation.
  • Add ringtail DeviceTags to Pulumi and allow homelab-to-homelab Tailscale SSH for cross-host ansible/management.
  • Update Frigate zone masks from live config and expand alert notifications to cover both Driveway and Driveway_entrance zones.
  • Add Apple Silicon ZMQ detector for Frigate — inference moves from in-pod ONNX CPU to CoreML on indri via ZMQ, using YOLOv9-m model
  • Deploy Tailscale operator on ringtail k3s cluster
  • Upgrade ntfy from v2.11.0 to v2.17.0 and add ntfy and frigate reference docs.
  • Update External Secrets Operator Helm chart from 1.3.1 to 2.0.0 (operator v1.3.2)
  • Upgrade Frigate NVR from 0.16.4 to 0.17.0-rc2 (prerequisite for Apple Silicon ZMQ detector)

Documentation

  • Add Dex OIDC documentation: reference card, federated login explanation, services-check integration, and updated plan.
  • Update services-check and documentation to reflect Frigate, Mosquitto, and ntfy migration from indri minikube to ringtail k3s (PRs #216, #217).
  • Review and fix update-documentation how-to: add missing cache purge step, clean up fragment types table.

[v1.9.4] - 2026-02-17

Documentation

  • Reorganize how-to guides into deployment/, configuration/, and operations/ subdirectories; review and update gandi-operations doc; fix missing cv.eblu.me CNAME in gandi reference card.

[v1.9.3] - 2026-02-16

Features

  • Add service version review system with mise run service-review task, tracking file, and how-to guide.
  • Add UniFi admin link to homepage dashboard bookmarks.

Infrastructure

  • Eliminate double towncrier run in release workflow — changelog is now built once on the runner, then the pre-processed source tree is passed to a new build_quartz Dagger function for the Quartz site build only.
  • First service version review: pin mosquitto to 2.0.22, bump tailscale-operator to v1.94.2, record 7 reviewed services

[v1.9.2] - 2026-02-16

Features

  • Add how-to guide for building container images and port navidrome to a custom-built container image.

Bug Fixes

  • Fix Frigate repeatedly alerting on parked cars by removing per-object max_frames and setting stationary interval to 0. Make Frigate config writable so UI changes (zones, masks) persist within a pod lifecycle.
  • Switch navidrome to custom container image with dedicated non-root user and fsGroup security context

Documentation

  • Review expose-service-publicly doc: replace stale inline code with references to actual files, add observability sidecar section, fix broken internal link, update templates to current patterns.

[v1.9.1] - 2026-02-15

Documentation

  • Review connect-to-postgres, create-release-artifact-workflow, and deploy-k8s-service docs. Fix stale repoURL, incorrect Caddy config keys, add Tailscale tag documentation, and migrate remaining op item get calls to op read.

[v1.9.0] - 2026-02-14

Features

  • Deploy cloud-free NVR stack: Frigate 0.16.4 (ARM64) with ONNX/YOLO-NAS-s detection, Mosquitto MQTT broker, Ntfy self-hosted push notifications (with iOS APNs relay), and frigate-notify for detection alerting. GableCam (ReoLink Elite Floodlight) connected via RTSP with NFS recordings on sifaka, Grafana dashboard, Prometheus scraping, Homepage integration, and Caddy reverse proxies at nvr.ops.eblu.me and ntfy.ops.eblu.me.

Infrastructure

  • Configure DinD sidecar to use Zot as a pull-through registry mirror for Docker Hub images, reducing bandwidth and avoiding rate limits during Dagger CI builds.
  • Abandon UniFi Pulumi IaC (provider bugs caused network outage); add manual three-network segmentation plan for UX7 web UI.
  • Upgrade Node.js from 20 to 22 (LTS) in Dagger docs build and forgejo-runner container
  • Tier 1 version bumps: upstream images (prometheus, loki, alloy, kube-state-metrics, tailscale, navidrome), helm charts (CloudNativePG, 1Password Connect), and custom containers (miniflux, kubectl, kiwix-serve, nettest, transmission) updated to latest stable versions with Alpine 3.22 base.

Documentation

  • Add how-to guide for connecting to PostgreSQL as a superuser via psql.
  • Review add-ansible-role doc: fix secrets to use op read, match tag format to playbook, fix handler pattern, add last-reviewed date.
  • Review and fix why-gitops doc: correct wiki-links, fix aptbrew, broaden Pulumi scope, add last-reviewed.

[v1.8.2] - 2026-02-13

Features

  • Recategorize homepage groups: “Content” (Immich, Kiwix, Miniflux, DJ, Grafana) and “Misc” (CV, TeslaMate, Transmission, Docs, Prometheus, PyPI)

Infrastructure

  • Move non-secret forgejo-runner env vars from ExternalSecret to deployment spec so version bumps trigger automatic rollouts
  • Add yq to forgejo-runner container and replace sed-based YAML editing in workflows with yq

[v1.8.0] - 2026-02-12

Features

  • Update CV release to v1.0.2
  • Update CV release to v1.0.3.

Bug Fixes

  • Fix cache hit rate panels on APM and Fly.io dashboards showing blank/red or misleading 100% for low-traffic static sites.

Documentation

  • Add reference/tools/ category with Dagger, ArgoCD CLI, Ansible, and Pulumi reference cards

Miscellaneous

  • Add X-Clacks-Overhead header to public proxy for cv and docs: GNU Terry Pratchett.

[v1.7.1] - 2026-02-12

Features

  • Expose CV service publicly at cv.eblu.me via Fly.io proxy.
  • Update CV service to resume release v1.0.1.

Infrastructure

  • Add CV to services-check (tailnet and public endpoints).

Miscellaneous

  • Update CV homepage link to use public URL (cv.eblu.me).
  • Remove /_error test endpoint from Fly.io nginx proxy.

[v1.7.0] - 2026-02-12

Features

  • Add CV/resume web app at cv.ops.eblu.me — container, k8s manifests, Caddy route, and deploy workflow. Content built from separate cv repo.

Infrastructure

  • Extend forgejo_actions_secrets Ansible role to support multiple repos.

Documentation

  • Add CV service reference card and update apps registry, Caddy docs, and services index.
  • Add how-to guide for creating release artifact workflows with Forgejo packages.

[v1.6.9] - 2026-02-11

Bug Fixes

  • Set TZ=America/Los_Angeles in the Dagger build_changelog container so towncrier stamps the correct local date instead of UTC (which showed tomorrow’s date for evening releases).

[v1.6.8] - 2026-02-11

Documentation

  • Update “Deploy K8s Service” how-to with current ProxyGroup ingress pattern.

[v1.6.7] - 2026-02-11

Documentation

  • Close Dagger CI plan (Phases 1–3 complete) and move to completed plans archive.

[v1.6.6] - 2026-02-11

Features

  • Simplify Forgejo runner image (Dagger Phase 3): remove Node.js, Docker CLI, buildx, skopeo, gnupg, lsb-release, and xz-utils. Add tzdata and flyctl. All build tools now live inside Dagger containers.

Bug Fixes

  • Restore Docker CLI to Forgejo runner image — Dagger shells out to docker to provision its BuildKit engine.
  • Restore Node.js to Forgejo runner image — required by actions/checkout@v4 and other JavaScript Actions that were broken by the Phase 3 simplification.

[v1.6.4] - 2026-02-12

Bug Fixes

  • Set Forgejo runner timezone to America/Los_Angeles. The runner previously used UTC, causing towncrier changelog entries to show tomorrow’s date when releases were cut in the evening. Note: the v1.6.2 changelog entry shows 2026-02-12 due to this bug; dates may appear non-sequential as a result.

[v1.6.2] - 2026-02-12

Features

  • Migrate docs build pipeline to Dagger (Phase 2): dagger call build-docs --src=. --version=dev now runs the full Quartz build locally, identically to CI. Adds date-modified frontmatter to all docs and a docs-check-frontmatter pre-commit hook.
  • Adopt Dagger as CI build engine for container images (Phase 1). Replaces the Docker buildx + skopeo composite action with a Dagger Python module. BuildKit’s push is compatible with Zot, eliminating the skopeo workaround.

Bug Fixes

  • Fix blumeops-tasks: migrate from deprecated Todoist REST API v2 to API v1, handle cursor-based pagination, and use op read for 1Password credential retrieval.

[v1.6.1] - 2026-02-11

Bug Fixes

  • Fix Fly.io proxy cache purge command for BusyBox shell compatibility.

[v1.6.0] - 2026-02-11

Bug Fixes

  • Purge Fly.io proxy cache after docs deploy so new releases are served immediately.

[v1.5.4] - 2026-02-11

Bug Fixes

  • Bump Fly.io proxy VM memory from 256MB to 512MB to prevent Alloy OOM kills.

Documentation

  • Add plan documents for Dagger CI/CD adoption and upstream fork strategy.
  • Add plan documents for OIDC provider adoption, zot registry hardening, and expanded network segmentation details.
  • Review security-model.md: fix op CLI pattern, add Tailscale Operator section.

[v1.5.3] - 2026-02-11

Features

  • Add BorgBase offsite backup repository for 3-2-1 backup strategy
  • Fly.io proxy serves a friendly error page when upstreams are unreachable (indri offline, Tailscale tunnel down, etc.). Test at docs.eblu.me/_error.
  • Add op-backup mise task for encrypted 1Password disaster recovery backups via borgmatic
  • Add SMART disk health monitoring for sifaka NAS with smartctl_exporter, Grafana dashboard, Ansible playbook, and Caddy L4 routing via ops.eblu.me.

Bug Fixes

  • Replace op item get --fields with op read in all mise tasks (tailnet-up, tailnet-preview, dns-up, dns-preview) to prevent multi-line secret corruption.
  • Fix 502 errors during Fly.io proxy deploys by deferring health check until Tailscale is connected.
  • Fix minikube ansible role not restarting cluster after power loss — status check only examined host VM state, missing stopped kubelet/apiserver.
  • Log real client IPs in Fly.io proxy access logs using Fly-Client-IP header instead of showing the internal proxy address.

Infrastructure

  • Switch CI container builds from deprecated docker build to docker buildx build (BuildKit).
  • Install docker-buildx-plugin in forgejo-runner image to support docker buildx build.
  • Eliminate 502 errors during Fly.io proxy deploys by starting nginx after Tailscale, switching to bluegreen deploys, and using service-level health checks for traffic gating.

Documentation

  • Add troubleshooting guide for CNI conflict after unclean shutdown to restart-indri how-to.
  • Add migration plan for Forgejo brew-to-source transition
  • Document op read vs op item get convention for 1Password secret retrieval
  • Add power infrastructure reference card documenting the battery-backed UPS chain (Anker SOLIX F2000 → CyberPower UPS → homelab).
  • Add plan and reference card for UniFi Express 7 Pulumi IaC management.
  • Add how-to guide for restoring 1Password backup from borgmatic, with cross-links from disaster recovery, borgmatic, 1password, and backup policy docs

[v1.5.2] - 2026-02-09

Features

  • Filter blumeops-tasks to only show dated/recurring tasks when due today or earlier.
  • Add docs-review mise task that sorts docs by last-reviewed frontmatter date, prioritizing never-reviewed cards. Updated the review-documentation how-to to match.

Bug Fixes

  • Fix fly-deploy WARNING by starting nginx before Tailscale, deferring upstream DNS resolution to request time.

Infrastructure

  • Migrate all Ansible op item get calls to op read URI syntax for cleaner output and remove the regex_replace workaround on the Fly deploy token.
  • Restrict fly.io proxy ACLs to dedicated tag:flyio-target endpoints instead of broad tag:k8s and tag:homelab grants. Migrate all Tailscale Ingresses to a shared ProxyGroup with per-Ingress tag overrides (tag:flyio-target on docs, loki, prometheus). Add autoApprovers for VIP service routes. Enable --accept-routes on indri for ProxyGroup VIP routing.

[v1.5.1] - 2026-02-08

Features

  • Add observability to Fly.io proxy: Alloy collects nginx access logs (→ Loki) and derived metrics (→ Prometheus), with Grafana dashboards for Docs APM and Fly.io proxy health.

Infrastructure

  • Add docs.eblu.me and Fly.io health check to services-check

[v1.5.0] - 2026-02-08

Features

  • Add Fly.io public reverse proxy infrastructure for exposing services to the internet (first target: docs.eblu.me)

Documentation

  • Add how-to guide for exposing services publicly via Fly.io reverse proxy + Tailscale tunnel.
  • Update docs for public proxy: canonical URL is now docs.eblu.me, add Fly.io proxy reference card and operations how-to

[v1.4.2] - 2026-02-08

Documentation

  • Update all docs frontmatter titles from slug-case to human-readable and delete title-test cards.

[v1.4.1] - 2026-02-08

Documentation

  • Remove docs-check-titles pre-commit hook, add repo links to homepage, and test duplicate frontmatter titles.

[v1.4.0] - 2026-02-08

Features

  • Add documentation consistency checks: orphan detection in doc-links, new doc-index (category index coverage), doc-stale (staleness report), and doc-tags (tag inventory).

Bug Fixes

  • Fix broken icons for Pulumi and ArgoCD in homepage Admin bookmarks section.

Infrastructure

  • Add pre-commit to mise.toml project tools.

Documentation

  • Review exploring-the-docs tutorial: simplify wiki-links, fix broken replication/ reference, add Related section, match zk-docs flags to CLAUDE.md. Update use-pypi-proxy to document env-var-based proxy toggle.
  • Add Gandi DNS reference card and operations how-to, rewrite homepage intro for wider audience.
  • Add missing ai changelog fragment type to update-documentation guide, consolidate cicdci-cd and networknetworking tags
  • Updated restart-indri how-to to reflect actual recovery procedure after power outage. Added UPS to indri specs.
  • Fixed zk-docs links after file renames due to relative path issues

Miscellaneous

  • Rename doc-* mise tasks to docs-check-* / docs-review-* for clearer naming convention.

[v1.3.4] - 2026-02-05

Documentation

  • Enforce unique filenames, simple wiki-links (no paths), and no spaces in wiki-link targets for obsidian.nvim compatibility

[v1.3.3] - 2026-02-04

Infrastructure

  • Add IaC for Forgejo Actions secrets via new forgejo_actions_secrets Ansible role, syncing repository secrets from 1Password to Forgejo API

Documentation

  • Add how-to guide for safely restarting indri, plus AutoMounter reference card.

[v1.3.2] - 2026-02-04

Infrastructure

  • Fix Quartz build to use -d docs flag for accurate git-based file dates

[v1.3.1] - 2026-02-04

Infrastructure

  • Fix Quartz build to preserve git history for accurate file dates

Documentation

  • Fix misc changelog fragment type to show content (was showing empty entries)

[v1.3.0] - 2026-02-04

Features

  • Build workflow now supports version bump selection (major/minor/patch) and includes changelog in release body
  • Add ‘ai’ changelog fragment type for AI assistance changes

Bug Fixes

  • Fix Navidrome automatic library scan by correcting env var name from ND_SCANSCHEDULE to ND_SCANNER_SCHEDULE

Infrastructure

  • Move CHANGELOG.md to repository root (still included in docs build)
  • Remove iCloud Photos from borgmatic backup (photos now managed via Immich)

Documentation

  • Document Forgejo Actions secrets in forgejo reference card
  • Add troubleshooting how-to to zk-docs output

AI Assistance

  • Add wiki-link formatting convention to AI assistance guide

Miscellaneous

  • ,

[v1.2.1] - 2026-02-04

Features

  • Add doc-random mise task for random documentation review

Documentation

  • Add Caddy reference card and fix replication tutorial sequence

[v1.2.0] - 2026-02-04

Documentation

  • Complete Phase 6: migrate zk content, delete legacy cards, rewrite zk-docs for AI context priming

[v1.1.5] - 2026-02-04

Documentation

  • Add Phase 5 explanation docs: why GitOps, architecture overview, and security model

[v1.1.4] - 2026-02-04

Documentation

  • Add Phase 4 how-to guides: deploy k8s services, add ansible roles, update tailscale ACLs, and troubleshooting

[v1.1.3] - 2026-02-04

Features

  • Build workflow now automatically deploys docs after creating a release - updates the deployment manifest with the new release URL and syncs via ArgoCD, triggering a pod rollout

Miscellaneous

  • Remove confirmation prompt from container-tag-and-release task for non-interactive use

[v1.1.2] - 2026-02-04

No significant changes.

[v1.1.1] - 2026-02-04

Documentation

  • Add Phase 3 tutorials: “What is BlumeOps?”, “Exploring the Docs”, “AI Assistance Guide”, “Contributing”, and “Replicating BlumeOps” with sub-tutorials for Tailscale, Kubernetes, ArgoCD, and Observability. Each tutorial explicitly identifies its target audiences.

[v1.1.0] - 2026-02-04

No significant changes.

[v1.0.14] - 2026-02-04

No significant changes.

[v1.0.13] - 2026-02-04

No significant changes.

[v1.0.12] - 2026-02-04

No significant changes.

[v1.0.8] - 2026-02-04

Documentation

  • Convert wiki-link titles to lowercase slugs for reliable Quartz resolution

[v1.0.7] - 2026-02-03

Documentation

  • Switch to title-based wiki-links with validation (Quartz resolves via frontmatter title)

[v1.0.6] - 2026-02-03

Documentation

  • Fix wiki-links to use filename-based resolution with Quartz shortest path mode

[v1.0.5] - 2026-02-03

Documentation

  • Convert wiki-links to title-based format and add duplicate title detection

[v1.0.2] - 2026-02-03

Features

  • Add Reference section with 24 technical reference cards covering services, infrastructure, kubernetes, and storage

Documentation

  • Reorder documentation phases: Reference (Phase 2) now comes before Tutorials (Phase 3) so other docs can link to reference material

[v1.0.1] - 2026-02-03

Infrastructure

  • Add towncrier for automated changelog generation from news fragments

[0.1.0] - 2026-02-03

This is a historical release which doesn’t actually exist and which aggregates the changelogs prior to this date. The work on this blumeops project more or less began around Jan 16 2026. To an extent you can find corroborating details in the git commit log, but at the beginning (during this initial phase) there was a fairly large amount of non-source-controlled work. If a more accurate record is needed for this work, you may find it in borgmatic zk backups from this time period.

Features

  • Add Grafana Alloy for metrics remote_write to Prometheus
  • Add Alloy DaemonSet for automatic pod log collection and service health probes
  • Set up Borgmatic daily backups to Sifaka NAS with PostgreSQL streaming support
  • Add CloudNativePG PostgreSQL metrics scraping via Tailscale service
  • Add devpi PyPI caching proxy in Kubernetes with custom container image
  • Add Forgejo Actions CI runner in Kubernetes with host mode execution
  • Add Homepage service dashboard with automatic Kubernetes service discovery
  • Add Jellyfin media server with VideoToolbox hardware transcoding on indri
  • Add Kiwix offline Wikipedia server with kiwix-tools on indri
  • Add kube-state-metrics for Kubernetes resource metrics (pods, deployments, etc.)
  • Add Loki log aggregation with 31-day retention and Grafana integration
  • Add Miniflux RSS/Atom feed reader connected to PostgreSQL
  • Add Navidrome music streaming server with NFS storage from Sifaka
  • Add Prometheus metrics collection on indri with Sifaka node_exporter scraping
  • Add TeslaMate vehicle data logger with 18 Grafana dashboards
  • Add Transmission BitTorrent daemon for ZIM archive downloads
  • Add Zot OCI registry as pull-through cache for Docker Hub, GHCR, and Quay

Bug Fixes

  • Build Alloy with CGO for macOS native DNS resolver (fixes Tailscale MagicDNS)
  • Suppress noisy “v1 Endpoints is deprecated” warning from minikube storage-provisioner

Infrastructure

  • Deploy ArgoCD for GitOps continuous delivery with manual sync policy for workloads
  • Set up Caddy reverse proxy for *.ops.eblu.me with ACME DNS-01 TLS via Gandi
  • Deploy CloudNativePG operator and blumeops-pg PostgreSQL cluster in Kubernetes
  • Migrate Grafana from Homebrew to Kubernetes via Helm chart
  • Migrate Kiwix to Kubernetes with torrent-sync sidecar and ZIM watcher CronJob
  • Migrate Loki to Kubernetes StatefulSet with 50Gi PVC
  • Migrate Miniflux from Homebrew to Kubernetes with CloudNativePG database
  • Set up Minikube single-node Kubernetes cluster on indri with Tailscale API access
  • Migrate minikube from podman to docker driver for better stability and NFS support
  • Manage Prometheus configuration via Ansible
  • Migrate Prometheus to Kubernetes StatefulSet with 50Gi PVC
  • Set up Pulumi for Tailnet ACL management with OAuth authentication
  • Migrate Transmission to Kubernetes with NFS storage from Sifaka
  • Migrate Zot registry from Tailscale serve to Caddy reverse proxy at registry.ops.eblu.me
  • Integrate Zot as minikube registry mirror for all image pulls