6da4eba1dc
CHORE(grafana): remove admin login secret for SSO
...
- Remove grafana-admin-password ExternalSecret
- Remove admin section from helm-values.yaml
- Authentication handled by Authelia SSO middleware
2026-01-09 21:41:52 +09:00
2cf35d0f76
FEAT(loki): configure storage and HA
...
- Rename extraVolume to avoid duplicate name
- Add emptyDir for /var/loki cache
- Migrate to shared storage with MinIO
- Configure HA with 2 replicas
- Revert to single replica for Single Binary mode
2026-01-09 21:41:52 +09:00
2b7ee1fe51
FEAT(loki): configure storage and HA
...
- Rename extraVolume to avoid duplicate name
- Add emptyDir for /var/loki cache
- Migrate to shared storage with MinIO
- Configure HA with 2 replicas
- Revert to single replica for Single Binary mode
2026-01-09 21:41:52 +09:00
4286296591
PERF(resources): remove CPU limits - keep memory limits only
...
- CPU throttling prevents app startup, not crashes
- Memory OOM is the real cascading failure cause
- CPU request ensures fair scheduling
2026-01-07 23:48:35 +09:00
69dc3b34be
REFACTOR(secrets): flatten Vault paths
...
- Change secret paths from <category>/<app> to <app>
- monitoring/alertmanager → alertmanager
- monitoring/grafana → grafana
- databases/postgresql → postgresql
2026-01-06 16:52:58 +09:00
7888aeff36
REFACTOR(repo): move vault/ to manifests/
...
- Move ExternalSecret files from vault/ to manifests/secret.yaml
- Update kustomization.yaml references
- Remove vault/ folders
Apps: alertmanager, grafana, prometheus
2026-01-06 16:42:33 +09:00
7b9abaf9c8
REFACTOR(obs): integrate ingress to helm-values
...
- alertmanager: move ingress to karma inline, servicemonitor to manifests
- goldilocks: move ingress to helm-values
- grafana: move ingress to helm-values
- uptime-kuma: move ingress to helm-values
2026-01-06 01:57:03 +09:00
28ba50d1a3
REFACTOR(repo): observability repo structure
...
- Add application.yaml for ArgoCD app-of-apps
- Add kustomization.yaml with observability components
- Add renovate.json for automated updates
- Update all component argocd.yaml repoURLs to observability repo
Components: prometheus, alertmanager, grafana, loki, promtail,
node-exporter, kube-state-metrics, goldilocks, uptime-kuma, vpa
2026-01-05 00:40:01 +09:00
8dcb563ae4
REFACTOR(grafana): change Grafana storageClass
...
- Update storageClass to local-path
- Change storage backend configuration
2026-01-05 00:40:01 +09:00
c472035499
FEAT(grafana): add Grafana monitoring
...
- Add Grafana monitoring configuration
- Enable metrics collection
2026-01-05 00:40:01 +09:00
7653f2c4c8
FIX(grafana): fix storageClass key name
...
- Correct storageclass key spelling
- Fix Helm values configuration
2026-01-05 00:40:01 +09:00
aad4c249e2
CHORE(grafana): disable auto dashboard provision
...
- Use manual import instead of automatic provisioning
- Remove configMapGenerator for dashboards
- Remove sidecar and dashboards helm config
- Keep JSON files in dashboards/ for manual import reference
2026-01-05 00:40:01 +09:00
ababd677d4
FEAT(repo): enable ServerSideApply
...
- Enable ServerSideApply to handle large configmaps
- Fix resource management issues
2026-01-05 00:40:01 +09:00
9583be9b46
FEAT(grafana): export dashboards
...
- to JSON and use sidecar ConfigMaps
- Export 14 dashboards to JSON files
- Use kustomize configMapGenerator for dashboard ConfigMaps
- Enable Grafana sidecar to load dashboards from ConfigMaps
- Keep Longhorn and Traefik Official from grafana.com
2026-01-05 00:40:01 +09:00
c356493707
FEAT(alertmanager): add datasource to Grafana
...
- Add Alertmanager datasource configuration
- Enable alert visualization
2026-01-05 00:40:01 +09:00
200c6e97ae
REFACTOR(repo): migrate repoURL to K3S-HOME
...
- Update repository URL to K3S-HOME organization
- Change from personal to organization repo
2026-01-05 00:40:01 +09:00
renovate[bot]
1e1cde4cd9
CHORE(deps): update alertmanager to v1.30.0
...
- Upgrade Alertmanager chart version
- Apply dependency updates
2026-01-05 00:40:01 +09:00
939ae13c5d
CHORE(grafana): disable local auth, add SSO
...
- Enable anonymous auth with Admin role
- Disable login form
- Add Authelia middleware to ingress
2026-01-05 00:40:01 +09:00
e4b477a510
REFACTOR(longhorn): migrate to local-path
...
- alertmanager, grafana, loki, prometheus: storageClass -> local-path-retain
- Change storage backend configuration
2026-01-05 00:40:01 +09:00
823b2ba495
REFACTOR(repo): remove global panel from Grafana
...
- Remove global panel configuration
- Clean up dashboard settings
2026-01-05 00:40:01 +09:00
f6ceb50503
REFACTOR(grafana): remove dashboard 15757
...
- Remove Windows-specific queries dashboard
- Clean up unused dashboards
2026-01-05 00:40:01 +09:00
0617611d22
FIX(grafana): restore dashboard 15757
...
- Restore Kubernetes Global with CPU Real dashboard
- Re-enable monitoring visualization
2026-01-05 00:40:01 +09:00
685563b92c
REFACTOR(grafana): remove duplicated Dashboard
...
- Remove duplicate Grafana dashboard
- Clean up configuration
2026-01-05 00:40:01 +09:00
ebc5af24ef
FEAT(repo): add Grafana Global panel
...
- Add global panel to Grafana dashboard
- Enable overview visualization
2026-01-05 00:40:01 +09:00
d0fc55d403
FEAT(grafana): add uid to Grafana datasources
...
- for dashboard compatibi...
2026-01-05 00:40:01 +09:00
912b3aa38f
REFACTOR(minio): remove minio dashboard
...
- using manually imported one
2026-01-04 23:38:05 +09:00
8e964afe42
FEAT(grafana): add grafana dashboards
...
- for cluster monitoring
2026-01-04 23:38:05 +09:00
b3ad6338ac
FIX(prometheus): grafana prometheus datasource
...
- url with full namespace
2026-01-04 23:38:05 +09:00
a30dbf138f
REFACTOR(traefik): switch ingress to Traefik
...
- Update ingressClassName from haproxy to traefik
- Remove haproxy.org annotations
2026-01-04 23:38:05 +09:00
0cb7438d79
CHORE(external-secrets): update ESO API version from v1beta1 to v1
...
- Update ExternalSecret API version
- Migrate to stable API
2026-01-04 23:38:05 +09:00
ea4152a0d6
REFACTOR(gitea): migrate repoURL from Gitea
...
- to GitHub
2026-01-04 23:38:05 +09:00
5ec1a3323d
REFACTOR(goldilocks): use managedNamespaceMetad...
...
- Remove namespace.yaml files
- Add managedNamespaceMetadata with Goldilocks label
- Set CreateNamespace=true in syncOptions
- Update kustomization.yaml to remove namespace.yaml references
2026-01-04 23:38:05 +09:00
bbf6fa5001
CHORE(repo): clean kustomization files
...
- Remove unused entries from kustomization
- Clean up configuration
2026-01-04 23:38:05 +09:00
7d4f5ff86c
REFACTOR(grafana): change logging path in Grafana
...
- Update logging path configuration
- Fix log collection settings
2026-01-04 23:38:05 +09:00
a2e0ef4477
REFACTOR(prometheus): remove path sources
...
- from grafana and prometheus...
- Remove 'path: grafana' source from grafana Application
- Remove 'path: prometheus' source from prometheus Application
- ExternalSecret and Ingress will be managed manually via kubectl apply
-k
- Fixes circular dependency issue causing Progressing state
2026-01-04 23:38:05 +09:00
4e0f27192f
FIX(prometheus): namespaces in grafana
...
- and prometheus resources
- Update grafana ExternalSecret namespace: monitoring → grafana
- Update grafana Ingress namespace: monitoring → grafana
- Update prometheus ExternalSecret namespace: monitoring → prometheus
- Aligns with per-app namespace strategy
2026-01-04 23:38:05 +09:00
2309254fc9
FIX(repo): circular reference in app kustomizes
...
- Comment out argocd.yaml in all app kustomization.yaml files
- Prevents circular reference when apps have 'path:' source (grafana,
prometheus)
- ArgoCD Applications are managed manually, not via kustomize
2026-01-04 23:38:05 +09:00
b4ec13618a
REFACTOR(repo): to independent app management
...
- pattern
- monitoring/kustomization.yaml now only manages application.yaml (App
of Apps)
- Each app independently manages its own ArgoCD Application via
kustomization.yaml
- Apps are fully self-contained: argocd.yaml, namespace.yaml, and app-
specific resources
- Cleaner separation: no central app list to maintain
2026-01-04 23:38:05 +09:00
078850f77a
FIX(argocd): sharedresourcewarning by referencing
...
- argocd.yaml files d...
- Change monitoring/kustomization.yaml to reference argocd.yaml files
instead of folders
- Comment out argocd.yaml in each app's kustomization.yaml
- Matches applications folder pattern to avoid resource conflicts
2026-01-04 23:38:05 +09:00
6dec7e0a46
REFACTOR(argocd): monitoring apps
...
- to self-manage ArgoCD Applications
- Each app now includes its own argocd.yaml in kustomization.yaml
- Main monitoring/kustomization.yaml references app folders instead of
individual argocd.yaml files
- Better separation of concerns - each app is self-contained and
independently managed
2026-01-04 23:38:05 +09:00
5c4676ca9a
REFACTOR(repo): restructure monitoring folder
...
- and add namespace resou...
- Remove argocd/, helm-values/, ingress/ subdirectories
- Move files to parent directory (argocd.yaml, helm-values.yaml,
ingress.yaml)
- Update helm valueFiles paths in ArgoCD Applications
- Add namespace.yaml to all applications with Goldilocks labels
- Update destination namespaces to match folder names
- Update kustomization.yaml files to reference new structure
2026-01-04 23:38:05 +09:00
1bf40d431b
REVERT(grafana): grafana to local-path
...
- storageclass
Due to storage constraints, reverting from longhorn to local-path.
Only Loki, Alertmanager, and Gitea remain on longhorn.
2026-01-04 23:38:05 +09:00
1a2f15c468
REFACTOR(longhorn): migrate monitoring PVCs
...
- from local-path to Longhorn
- Grafana: 2Gi (replica=3)
- Loki: 10Gi (replica=3)
- Alertmanager: 1Gi (replica=3)
- Prometheus: 5Gi (replica=3)
- Use dedicated 50GB Longhorn storage on each node
2026-01-04 23:38:05 +09:00
7d41461e2e
FEAT(grafana): add per-application ingress
...
- management for Grafana
- Added ingress.yaml for Grafana
- Updated kustomization file to include ingress resource
- Migrated from centralized ingress management to per-app architecture
2025-12-25 20:21:28 +09:00
a11a9ab329
CHORE(argocd): update ArgoCD applications
...
- to point to monitoring repo...
2025-12-17 15:12:56 +09:00
baee94b69d
INIT(repo): monitoring stack setup
2025-12-17 15:06:58 +09:00