Commit Graph

27 Commits

Author SHA1 Message Date
d29651af7a REFACTOR(repo): remove control-plane scheduling
- Remove nodeSelector for control-plane node
- Remove tolerations for control-plane taint
- Allow pods to schedule on any available node
2026-01-10 18:35:15 +09:00
5acc1c7f9e PERF(security): adjust resources based on VPA
- Update authelia memory 256Mi→194Mi
- Update authelia redis cpu 10m→23m, memory 64Mi→100Mi
- Update falco memory 263Mi→283Mi
- Update falcosidekick cpu 10m→15m, memory 128Mi→100Mi
- Update external-secrets operator cpu 5m→15m, memory 128Mi→100Mi
- Update external-secrets webhook cpu 2m→15m, memory 128Mi→100Mi
- Update external-secrets certController cpu 2m→15m, memory 256Mi→283Mi
- Update vault cpu 35m→49m, memory 263Mi→175Mi
2026-01-10 14:32:33 +09:00
736205e464 PERF(falco): reduce sidekick replicas to 1
- Reduce falcosidekick replicas from 2 to 1
- DaemonSet tolerations kept for all-node coverage
2026-01-10 13:15:56 +09:00
74d29aabfc CHORE(resources): set memory limits equal to memory requests
- Align memory limits with memory requests for guaranteed QoS class
- falco: falcosidekick
- external-secrets: main, webhook, certController
- authelia: main, redis
2026-01-10 01:16:56 +09:00
739ac544c7 REFACTOR(repo): standardize taint to control-plane
- Remove deprecated master taint from falco
- Update vault tolerations to control-plane
- Change effect from NoExecute to NoSchedule
2026-01-09 21:45:16 +09:00
cbf00275e8 FEAT(security): enable HA with replica 2 and soft anti-affinity
- Add replicaCount: 2 to authelia, external-secrets, falco
- Add soft pod anti-affinity for node distribution
- Configure affinity for all security components
2026-01-08 13:07:56 +09:00
c24313154d FIX(security): remove CPU limits from falco and trivy
- falco: set cpu: null to disable chart default (1 core)
- trivy: set cpu: null for operator and scan jobs (500m default)
2026-01-08 00:33:13 +09:00
31007c5586 PERF(resources): remove CPU limits - keep memory limits only
- CPU throttling prevents app startup, not crashes
- Memory OOM is the real cascading failure cause
- CPU request ensures fair scheduling
2026-01-07 23:48:43 +09:00
c51cca27d8 CHORE(falco): disable sidekick-ui and Redis
- Use Grafana dashboard instead
- Set webui.enabled: false (disables UI and Redis)
- Remove ingress.yaml for falco-ui
- Saves ~384Mi memory (Redis 256Mi + UI 128Mi)
2026-01-05 00:40:26 +09:00
c66801a166 FEAT(falco): add loki output to falcosidekick
- Send Falco events directly to Loki
- Enables viewing detailed events in Grafana with all fields
- Same data as Falco UI but queryable in Grafana
2026-01-05 00:40:26 +09:00
76c5fd8343 FIX(falco): use SM create instead of enabled
- Falco chart uses 'serviceMonitor.create' not 'enabled'
- Add release: prometheus label for Prometheus discovery
2026-01-05 00:40:26 +09:00
d4b84305a2 FIX(redis): use customConfig for maxmemory
- extraArgs was not being applied to Redis container
- Use customConfig which is the correct way to set Redis directives
- maxmemory 800mb with allkeys-lru policy
2026-01-05 00:40:26 +09:00
94dcb7d585 FEAT(falco): add 6h TTL to sidekick-ui
- to prevent Redis OOM
- Events older than 6 hours are auto-deleted
- Prevents linear memory growth in Redis
2026-01-05 00:40:26 +09:00
1cd89f6bae REFACTOR(falco): remove CPU limit
- Set cpu: null to override chart default (1 core)
- Prevents CPU throttling under high load
2026-01-05 00:40:26 +09:00
c67b720ee4 FIX(falco): falco oom issues
- increase memory limits
- Falco: add 512Mi memory limit
- Falcosidekick: increase memory limit 256Mi -> 512Mi
- Redis: increase memory limit 512Mi -> 1Gi (was 84 restarts)
- Redis: increase maxmemory 400mb -> 800mb
2026-01-05 00:40:26 +09:00
ede767498d PERF(redis): increase Redis memory limit to 512Mi
- Increase memory limit to prevent OOM
- Optimize Redis configuration
2026-01-05 00:40:26 +09:00
1a551b47ca PERF(falco): optimize falco rules
- and add sidekick memory limit
- Add macros to exclude trivy, postgres, minio, vault from rules
- Disable Container Drift Detection (too noisy)
- Remove /etc/passwd from sensitive file access (normal lookups)
- Add 256Mi memory limit to falcosidekick (was using 1.1GB)
2026-01-05 00:40:26 +09:00
4d4ecb13d6 FIX(falco): add NoExecute tolerations
- and enable Redis persistence
- Add NoExecute tolerations for master/control-plane nodes to run Falco
  DaemonSet on all nodes
- Enable Redis storage to persist index data across pod restarts
2026-01-04 23:41:39 +09:00
7de57fc936 CHORE(authelia): disable falco-ui basic auth
- Use Authelia SSO instead
- Remove basic auth configuration
2026-01-04 23:41:39 +09:00
2a4d84a0bc CHORE(deps): upgrade Falco to 0.40.0
- Upgrade for kernel 6.14 support
- Apply dependency updates
2026-01-04 23:41:39 +09:00
5f197a607b FIX(falco): falco config errors
- Remove unsupported outputs_queue_capacity option
- Fix Container Drift Detection rule (remove undefined rename macro)
2026-01-04 23:41:39 +09:00
765104bb4e REFACTOR(authelia): remove falco-ui-secret
- Use Authelia SSO instead
- Remove basic auth secret
2026-01-04 23:41:39 +09:00
87b16d13e3 FEAT(falco): configure falco redis
- with 200mb maxmemory and lru eviction
2026-01-04 23:41:39 +09:00
27d1e5c4b1 FIX(falco): re-enable falco webui
- with redis memory limit 128mi
2026-01-04 23:41:39 +09:00
26e40d234a CHORE(falco): disable sidekick web ui
- to save 535mb redis memory
2026-01-04 23:41:39 +09:00
368f7b5f5a PERF(falco): reduce falcosidekick replicas to 1
- Scale down to single replica
- Reduce resource usage
2026-01-04 23:41:39 +09:00
34a1c9f783 REFACTOR(repo): restructure infra folder structure
- Remove argocd/, helm-values/, ingress/ subdirectories
- Move files to parent directory with standardized names
- Add namespace.yaml to all apps with Goldilocks labels
- Preserve vault/ subdirectories (falco, velero)
- Update main kustomization.yaml to reference argocd.yaml files directly
- Comment out argocd.yaml in each app's kustomization.yaml to prevent
  circular reference

Applications restructured:
- cert-manager (2 ArgoCD apps)
- external-secrets
- reloader
- vault (2 ArgoCD apps)
- velero (2 ArgoCD apps)
- falco
- cnpg
- haproxy
- metallb
- vpa
- argocd
2026-01-04 23:41:39 +09:00