Details
New Feature
Status: New
Resolution: Unresolved
Description
It would be useful to have a tool that validates the health of an installation.
For example:
1) Are components using sensible storage (e.g. is Dali using fast storage)?
2) Are components well distributed over the available nodes and/or sufficiently isolated?
3) Do components have reasonable resource specifications in relation to the nodes/K8s cluster they are installed on?
4) Is a postmortem plane set up?
5) Is a log stack enabled/installed?
6) Is the system secure: have certificates been enabled, is encryption enabled?
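As a rough illustration of what checks 2) and 3) might look like, here is a minimal Python sketch. It works on plain dicts shaped like Kubernetes Pod objects (metadata.labels, spec.nodeName, spec.containers[].resources.requests). The function names, the choice of the "app.kubernetes.io/name" label to identify a component, and the rule "all replicas on one node is a warning" are assumptions made for this example, not something the chart is known to guarantee.

```python
from collections import defaultdict

# Assumption: components are identified by the recommended Kubernetes label
# below; the real chart may use a different label.
COMPONENT_LABEL = "app.kubernetes.io/name"


def check_distribution(pods):
    """Report components whose pods (more than one) all sit on a single node."""
    nodes_by_component = defaultdict(list)
    for pod in pods:
        labels = pod.get("metadata", {}).get("labels") or {}
        component = labels.get(COMPONENT_LABEL, "unknown")
        nodes_by_component[component].append(pod.get("spec", {}).get("nodeName"))

    findings = []
    for component, nodes in sorted(nodes_by_component.items()):
        if len(nodes) > 1 and len(set(nodes)) == 1:
            findings.append(
                f"{component}: all {len(nodes)} pods run on node {nodes[0]}"
            )
    return findings


def check_resource_requests(pods):
    """Report containers that do not declare both cpu and memory requests."""
    findings = []
    for pod in pods:
        pod_name = pod.get("metadata", {}).get("name", "<unnamed>")
        for container in pod.get("spec", {}).get("containers", []):
            requests = (container.get("resources") or {}).get("requests") or {}
            missing = [res for res in ("cpu", "memory") if res not in requests]
            if missing:
                findings.append(
                    f"{pod_name}/{container.get('name')}: "
                    f"no request for {', '.join(missing)}"
                )
    return findings
```

Each check returns a list of human-readable findings, so new checks can be added one at a time without changing the others.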
I'm sure there are many other things such a tool could check for; we'll want to take on only one or two of them at first and extend from there.
I think this would ideally run after helm install has completed.
It could be a separate tool that interacts with the K8s API and is run either by terraform or on demand by admins.
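One possible shape for such a standalone tool, sketched in Python: it reads the JSON produced by `kubectl get pods -o json` (a list object with an "items" array), runs a list of check functions, prints the findings, and returns a non-zero exit code so that an automated caller can detect a failure. The names run_checks, main and CHECKS are hypothetical, and check_unscheduled is only a placeholder check used to keep the example small; it is not one of the checks proposed above.

```python
import json
import sys


def check_unscheduled(pods):
    """Placeholder check: report pods that have no node assigned."""
    return [
        f"{pod.get('metadata', {}).get('name', '<unnamed>')}: not scheduled on any node"
        for pod in pods
        if not pod.get("spec", {}).get("nodeName")
    ]


# Hypothetical registry; real checks would be appended here as they are written.
CHECKS = [check_unscheduled]


def run_checks(pod_list_json, checks=CHECKS):
    """Parse `kubectl get pods -o json` output and collect findings from all checks."""
    pods = json.loads(pod_list_json).get("items", [])
    findings = []
    for check in checks:
        findings.extend(check(pods))
    return findings


def main(stream=sys.stdin):
    """Print one line per finding and return 1 if any were found, else 0."""
    findings = run_checks(stream.read())
    for finding in findings:
        print(f"WARN {finding}")
    return 1 if findings else 0

# When used as a script, the entry point would be: sys.exit(main())
```

An admin could then run something like `kubectl get pods -n <namespace> -o json | python healthcheck.py` on demand (the script name is made up here), and terraform could invoke the same command, for instance from a local-exec provisioner, and fail the run on a non-zero exit code. Talking to the K8s API directly through a client library instead of kubectl would be an equally valid design.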