  1. Aug 16, 2021
  2. Aug 11, 2021
  3. Aug 02, 2021
  4. Jul 27, 2021
  5. Jul 26, 2021
  6. Jul 22, 2021
  7. Jul 20, 2021
    • jsonnet: Drop cAdvisor metrics with no (pod, namespace) labels while preserving ability to monitor system services resource usage · 463ad065
      Philip Gough authored
      
      The following provides a description and cardinality estimation based on the tests in a local cluster:
      
      container_blkio_device_usage_total - useful for containers, but not for system services (nodes*disks*services*operations*2)
      container_fs_.*                    - add filesystem read/write data (nodes*disks*services*4)
      container_file_descriptors         - file descriptors limits and global numbers are exposed via (nodes*services)
      container_threads_max              - max number of threads in cgroup. Usually for system services it is not limited (nodes*services)
      container_threads                  - used threads in cgroup. Usually not important for system services (nodes*services)
      container_sockets                  - used sockets in cgroup. Usually not important for system services (nodes*services)
      container_start_time_seconds       - container start. Possibly not needed for system services (nodes*services)
      container_last_seen                - Not needed as system services are always running (nodes*services)
      container_spec_.*                  - Everything related to cgroup specification and thus static data (nodes*services*5)
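The cardinality formulas above can be turned into a quick estimate. The following is a minimal sketch, not part of the commit: the function name `estimate_dropped_series` and the example cluster sizes are hypothetical, and only the per-metric formulas are taken from the list above.

```python
def estimate_dropped_series(nodes, disks, services, operations):
    """Estimate series per metric using the formulas from the commit message.

    All four inputs are assumed counts for a hypothetical cluster.
    """
    per_service = nodes * services
    return {
        "container_blkio_device_usage_total": nodes * disks * services * operations * 2,
        "container_fs_.*": nodes * disks * services * 4,
        "container_file_descriptors": per_service,
        "container_threads_max": per_service,
        "container_threads": per_service,
        "container_sockets": per_service,
        "container_start_time_seconds": per_service,
        "container_last_seen": per_service,
        "container_spec_.*": per_service * 5,
    }


if __name__ == "__main__":
    # Hypothetical cluster: 3 nodes, 2 disks, 10 system services, 4 blkio operations.
    estimate = estimate_dropped_series(nodes=3, disks=2, services=10, operations=4)
    for metric, count in estimate.items():
        print(f"{metric:<36} {count}")
    print("total", sum(estimate.values()))
```

Because every term scales with `nodes * services`, the number of dropped series grows linearly with cluster size, which is why these metrics are worth dropping for system services.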
    • manifests: regenerate · 02454b3f
      paulfantom authored
  8. Jul 19, 2021
  9. Jul 14, 2021
  10. Jul 06, 2021
  11. Jul 05, 2021
  12. Jul 01, 2021
  13. Jun 23, 2021
  14. Jun 22, 2021
  15. Jun 18, 2021
  16. Jun 14, 2021
  17. Jun 12, 2021
  18. Jun 11, 2021
  19. Jun 10, 2021
  20. Jun 08, 2021
  21. May 27, 2021
  22. May 26, 2021
  23. May 25, 2021
  24. May 05, 2021
  25. Apr 27, 2021
  26. Apr 22, 2021
    • jsonnet: make prometheus-adapter highly-available · 4c6a06cf
      Damien Grisonnet authored
      
      Prometheus-adapter is a component of the monitoring stack that in most
      cases needs to be highly available. For instance, we most likely
      always want the autoscaling pipeline to be available, and we also want to
      avoid having no available backends serving the metrics API apiservices,
      as that would result in both the AggregatedAPIDown alert firing and the
      kubectl top command no longer working.
      
      In order to make the adapter highly available, we need to increase its
      replica count to 2 and come up with a rolling update strategy and a
      pod anti-affinity rule based on the kubernetes hostname to prevent the
      adapters from being scheduled on the same node. The default rolling update
      strategy for deployments isn't enough, as the default maxUnavailable value
      is 25%, which is rounded down to 0. This means that during rolling updates
      scheduling will fail if there aren't more nodes than the number of
      replicas. As for maxSurge, the default should be fine as it is
      rounded up to 1, but for clarity it might be better to just set it to 1.
      For the pod anti-affinity constraint, a hard rule would be best,
      but a soft one should be good enough and fit most use cases.
      
      Signed-off-by: Damien Grisonnet <dgrisonn@redhat.com>
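The rounding described in the commit message can be checked with a little arithmetic. This is a minimal sketch, assuming the Kubernetes convention that a percentage maxUnavailable is rounded down and a percentage maxSurge is rounded up; the function name `resolve_rolling_update` is hypothetical and does not come from the commit or from Kubernetes itself.

```python
def resolve_rolling_update(replicas, max_unavailable_pct=25, max_surge_pct=25):
    """Resolve percentage-based rolling update settings to pod counts.

    maxUnavailable is rounded down and maxSurge is rounded up, using integer
    arithmetic to avoid floating-point surprises.
    """
    max_unavailable = (replicas * max_unavailable_pct) // 100   # floor
    max_surge = -((-replicas * max_surge_pct) // 100)           # ceiling
    return max_unavailable, max_surge


if __name__ == "__main__":
    for replicas in (1, 2, 4):
        unavailable, surge = resolve_rolling_update(replicas)
        print(f"replicas={replicas} maxUnavailable={unavailable} maxSurge={surge}")
```

With 2 replicas the defaults resolve to maxUnavailable 0 and maxSurge 1, so a rollout must schedule a third pod before removing an old one. That is the situation the commit message addresses by setting the rolling update strategy explicitly.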
  27. Apr 21, 2021
  28. Apr 20, 2021
  29. Apr 16, 2021
  30. Apr 14, 2021
  31. Apr 09, 2021