- Aug 11, 2021
-
-
Dimitrije Manic authored
-
- Aug 02, 2021
-
-
dgrisonnet authored
-
- Jul 27, 2021
-
-
Prashant Balachandran authored
-
lanmarti authored
-
- Jul 26, 2021
-
-
paulfantom authored
-
- Jul 22, 2021
-
-
Manuel Rüger authored
Signed-off-by:
Manuel Rüger <manuel@rueg.eu>
-
paulfantom authored
-
- Jul 20, 2021
-
-
Philip Gough authored
jsonnet: Drop cAdvisor metrics with no (pod, namespace) labels while preserving ability to monitor system services resource usage The following provides a description and cardinality estimation based on the tests in a local cluster: container_blkio_device_usage_total - useful for containers, but not for system services (nodes*disks*services*operations*2) container_fs_.* - add filesystem read/write data (nodes*disks*services*4) container_file_descriptors - file descriptors limits and global numbers are exposed via (nodes*services) container_threads_max - max number of threads in cgroup. Usually for system services it is not limited (nodes*services) container_threads - used threads in cgroup. Usually not important for system services (nodes*services) container_sockets - used sockets in cgroup. Usually not important for system services (nodes*services) container_start_time_seconds - container start. Possibly not needed for system services (nodes*services) container_last_seen - Not needed as system services are always running (nodes*services) container_spec_.* - Everything related to cgroup specification and thus static data (nodes*services*5)
-
paulfantom authored
-
- Jul 19, 2021
-
-
paulfantom authored
-
- Jul 14, 2021
-
- Jul 06, 2021
-
-
Philip authored
-
- Jul 05, 2021
-
-
Damien Grisonnet authored
Signed-off-by:
Damien Grisonnet <dgrisonn@redhat.com>
-
Damien Grisonnet authored
Previously, prometheus-adapter configuration wasn't taking into account the scrape interval of kubelet, node-exporter and windows-exporter leading to getting non fresh results, and even negative results from the CPU queries when the irate() function was extrapolating data. To fix that, we want to set the interval used in the irate() function in the CPU queries to 4x scrape interval in order to extrapolate data between the last two scrapes. This will improve the freshness of the cpu usage exposed and prevent incorrect extrapolations. Signed-off-by:
Damien Grisonnet <dgrisonn@redhat.com>
-
- Jul 01, 2021
-
-
Sunil Thaha authored
With OVN, the container veth network interface names that used to start with `veth` has now changed to `<rand-hex>{15}@if<number>`(see Related Links below). This patch adapts to the new change introduced in ovn and ignores the network interfaces that match `[a-z0-9]{15}@if\d+` in addition to those starting with `veth` Related Links: - https://github.com/openshift/ovn-kubernetes/blob/master/go-controller/vendor/github.com/containernetworking/plugins/pkg/ip/link_linux.go#L107 - https://github.com/openshift/ovn-kubernetes/blob/master/go-controller/pkg/cni/helper_linux.go#L148 Signed-off-by:
Sunil Thaha <sthaha@redhat.com>
-
- Jun 23, 2021
-
-
paulfantom authored
-
- Jun 22, 2021
-
-
fpetkovski authored
This commit updates all downstream dependencies Signed-off-by:
fpetkovski <filip.petkovsky@gmail.com>
-
fpetkovski authored
Running sslscan against the prometheus adapter secure port reports two insecure SSL ciphers, ECDHE-RSA-DES-CBC3-SHA and DES-CBC3-SHA. This commit removes those ciphers from the list. Signed-off-by:
fpetkovski <filip.petkovsky@gmail.com>
-
Philip Gough authored
-
- Jun 18, 2021
-
-
paulfantom authored
-
- Jun 14, 2021
-
-
paulfantom authored
-
- Jun 12, 2021
-
-
Prashant Balachandran authored
adding changes from kube-mixin
-
- Jun 11, 2021
-
-
paulfantom authored
-
paulfantom authored
-
- Jun 10, 2021
-
-
paulfantom authored
-
- Jun 08, 2021
-
-
ArthurSens authored
Signed-off-by:
ArthurSens <arthursens2005@gmail.com>
-
- May 27, 2021
-
-
Prem Saraswat authored
-
paulfantom authored
-
- May 26, 2021
-
-
paulfantom authored
-
- May 25, 2021
-
-
paulfantom authored
-
- May 05, 2021
-
-
Damien Grisonnet authored
Adding a PodDisruptionBudget to prometheus-adapter ensure that at least one replica of the adapter is always available. This make sure that even during disruption the aggregated API is available and thus does not impact the availability of the apiserver. Signed-off-by:
Damien Grisonnet <dgrisonn@redhat.com>
-
paulfantom authored
-
- Apr 27, 2021
-
-
paulfantom authored
Signed-off-by:
paulfantom <pawel@krupa.net.pl>
-
- Apr 22, 2021
-
-
Damien Grisonnet authored
Prometheus-adapter is a component of the monitoring stack that in most cases require to be highly available. For instance, we most likely always want the autoscaling pipeline to be available and we also want to avoid having no available backends serving the metrics API apiservices has it would result in both the AggregatedAPIDown alert firing and the kubectl top command not working anymore. In order to make the adapter highly-avaible, we need to increase its replica count to 2 and come up with a rolling update strategy and a pod anti-affinity rule based on the kubernetes hostname to prevent the adapters to be scheduled on the same node. The default rolling update strategy for deployments isn't enough as the default maxUnavaible value is 25% and is rounded down to 0. This means that during rolling-updates scheduling will fail if there isn't more nodes than the number of replicas. As for the maxSurge, the default should be fine as it is rounded up to 1, but for clarity it might be better to just set it to 1. For the pod anti-affinity constraints, it would be best if it was hard, but having it soft should be good enough and fit most use-cases. Signed-off-by:
Damien Grisonnet <dgrisonn@redhat.com>
-
- Apr 21, 2021
-
-
paulfantom authored
-
- Apr 20, 2021
-
-
Jan Fajerski authored
Signed-off-by:
Jan Fajerski <jfajersk@redhat.com>
-
- Apr 16, 2021
-
-
ArthurSens authored
Signed-off-by:
ArthurSens <arthursens2005@gmail.com>
-
- Apr 14, 2021
-
-
Kristijan Sedlak authored
-
- Apr 09, 2021
-
-
Jan Fajerski authored
Signed-off-by:
Jan Fajerski <jfajersk@redhat.com>
-
- Mar 29, 2021
-
-
mansikulkarni96 authored
This commit includes windows_exporter metrics in the node queries for the prometheus adapter configuration. This will help obtain the resource metrics: memory and CPU for Windows nodes. This change will also help in displaying metrics reported through the 'kubectl top' command which currently reports 'unknown' status for Windows nodes.
-