Comments (3)
Can you describe the exact metrics from kube-state that you are using for monitoring and expecting to match Karpenter's metrics?
from karpenter-provider-aws.
On the left side, the kube-state-metrics dashboard shows that node ip-10-4-105-63.ec2.internal (16GB assigned) is at around 18.39% RAM usage, about 3GB of memory in use.
On the right side, the Node Summary shows the same node, ip-10-4-105-63.ec2.internal, at 85.8% memory utilization, which does not match.
Grafana expression:
((karpenter_nodes_total_daemon_requests{resource_type="memory",arch="$arch",capacity_type="$capacity_type",instance_type="$instance_type",nodepool="$nodepool",zone="$zone"} or karpenter_nodes_allocatable*0) +
(karpenter_nodes_total_pod_requests{resource_type="memory",arch="$arch",capacity_type="$capacity_type",instance_type="$instance_type",nodepool="$nodepool",zone=~"$zone"} or karpenter_nodes_allocatable*0)) /
karpenter_nodes_allocatable{resource_type="memory",arch="$arch",capacity_type="$capacity_type",instance_type="$instance_type",nodepool="$nodepool",zone="$zone", cluster="$cluster"}
I tried the expression karpenter_nodes_total_daemon_requests + karpenter_nodes_total_pod_requests with the proper filters, and the result is far higher than the usage shown by kube-state-metrics. I'm not sure whether others have seen a similar issue.
I think I had the wrong impression of these metrics...
They report the total memory requested by daemonsets and pods on the Karpenter node; they do not indicate current memory usage.
Reference:
karpenter-provider-aws/designs/metrics.md
Lines 26 to 37 in d77df0d
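To make the distinction above concrete, here is a minimal Python sketch of the two ratios being compared. The function names and all byte values are hypothetical, chosen only to land near the percentages reported in this thread; they are not taken from the actual node.

```python
GIB = 1024 ** 3


def request_utilization(daemon_requests, pod_requests, allocatable):
    """Fraction of allocatable memory reserved by requests.

    Mirrors the shape of the Grafana expression above:
    (daemon requests + pod requests) / allocatable.
    """
    return (daemon_requests + pod_requests) / allocatable


def usage_utilization(used, capacity):
    """Fraction of node memory actually in use right now."""
    return used / capacity


# Hypothetical values for a 16 GiB node (assumptions, not measured data).
capacity = 16 * GIB
allocatable = 15 * GIB      # capacity minus system/kubelet reservations
daemon_requests = 1 * GIB   # sum of daemonset pod memory requests
pod_requests = 12 * GIB     # sum of other pod memory requests
used = 3 * GIB              # memory the workloads actually consume

requested_pct = 100 * request_utilization(daemon_requests, pod_requests, allocatable)
used_pct = 100 * usage_utilization(used, capacity)

print(f"requested: {requested_pct:.1f}% of allocatable")
print(f"in use:    {used_pct:.1f}% of capacity")
```

With numbers like these, pods that request far more memory than they consume produce a high request-based percentage next to a low usage-based one, so the two dashboards are not expected to agree.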