Comments (10)
/sig node
?
from kubernetes.
/kind failing-test
from kubernetes.
/triage accepted
This seems to be one of the causes of eviction test flakiness.
I've seen it mostly in disk stats.
from kubernetes.
One idea is to fall back to QoS when stats gathering fails: BestEffort pods get evicted before Burstable, and Burstable before Guaranteed. That would give us a bit more control over eviction when stats are missing.
One interesting caveat is that this doesn't help disk-based eviction, since ephemeral-storage is never considered guaranteed (Guaranteed applies only to cpu/memory).
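A rough sketch of the QoS fallback ordering described above. The `Pod` and `QOSClass` types and the `qosRank` and `rankByQOS` helpers are hypothetical stand-ins for illustration, not the kubelet's eviction manager code; only the three QoS class names come from Kubernetes.

```go
package main

import (
	"fmt"
	"sort"
)

// QOSClass is a stand-in type for this sketch (assumption); the values
// match the three Kubernetes QoS class names.
type QOSClass string

const (
	BestEffort QOSClass = "BestEffort"
	Burstable  QOSClass = "Burstable"
	Guaranteed QOSClass = "Guaranteed"
)

// Pod is a hypothetical, minimal pod record used only for this example.
type Pod struct {
	Name string
	QOS  QOSClass
}

// qosRank returns the eviction rank of a QoS class; lower ranks are
// evicted first. Unknown classes are treated like Guaranteed (evicted last).
func qosRank(q QOSClass) int {
	switch q {
	case BestEffort:
		return 0
	case Burstable:
		return 1
	default:
		return 2
	}
}

// rankByQOS returns a copy of pods ordered BestEffort, Burstable, Guaranteed.
// The sort is stable, so pods in the same class keep their input order.
func rankByQOS(pods []Pod) []Pod {
	ranked := append([]Pod(nil), pods...)
	sort.SliceStable(ranked, func(i, j int) bool {
		return qosRank(ranked[i].QOS) < qosRank(ranked[j].QOS)
	})
	return ranked
}

func main() {
	pods := []Pod{{"db", Guaranteed}, {"batch", BestEffort}, {"web", Burstable}}
	for _, p := range rankByQOS(pods) {
		fmt.Println(p.Name, p.QOS)
	}
}
```

Because the ordering depends only on the QoS class, it needs no usage stats at all, which is what would make it usable as a fallback when stats gathering fails.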
from kubernetes.
PID stats are another area where we are seeing failures, but that seems to be because all PID stats are returning 0, so stats gathering is not working at all there.
from kubernetes.
cc @pacoxu
from kubernetes.
/priority important-longterm
from kubernetes.
Yeah, overloading the compare/prioritization function with error checking looks problematic on its face. I'd try to split that out into two separate funcs, one to get stats and another for the compare. Could the inability to get stats for pod A be due to pod B blocking the resource, which would then be ignored because of the failure it caused?
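A minimal sketch of what that split could look like, assuming a hypothetical `statsFunc` callback and pods identified by name. None of these names come from the kubelet; the point is only that stats failures are collected in one step and the comparison runs purely on data that was actually gathered.

```go
package main

import (
	"errors"
	"fmt"
	"sort"
)

// errNoStats is a hypothetical error used by the fake stats source below.
var errNoStats = errors.New("stats unavailable")

// statsFunc is an assumed callback that returns a pod's resource usage.
type statsFunc func(pod string) (uint64, error)

// gatherUsage does only the stats lookup. Pods whose stats could not be
// read are returned separately instead of being hidden inside a comparator.
func gatherUsage(pods []string, stats statsFunc) (map[string]uint64, []string) {
	usage := make(map[string]uint64, len(pods))
	var missing []string
	for _, p := range pods {
		u, err := stats(p)
		if err != nil {
			missing = append(missing, p)
			continue
		}
		usage[p] = u
	}
	return usage, missing
}

// rankByUsage does only the comparison: highest usage first, with the pod
// name as a tie-breaker so the result is deterministic.
func rankByUsage(usage map[string]uint64) []string {
	ranked := make([]string, 0, len(usage))
	for p := range usage {
		ranked = append(ranked, p)
	}
	sort.Slice(ranked, func(i, j int) bool {
		if usage[ranked[i]] != usage[ranked[j]] {
			return usage[ranked[i]] > usage[ranked[j]]
		}
		return ranked[i] < ranked[j]
	})
	return ranked
}

func main() {
	fake := func(pod string) (uint64, error) {
		switch pod {
		case "a":
			return 0, errNoStats
		case "b":
			return 500, nil
		default:
			return 100, nil
		}
	}
	usage, missing := gatherUsage([]string{"a", "b", "c"}, fake)
	fmt.Println("ranked:", rankByUsage(usage))
	fmt.Println("missing stats:", missing)
}
```

What to do with the `missing` pods is left open here; the caller could apply a separate policy to them rather than having the failure silently affect the ordering.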
from kubernetes.
Brought this up to sig-node.
We should look into ways to populate the cache more aggressively. QoS could be an option, but it is worth investigating why stats gathering fails.
from kubernetes.
/remove-kind failing-test
from kubernetes.