Comments (3)
Useful information from printf debugging: limits are somehow over/under flowing.
=== RUN TestIngester_Push
=== RUN TestIngester_Push/should_discard_metadata_when_max_metadata_per_user_exceeded
checking length of metadata. len = 0, metadata = map[]
creating new metadata. limit = 2147483647, len = 0
checking length of metadata. len = 1, metadata = map[test_metric_1:map[{Type:COUNTER MetricFamilyName:test_metric_1 Help:This is a test metric. Unit:}:2024-04-26 10:56:22.995277387 -0400 EDT m=+10.048988645]]
creating new metadata. limit = 2147483647, len = 1
from mimir.
This appears to be triggered by no ingesters being live in the zone when limits are computed. This means the limit code returns early with a global limit of 0
which is interpreted as "unlimited" leading to a local limit of int32.MAX
.
=== RUN TestIngester_Push/should_discard_metadata_when_max_metadata_per_user_exceeded
no ingesters in zone. ingesters in zone = 0, zone count = 1, shard size = 0
global limit = 1, local limit = 0
no ingesters in zone. ingesters in zone = 0, zone count = 1, shard size = 0
global limit = 1, local limit = 0
creating new metadata. limit = 2147483647, len = 0
global limit = 0, local limit = 0
no ingesters in zone. ingesters in zone = 0, zone count = 1, shard size = 0
global limit = 1, local limit = 0
no ingesters in zone. ingesters in zone = 0, zone count = 1, shard size = 0
global limit = 1, local limit = 0
creating new metadata. limit = 2147483647, len = 1
global limit = 0, local limit = 0
from mimir.
This appears to be because we were using i.lifecycler.HealthyInstancesCount()
to wait for ingester startup which doesn't actually guarantee the ingester owns any tokens. This is fixed by using a different method on i.lifecycler
to wait for startup.
Ultimately this seems like the same race condition @pr00se has identified because ingesters use ring.Lifecycler
from dskit instead of ring.BasicLifecycler
(which waits for token ownership before completing startup).
from mimir.
Related Issues (20)
- Alert State History from Mimir
- ingester.max-global-series-per-user: 2000000 is not parsable in ingester values HOT 3
- Chunk compression at rest HOT 3
- Unable to deploy helm-chart mimir-distributed with ArgoCD when setting any of `rbac.podSecurityContext` to `null`
- Lots of `"error processing requests from scheduler"` in querier logs HOT 3
- Test flake: TestDistributor/caching_unmarshal_data_disabled/reduce_native_histogram_buckets_via_down_scaling HOT 6
- Prometheus is crashign with Mimir push errors HOT 2
- failed to fetch some blocks | err-mimir-store-consistency-check-failed
- mimir-distributed Chart Env Var Expansion Failure: S3 Access Key and Access Key ID HOT 1
- [ingester] Ingester service state and lifecycler ring state not synchronized HOT 4
- Compactor fails to upload indexes larger than 1G to swift object storage
- Scrape commit failed" err="write to WAL: log samples: write data/wal/XXXXXXXX: no space left on device HOT 1
- Helm: Missing fields in Topology Spread Constraints
- Ruler Pods OOM/spike in memory observed with warning log closing ingester client stream failed
- store-gateway: add timeout to index-header loading
- Mimir returns HTTP status 422 in cases where 5xx makes more sense
- Docs: Update references to mmap in store-gateway architecture
- Query with aggregation return incorrect num of points HOT 1
- [mimir-distributed] Add additionalRuleLabels to PrometheusRule alerts HOT 1
- Request per Second Metric Does Not Sync with Total Request Count in Mimir Visualization
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mimir.