Comments (17)
How many blocks do you have? Can you bump to 0.35.0?
from thanos.
How many blocks do you have? Can you bump to 0.35.0?
What do you mean by how many blocks?
Bumped thanos versions to 0.35.1, redeployed with all the stores set. The red line that is spiking in the graphs is caused by the EKS node which has storegateway on it:
from thanos.
I guess it's fetching block metadata on startup ( the gateway) does it stabilize eventually and is the querier less noisy?
from thanos.
It stays that high indefinitely. The fact that it "stabilizes" is not really a good thing when it stays at 500+MB/s bandwidth 😅
from thanos.
Storage gw is also 0.35.0 right?
from thanos.
Storage gw is also 0.35.0 right?
Yeah all thanos components are now 0.35.1
from thanos.
How many blocks do you have in object storage roughly? Is your compactor working well?
from thanos.
Compactor seems to be doing it's job quite well and there are less than 800 objects reported totaling little under 30GB of data.
from thanos.
@sourcehawk traffic between querier and store gateway is triggered by incoming queries. We can't say 500 MB/s is unnatural unless we know a bunch of things, like:
- Is this querier getting lots of queries as soon as it comes up?
- Are these queries over long time frames?
- Are these queries touching many series?
- What's the size of your blocks?
from thanos.
After much debugging we've come to realize the traffic is probably being generated due to an infinite recursion loop happening between the thanos ruler and the querier when the ruler is added as a store on the querier. My best bet is because the ruler queries the querier but the querier also queries the ruler, causing an infinite call loop to both the sidecar and the store gateway.
This is the network traffic after removing thanos ruler, as can be seen, the traffic of all types drops almost instantaneously to zero.
from thanos.
Interesting. You can deploy a separate querier that will query almost everything, excluding the Ruler, and point the Ruler to this one.
from thanos.
Just out of curiosity - if you'd enable remote_write on Ruler that would stop Store API on it - possibly that could help?
from thanos.
Related Issues (20)
- Can Huawei's OBS storage be supported? HOT 1
- Thanos React-app : Proxy server for thanos-query
- Query: update of endpoint failed...context deadline exceeded
- Thanos Chart 0.34.0 app version 12.23.1
- Thanos receive fails "no space left on device"
- sidecar: Greatly increased Thanos sidecar memory usage from 0.32.2 to 0.32.3, still exists in 0.35.0 HOT 3
- api/v1/label returns wrong values HOT 3
- Regression in thanos v0.35.1 HOT 3
- Thanos Receiver: Router/Ingestor setup no longer returns `thanos_receive_write_timeseries_*` and `thanos_receive_write_samples_*` metrics with thanos v0.35.1 HOT 3
- Extend Thanos bucket rewrite to support filtered archiving of existing blocks
- Support additional aggregates for downsampling
- Store Announced LabelSets Unexpected
- Warning in Grafana 11+: Thanos Receive dashboard depends on Angular
- compactor: Thanos Compactor compaction level number being set to 93+ for 1 half of HA Prometheus (S3 Storage) HOT 3
- Codespace doesn't seem to be working
- AWS S3 objectstorage is not working for ap-south-2 region inspite of endpoint and region is mentioned and getting default to dualstack endpoint of us-east-1 HOT 7
- Thanos receive running on ext4 FS experiencing compaction failures HOT 10
- query: If i choose time window shorter than 6 months, i don't see downsampled metrics HOT 4
- compact: Thanos Compactor doesn't delete blocks which are marked for deletion HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from thanos.