Giter Club home page Giter Club logo

Comments (3)

aaronvarga avatar aaronvarga commented on June 2, 2024

I don't mean to hijack the thread, but I'm experiencing very similar behavior, and I also don't see anything super obvious, though I too find it difficult to interpret these numbers. Here are a few errors I'm receiving:

Timeout awaiting response (outbound=0KiB, inbound=0KiB, 60187ms elapsed, timeout is 18000ms), command=PING, next: PING, inst: 0, qu: 0, qs: 0, aw: False, bw: Inactive, rs: DequeueResult, ws: Idle, in: 0, last-in: 28, cur-in: 7, sync-ops: 0, async-ops: 39104, serverEndpoint: redacted.redis.cache.windows.net:6380, conn-sec: 1173267.98, aoc: 0, mc: 1/1/0, mgr: 8 of 10 available, clientName: redacted(SE.Redis-v2.7.4.20928), IOCP: (Busy=4,Free=996,Min=25,Max=1000), WORKER: (Busy=9,Free=1014,Min=25,Max=1023), POOL: (Threads=15,QueuedItems=25,CompletedItems=21339081,Timers=89), v: 2.7.4.20928 (Please take a look at this article for some common client-side issues that can cause timeouts: https://stackexchange.github.io/StackExchange.Redis/Timeouts)

Timeout awaiting response (outbound=0KiB, inbound=0KiB, 26188ms elapsed, timeout is 18000ms), command=EXPIRE, next: EXPIRE MyCacheKey, inst: 0, qu: 0, qs: 0, aw: True, bw: SpinningDown, rs: DequeueResult, ws: Writing, in: 0, last-in: 0, cur-in: 8, sync-ops: 0, async-ops: 52696, serverEndpoint: redacted.redis.cache.windows.net:6380, conn-sec: 850021.5, aoc: 0, mc: 1/1/0, mgr: 7 of 10 available, clientName: redacted(SE.Redis-v2.7.4.20928), IOCP: (Busy=0,Free=1000,Min=25,Max=1000), WORKER: (Busy=10,Free=1013,Min=25,Max=1023), POOL: (Threads=31,QueuedItems=0,CompletedItems=15462543,Timers=141), v: 2.7.4.20928 (Please take a look at this article for some common client-side issues that can cause timeouts: https://stackexchange.github.io/StackExchange.Redis/Timeouts)

Timeout awaiting response (outbound=0KiB, inbound=0KiB, 24641ms elapsed, timeout is 18000ms), command=HMGET, next: HMGET MyCacheKey, inst: 0, qu: 0, qs: 0, aw: False, bw: SpinningDown, rs: DequeueResult, ws: Idle, in: 0, last-in: 0, cur-in: 868, sync-ops: 0, async-ops: 62872, serverEndpoint: redacted.redis.cache.windows.net:6380, conn-sec: 929628.06, aoc: 0, mc: 1/1/0, mgr: 9 of 10 available, clientName: redacted(SE.Redis-v2.7.4.20928), IOCP: (Busy=1,Free=999,Min=25,Max=1000), WORKER: (Busy=5,Free=1018,Min=25,Max=1023), POOL: (Threads=8,QueuedItems=0,CompletedItems=16944007,Timers=102), v: 2.7.4.20928 (Please take a look at this article for some common client-side issues that can cause timeouts: https://stackexchange.github.io/StackExchange.Redis/Timeouts)

from stackexchange.redis.

NickCraver avatar NickCraver commented on June 2, 2024

@Crossbow78 Is there a chance the nodes are swapping on the network side somewhere? This appears that we legit did not get a response - given the lower volume, I'd even suggest a long-running scoped packet capture to just those commands perhaps to see what's happening, I don't see anything odd on your client stats that indicates a problem unless that outbound is common. Do all of your occurences have an outbound buffer that hasn't been sent?

@aaronvarga Yours is very perplexing, those timeout differentials tell us that the timer to process backlogs isn't firing for up to 40 seconds which is absolutely crazy lag - is the machine pegged on CPU/low on resources in general? That's one of the most extreme timer lags we've had reported.

from stackexchange.redis.

Crossbow78 avatar Crossbow78 commented on June 2, 2024

Thanks for your response, @NickCraver !

I have collected all exception metrics in an Excel sheet to get a better overview, and I can see that in 90% of the cases both 'outbound' and 'inbound' are 0.

Your remark about nodes triggered me to double-check how our pods were distributed over the underlying k8s cluster nodes. I found that 2 application nodes were running on different nodes than the redis pod, and 1 was running on the same. The distribution of client timeouts however is equally spread over all 3 application pods.

I'm not sure I understand what kind of network swapping could be involved, nor what a long-running scoped packet capture would look like. If you could elaborate I'd be happy to give it a try to gain more insight.

from stackexchange.redis.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.