chore: add canister message queue latency metric #1510

maksymar · 2024-09-16T12:47:06Z

This PR adds a metric to measure the queue latency of canister messages for both ingress and request messages.

For ingress messages, since the start time is not stored, the start time is estimated using the expiry time and the current expiration limit. The metric also tracks the call depth for request messages, as it is relevant for them.

alin-at-dfinity · 2024-09-16T18:39:40Z

rs/execution_environment/src/execution_environment.rs

+                    let message_type_label = "request";
+                    let call_tree_depth_label = &metadata.call_tree_depth().to_string();
+                    let latency = current_time()
+                        .saturating_duration_since(*metadata.call_tree_start_time())


IIRC the call tree start time is the time when the message at the root of the call tree started execution. Regardless, it has to do with the call tree, not with the individual message being executed. So a message enqueued in the same block (i.e. at the same time) when it's executed could have a call tree start time that is arbitrarily old.

alin-at-dfinity · 2024-09-16T18:44:50Z

rs/execution_environment/src/execution_environment.rs

+                let now = current_time();
+                let expiry_duration = expiry_time_from_now().saturating_duration_since(now);
+                let start = ingress.expiry_time.saturating_sub(expiry_duration);
+                let latency = now.saturating_duration_since(start).as_secs_f64();


Based on your PR description you are very likely aware of this, but I will still mention that the expiry time can be set by the caller to an arbitrary value, i.e. much lower than the maximum 5 minutes. So this calculation may also be way off.

(In fact, I seem to remember subnets where this is consistently significantly less than 5 minutes. There's an mr_unreliable_induct_ingress_message_duration_seconds metric that uses similar logic to estimate ingress induction latency. You can go here, unhide the first query and try selecting various subnets.)

maksymar added 3 commits September 16, 2024 09:50

add ingress queue latency metric

2f589f3

calculate expiry duration

fdccf7a

use current system time

14fd59f

github-actions bot added the chore label Sep 16, 2024

maksymar added 3 commits September 16, 2024 12:48

comment

40c9b3a

add message_type label

d2e29e0

add request

6bdb3a7

maksymar changed the title ~~chore: add ingress message queue latency metric~~ chore: add canister message queue latency metric Sep 16, 2024

cleanup

149df4f

maksymar marked this pull request as ready for review September 16, 2024 13:40

maksymar requested review from a team as code owners September 16, 2024 13:40

github-actions bot added @execution @ic-interface-owners labels Sep 16, 2024

Merge branch 'master' into maksym/msg_queue_latency

9c83e4a

maksymar requested review from berestovskyy and dsarlis September 16, 2024 13:41

alin-at-dfinity reviewed Sep 16, 2024

View reviewed changes

merge master

933a55f

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

chore: add canister message queue latency metric #1510

chore: add canister message queue latency metric #1510

Uh oh!

maksymar commented Sep 16, 2024 •

edited

Loading

Uh oh!

alin-at-dfinity Sep 16, 2024

Uh oh!

alin-at-dfinity Sep 16, 2024

Uh oh!

Uh oh!

chore: add canister message queue latency metric #1510

Are you sure you want to change the base?

chore: add canister message queue latency metric #1510

Uh oh!

Conversation

maksymar commented Sep 16, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alin-at-dfinity Sep 16, 2024

Choose a reason for hiding this comment

Uh oh!

alin-at-dfinity Sep 16, 2024

Choose a reason for hiding this comment

Uh oh!

Uh oh!

maksymar commented Sep 16, 2024 •

edited

Loading