feat(new transform): Add new incremental_to_absolute transform and change to MetricSet to an LRU cache with optional capacity policies #23374

GreyLilac09 · 2025-07-14T21:21:56Z

Summary

Create new incremental_to_absolute transform

Useful for:

avoiding duplicate metrics cache creation at the sink level
creating a historical record of metrics to account for lossy connections/file-based back-filling

Problem it solves: #23018

Vector configuration

transforms:
  incremental_to_absolute:
    type: incremental_to_absolute
    cache:
      max_bytes: 100_000_00
      max_events: 1000
      time_to_idle: 300

How did you test this PR?

Example 1
Example configuration:

data_dir: ./vector-data-dir
sources:
  s0:
    type: static_metrics
    interval_secs: 1
    metrics:
      - name: response_time
        kind: incremental
        value:
          counter:
            value: 1
        tags: {}

transforms:
  t0:
    inputs:
      - s0
    type: incremental_to_absolute
    cache:
      max_events: 5
sinks:
  console:
    type: console
    inputs:
      - t0
    target: stdout
    encoding:
      codec: json
      json:
        pretty: true

Example output:

{
  "name": "response_time",
  "namespace": "static",
  "timestamp": "2025-07-16T04:02:06.446891Z",
  "kind": "absolute",
  "counter": {
    "value": 1.0
  }
}
{
  "name": "response_time",
  "namespace": "static",
  "timestamp": "2025-07-16T04:02:07.447752Z",
  "kind": "absolute",
  "counter": {
    "value": 2.0
  }
}
{
  "name": "response_time",
  "namespace": "static",
  "timestamp": "2025-07-16T04:02:08.447934Z",
  "kind": "absolute",
  "counter": {
    "value": 3.0
  }
}
{
  "name": "response_time",
  "namespace": "static",
  "timestamp": "2025-07-16T04:02:09.447506Z",
  "kind": "absolute",
  "counter": {
    "value": 4.0
  }
}

Example 2
Enforcing max_events. Note that max_events = 1 means the other item will just be evicted immediately, resulting in 1 being received always in this example

data_dir: ./vector-data-dir
sources:
  s0:
    type: static_metrics
    interval_secs: 1
    metrics:
      - name: m1
        kind: incremental
        value:
          counter:
            value: 1
        tags: {}
      - name: m2
        kind: incremental
        value:
          counter:
            value: 1
        tags: {}
transforms:
  t0:
    type: incremental_to_absolute
    inputs:
      - s0
    cache:
      max_events: 1
      time_to_idle: 10
sinks:
  console:
    type: console
    inputs:
      - t0
    target: stdout
    encoding:
      codec: json
      json:
        pretty: true

Output:

{
  "name": "m1",
  "namespace": "static",
  "timestamp": "2025-07-26T00:37:35.593501Z",
  "kind": "absolute",
  "counter": {
    "value": 1.0
  }
}
{
  "name": "m2",
  "namespace": "static",
  "timestamp": "2025-07-26T00:37:35.593512Z",
  "kind": "absolute",
  "counter": {
    "value": 1.0
  }
}
{
  "name": "m1",
  "namespace": "static",
  "timestamp": "2025-07-26T00:37:36.593826Z",
  "kind": "absolute",
  "counter": {
    "value": 1.0
  }
}
{
  "name": "m2",
  "namespace": "static",
  "timestamp": "2025-07-26T00:37:36.593830Z",
  "kind": "absolute",
  "counter": {
    "value": 1.0
  }
}

Change Type

Bug fix
New feature
Non-functional (chore, refactoring, docs)
Performance

Is this a breaking change?

Yes
No

Does this PR include user facing changes?

Yes. Please add a changelog fragment based on our guidelines.
No. A maintainer will apply the no-changelog label to this PR.

References

Closes: Convert incremental to absolute counter values for statsd source #23018

Notes

Please read our Vector contributor resources.
Do not hesitate to use @vectordotdev/vector to reach out to us regarding this PR.
Some CI checks run only after we manually approve them.
- We recommend adding a pre-push hook, please see this template.
- Alternatively, we recommend running the following locally before pushing to the remote branch:
  - cargo fmt --all
  - cargo clippy --workspace --all-targets -- -D warnings
  - cargo nextest run --workspace (alternatively, you can run cargo test --all)
After a review is requested, please avoid force pushes to help us review incrementally.
- Feel free to push as many commits as you want. They will be squashed into one before merging.
- For example, you can run git merge origin master and git push.
If this PR introduces changes Vector dependencies (modifies Cargo.lock), please
run cargo vdev build licenses to regenerate the license inventory and commit the changes (if any). More details here.

src/transforms/incremental_to_absolute.rs

thomasqueirozb · 2025-07-15T17:00:24Z

Hey @GreyLilac09, thanks for the PR. Please update the test plan in your description including a vector config and expected output. This part allows us to run your code with a config and easily validate what the expected output should look like.

Here is an example config you can build on top of:

sources:
  s0:
    type: static_metrics
    interval_secs: 1
    metrics:
      - name: response_time
        kind: incremental
        value:
          counter:
            value: 1
        tags: {}

transforms:
  t0:
    type: remap
    inputs:
      - s0
    source: |-
      .tags.output = "some value"

sinks:
  console:
    type: console
    inputs:
      - t0
    target: stdout
    encoding:
      codec: json
      json:
        pretty: true

or you can create one from scratch. Thanks!

iadjivon

All set from Docs!

lib/vector-core/src/event/metric/data.rs

Copilot

Pull Request Overview

This PR introduces a new incremental_to_absolute transform that converts incremental metrics to absolute metrics while preserving the cumulative values. This is useful for avoiding duplicate metric caches at sink levels and creating historical records for scenarios with lossy connections or file-based backfilling.

Key changes:

Implements the core transform logic with TTL-based metric expiration
Adds comprehensive documentation and configuration examples
Includes unit tests covering incremental-to-absolute conversion and pass-through behavior for already-absolute metrics

Reviewed Changes

Copilot reviewed 7 out of 8 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
`src/transforms/incremental_to_absolute.rs`	Core implementation of the transform with metric conversion logic and tests
`website/cue/reference/components/transforms/incremental_to_absolute.cue`	Documentation and configuration reference for the new transform
`lib/vector-core/src/event/metric/data.rs`	Updates `into_absolute()` method to remove interval_ms when converting metrics
`src/transforms/mod.rs`	Registers the new transform module
`website/cue/reference/components.cue`	Adds feature definition for the transform
`Cargo.toml`	Adds feature flags for the new transform
`changelog.d/incremental_to_absolute_transform.feature.md`	Changelog entry documenting the new feature

Comments suppressed due to low confidence (2)

website/cue/reference/components/transforms/incremental_to_absolute.cue:50

The example title 'Aggregate over 5 seconds' is misleading. This transform converts incremental to absolute metrics but doesn't aggregate over time windows. A more accurate title would be 'Convert incremental counter to absolute'.

			title: "Aggregate over 5 seconds"

website/cue/reference/components/transforms/incremental_to_absolute.cue

src/transforms/incremental_to_absolute.rs

lib/vector-core/src/event/metric/data.rs

src/transforms/incremental_to_absolute.rs

GreyLilac09 · 2025-07-17T03:16:58Z

After thinking about this some more, I'm not so sure if just the expire_metrics_secs would be the right approach. The problem is that a lot of incremental counters are sparse (eg they could be incremented as a statsd counter every few hours or days), and the current approach just miss all of them. I think a better approach would be to implement the MetricSet (see https://github.com/vectordotdev/vector/blob/master/src/sinks/util/buffer/metrics/normalize.rs) as an LRU cache rather than as a IndexMap, and have a configurable max size.

I initially was thinking about this approach, but shied away from it because it would involve a much more substantial change to the code than I felt making at the time. However, I do think it's the right way to go about this.
The problem with just an LRU cache is someone might have a scenario where they need to be able to handle extremely high bursts of data without dropping. In this scenario, the expire_metrics_secs would be useful.
a. in this scenario, the cache would just eventually grow to the max size and stay at that size until restart. If the LRU cache max size is say 256 MB, one month later Vector could just sit at 256 MB allocated memory for Vector, even if it hasn't received any of those incremental counters for 27 days
b. they might not be able to predict the size of the burst ahead of time, and they need to be flexible, so having a fixed max size would cause a lot of inaccurate data when that burst comes and it exceeds the allowed size

Thus, I would propose an additional configuration, eg

transforms:
  incremental_to_absolute:
    type: incremental_to_absolute
    cache_max_size: 268435488 (default)
    expire_metrics_secs: 120s (default)

where cache_max_size is in bytes

Alternatively, maybe it would just be better to group the cache configs like we do for buffer and batch? So the config would be like

transforms:
  incremental_to_absolute:
    type: incremental_to_absolute
    cache:
      max_size: 268435488
      timeout_secs: 120s

Curious to hear your thoughts. I'd also eventually like to add this config to the prom remote write sink (eg from this PR).

I can also do add the LRU cache in a separate PR, but it would not be ideal to change the config later (eg. go from expire_metrics_secs to cache.timeout_secs) if it can be avoided, so if we go with the second config we'd probably want to do it in this PR.

pront · 2025-07-18T14:18:01Z

he problem is that a lot of incremental counters are sparse (eg they could be incremented as a statsd counter every few hours or days), and the current approach just miss all of them.

Can you explain with an example?

From a UX perspective the following is better:

transforms:
  incremental_to_absolute:
    type: incremental_to_absolute
    cache:
      max_size: 268435488
      timeout_secs: 120s

GreyLilac09 · 2025-07-18T15:14:19Z

For example, if we increment (+1) count every 10 minutes, and the expire_metrics_secs is 5 minutes, that count would always just show up as 1 (unchanging) in prometheus and the increase in value is never logged

GreyLilac09 · 2025-07-18T15:16:24Z

transforms:
  incremental_to_absolute:
    type: incremental_to_absolute
    cache:
      max_size: 268435488
      timeout_secs: 120s

@pront that makes sense, if this is the case we should also change the config of prom remote write sink (#23286) to be the same. I think the plan would be to modify this PR to use the LRU cache with this config, and in a separate follow-up PR modify the prom remote write config to have the same?

pront · 2025-07-18T16:33:53Z

transforms:
  incremental_to_absolute:
    type: incremental_to_absolute
    cache:
      max_size: 268435488
      timeout_secs: 120s
@pront that makes sense, if this is the case we should also change the config of prom remote write sink (#23286) to be the same. I think the plan would be to modify this PR to use the LRU cache with this config, and in a separate follow-up PR modify the prom remote write config to have the same?

Hi @GreyLilac09, this makes sense to me!

GreyLilac09 · 2025-07-24T16:04:05Z

@pront I would appreciate an initial look at the new structure here. There's still some details I'm working to iron out (specifically, time_to_idle doesn't seem to be working) but I'd love to know if the structure is directionally correct.

GreyLilac09 · 2025-07-24T16:12:01Z

src/sinks/util/buffer/metrics/normalize.rs

+    }
+}
+
+impl InternalEvent for MetricSet {


not sure if this will just work, I'm looking to emit a metric for cache events and size

GreyLilac09 · 2025-07-24T16:12:48Z

src/sinks/util/buffer/metrics/normalize.rs

+use vector_lib::ByteSizeOf;
+
+#[derive(Debug, Snafu, PartialEq, Eq)]
+pub enum NormalizerError {


Copying same structure as BatchConfig

GreyLilac09 · 2025-07-24T17:06:32Z

~~FYI, after some debugging, it looks like the TTI and max_events/max_bytes works, but they're enforced in the background so the max_events/max_bytes can exceed it temporarily~~

~~The alternative unsync::Cache (which enforces capacity/max events) is not thread-safe so I don't think we can use it~~

Our new approach can support it (no extra dependencies)

…GreyLilac09/vector into greylilac09/add-incremental-to-absolute

GreyLilac09 · 2025-07-26T00:40:08Z

@pront I think it's ready for review. I originally went for a mini-moka approach, but I think 1. I'm not able to build it with cross due to new dependencies 2. I think we get more control and flexibility on behavior (eg being able to specify both max_events and max_bytes, rather than either) by just using the original cache impl and manually tracking/evicting from the LRU cache

GreyLilac09 · 2025-07-31T15:40:05Z

@pront @thomasqueirozb gentle bump on a review--we'd like to get this in before next week's release if possible

add incremental_to_absolute

5616c4e

GreyLilac09 requested review from a team as code owners July 14, 2025 21:21

github-actions bot added domain: transforms Anything related to Vector's transform components domain: external docs Anything related to Vector's external, public documentation labels Jul 14, 2025

GreyLilac09 commented Jul 14, 2025

View reviewed changes

src/transforms/incremental_to_absolute.rs Outdated Show resolved Hide resolved

GreyLilac09 added 2 commits July 14, 2025 19:17

fix docs

9c6e6dc

fix tpos

239f7f3

thomasqueirozb added the meta: awaiting author Pull requests that are awaiting their author. label Jul 15, 2025

Merge branch 'master' into greylilac09/add-incremental-to-absolute

284d9db

github-actions bot removed the meta: awaiting author Pull requests that are awaiting their author. label Jul 15, 2025

iadjivon approved these changes Jul 15, 2025

View reviewed changes

pront added the meta: awaiting author Pull requests that are awaiting their author. label Jul 15, 2025

use sync

0d6c9f1

github-actions bot removed the meta: awaiting author Pull requests that are awaiting their author. label Jul 15, 2025

GreyLilac09 added 2 commits July 15, 2025 22:13

fix extra aggregate

9d55415

remove interval_ms

8dedf1c

github-actions bot added the domain: core Anything related to core crates i.e. vector-core, core-common, etc label Jul 16, 2025

GreyLilac09 commented Jul 16, 2025

View reviewed changes

lib/vector-core/src/event/metric/data.rs Outdated Show resolved Hide resolved

GreyLilac09 requested a review from thomasqueirozb July 16, 2025 04:08

pront requested a review from Copilot July 16, 2025 19:50

Copilot AI reviewed Jul 16, 2025

View reviewed changes

website/cue/reference/components/transforms/incremental_to_absolute.cue Outdated Show resolved Hide resolved

src/transforms/incremental_to_absolute.rs Outdated Show resolved Hide resolved

lib/vector-core/src/event/metric/data.rs Outdated Show resolved Hide resolved

pront reviewed Jul 16, 2025

View reviewed changes

src/transforms/incremental_to_absolute.rs Outdated Show resolved Hide resolved

pront added the meta: awaiting author Pull requests that are awaiting their author. label Jul 16, 2025

address comments

0ee9cb1

github-actions bot removed the meta: awaiting author Pull requests that are awaiting their author. label Jul 16, 2025

pront added the meta: awaiting author Pull requests that are awaiting their author. label Jul 17, 2025

GreyLilac09 requested a review from pront July 17, 2025 23:28

github-actions bot removed the meta: awaiting author Pull requests that are awaiting their author. label Jul 17, 2025

Merge branch 'master' into greylilac09/add-incremental-to-absolute

3cc9d8b

pront added the meta: awaiting author Pull requests that are awaiting their author. label Jul 21, 2025

use lru cache

b1a18aa

github-actions bot added domain: sinks Anything related to the Vector's sinks and removed meta: awaiting author Pull requests that are awaiting their author. labels Jul 24, 2025

GreyLilac09 changed the title ~~feat(new transform): Add new incremental_to_absolute transform~~ feat(new transform): Add new incremental_to_absolute transform and change to MetricSet to an LRU cache (mini-moka) Jul 24, 2025

use max_bytes or max_events

0761542

GreyLilac09 commented Jul 24, 2025

View reviewed changes

GreyLilac09 and others added 5 commits July 24, 2025 17:42

add debug print

7c6fa3e

Merge branch 'master' into greylilac09/add-incremental-to-absolute

bbe41f4

Merge branch 'greylilac09/add-incremental-to-absolute' of github.com:…

5596b9c

…GreyLilac09/vector into greylilac09/add-incremental-to-absolute

remove debug print

07db954

use original cache impl

a1ce197

GreyLilac09 changed the title ~~feat(new transform): Add new incremental_to_absolute transform and change to MetricSet to an LRU cache (mini-moka)~~ feat(new transform): Add new incremental_to_absolute transform and change to MetricSet to an LRU cache with optional capacity policies Jul 25, 2025

GreyLilac09 added 2 commits July 25, 2025 19:49

restore original formatting

51d959a

use lrucache

00c2388

GreyLilac09 added 4 commits July 25, 2025 21:35

make some optimizations and fix clippy

c8f73d1

fix incr_to_absolute and absolute_to_incr

f832a4e

fix

7c692d5

add overhead

72f86a0

simplify free_items

3cc49f1

feat(new transform): Add new incremental_to_absolute transform and change to MetricSet to an LRU cache with optional capacity policies #23374

Are you sure you want to change the base?

feat(new transform): Add new incremental_to_absolute transform and change to MetricSet to an LRU cache with optional capacity policies #23374

Conversation

GreyLilac09 commented Jul 14, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Vector configuration

How did you test this PR?

Change Type

Is this a breaking change?

Does this PR include user facing changes?

References

Notes

Uh oh!

Uh oh!

thomasqueirozb commented Jul 15, 2025

Uh oh!

iadjivon left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

GreyLilac09 commented Jul 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pront commented Jul 18, 2025

Uh oh!

GreyLilac09 commented Jul 18, 2025

Uh oh!

GreyLilac09 commented Jul 18, 2025

Uh oh!

pront commented Jul 18, 2025

Uh oh!

GreyLilac09 commented Jul 24, 2025

Uh oh!

GreyLilac09 Jul 24, 2025

Choose a reason for hiding this comment

Uh oh!

GreyLilac09 Jul 24, 2025

Choose a reason for hiding this comment

Uh oh!

GreyLilac09 commented Jul 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

GreyLilac09 commented Jul 26, 2025

Uh oh!

GreyLilac09 commented Jul 31, 2025

Uh oh!

Uh oh!

GreyLilac09 commented Jul 14, 2025 •

edited

Loading

GreyLilac09 commented Jul 17, 2025 •

edited

Loading

GreyLilac09 commented Jul 24, 2025 •

edited

Loading