Implement ChannelMode and sampling rate for extended aggregation #187

hush-hush · 2021-03-01T18:09:33Z

When using extended aggregation we still want to respect sampling rate for histograms, distribution and timing as it will have direct consequences on the Agent (number of metrics to aggregate).

For app sending a very high number of metrics the aggregator implements ChannelMode to avoid lock contention when generating
andom numbers.

truthbk

Super clean, I really don't have much to say. Added a couple of comments regarding areas that may be a bit of a concern in terms of potential performance hurdles - difficult to overcome in any case - but that we could maybe look into.

Feel free to merge.

statsd/options.go

truthbk · 2021-04-12T05:08:37Z

statsd/buffered_metric_context.go

+}
+
+func (bc *bufferedMetricContexts) sample(name string, value float64, tags []string, rate float64) error {
+	if !shouldSample(rate, bc.random, &bc.randomLock) {


I know there's no easy way around this, I'm a little concerned about all this locking here. We have one lock per metric type which is still a pretty coarse granularity. Can't really think of a good workaround.

That's true. Also it's the same overhead as the default configuration right now. When using the default setting (ie mutext mode), we have the same lock and random. So the overhead should be the same with and without extended aggregation.

truthbk · 2021-04-12T05:38:25Z

statsd/aggregator.go

@@ -11,59 +11,9 @@ type (
 	countsMap         map[string]*countMetric
 	gaugesMap         map[string]*gaugeMetric
 	setsMap           map[string]*setMetric
-	bufferedMetricMap map[string]*histogramMetric
+	bufferedMetricMap map[string]*bufferedMetric


So, I'm wondering if we should consider sync.Map? It's not totally clear to me, but we might get better performance. On paper a sync.Map might offer better performance when used as follows:

(1) when the entry for a given key is only ever written once but read many times, as in caches that only grow, (2) when multiple goroutines read, write, and overwrite entries for disjoint sets of keys.

I think we have a bit of (1) in that we create the context once and then sample over and over, but it might not fit the description fully. We don't have super high concurrency for (2), and the sets of keys would not be disjoint, so I'm not sure that apples. In any case, might be worth giving it thought to decide if we should keep the current implementation.

That's a really good point. I'll do that in a different PR !

When using extended aggregation we still want to respect sampling rate for histograms, distribution and timing as it will have direct consequences on the Agent (number of point to aggregate). For apps sending a high number of metrics the aggregator implements ChannelMode to avoid lock contention when generating random numbers.

hush-hush force-pushed the maxime/add-channelmode-aggregator branch 2 times, most recently from 8c443d1 to f2fb331 Compare March 16, 2021 09:13

hush-hush force-pushed the master branch from 9b1bc22 to 7e21371 Compare March 16, 2021 10:31

truthbk approved these changes Apr 12, 2021

View reviewed changes

hush-hush force-pushed the maxime/add-channelmode-aggregator branch from e3cb1a4 to 358b394 Compare April 15, 2021 11:12

hush-hush merged commit d052db7 into master Apr 15, 2021

hush-hush deleted the maxime/add-channelmode-aggregator branch April 15, 2021 12:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement ChannelMode and sampling rate for extended aggregation #187

Implement ChannelMode and sampling rate for extended aggregation #187

hush-hush commented Mar 1, 2021

truthbk left a comment

truthbk Apr 12, 2021

hush-hush Apr 15, 2021

truthbk Apr 12, 2021

hush-hush Apr 15, 2021

Implement ChannelMode and sampling rate for extended aggregation #187

Implement ChannelMode and sampling rate for extended aggregation #187

Conversation

hush-hush commented Mar 1, 2021

truthbk left a comment

Choose a reason for hiding this comment

truthbk Apr 12, 2021

Choose a reason for hiding this comment

hush-hush Apr 15, 2021

Choose a reason for hiding this comment

truthbk Apr 12, 2021

Choose a reason for hiding this comment

hush-hush Apr 15, 2021

Choose a reason for hiding this comment