
Make asynchronous emitters have the same (configurable?) policy when event queue is full #7057

Open
leventov opened this issue Feb 12, 2019 · 11 comments


@leventov
Member

leventov commented Feb 12, 2019

This issue narrows the idea of #7037.

There are many emitters (at least AmbariMetricsEmitter, GraphiteEmitter, StatsDEmitter, KafkaEmitter, and HttpPostEmitter; I haven't checked others) that use the same producer-consumer pattern for asynchronous emission: emit() pushes the event onto a queue (or one of several queues), and an asynchronous executor retrieves events from the queue and sends them over the network.
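For illustration, here is a minimal sketch of that producer-consumer pattern (hypothetical names, not actual Druid code). The interesting point is the return value of `offer()`: it is `false` exactly when the queue is full, which is the moment where each emitter's policy currently diverges.

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;

public class AsyncEmitterSketch {
    private final BlockingQueue<String> queue;

    public AsyncEmitterSketch(int capacity) {
        this.queue = new ArrayBlockingQueue<>(capacity);
    }

    /**
     * Non-blocking emit: offer() returns false when the queue is full.
     * This is where the emitters diverge: log a warning
     * (AmbariMetricsEmitter, GraphiteEmitter), drop silently
     * (StatsDEmitter), or count the lost event (KafkaEmitter).
     */
    public boolean emit(String event) {
        return queue.offer(event);
    }

    /** Stand-in for the asynchronous consumer that sends events. */
    public String takeForSend() throws InterruptedException {
        return queue.take();
    }
}
```

With a capacity of 2, the third `emit()` call returns `false` and the event is lost unless some policy handles it.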

AmbariMetricsEmitter and GraphiteEmitter use the same policy when the queue is full (they log a warning). But StatsDEmitter apparently silently discards new events when the queue is full (see the NonBlockingStatsDClient code). KafkaEmitter also discards new events, but increments a "lost events" count. HttpPostEmitter packs events into batches and drops the oldest batch when overwhelmed, logging that it did so (see HttpPostEmitter.limitBuffersToEmitSize() and limitFailedBuffersSize()).

I think all emitters should behave similarly in this regard. The event throttling policy should probably be configurable.

Related to #2868.

@justinborromeo
Contributor

Do you have any strong opinions on which policy would be the best to use across all the emitters?

@leventov
Member Author

In my opinion, dropping old events is better than dropping new ones (as StatsDEmitter, KafkaEmitter, and maybe some other emitters currently do). Logging a warning or error alongside dropping events also makes sense.

@justinborromeo
Contributor

Also, what do you mean by "backpressure policy"? Does that just involve dropping the oldest events in queue?

@leventov
Member Author

leventov commented Feb 13, 2019

Yes, "backpressure" policy was not a precise term, because we probably never want actual backpressure in a metrics emitter: metrics are relatively unimportant and shouldn't slow down queries. I've rephrased it to "event throttling" policy. Simple policies might be "drop oldest" and "drop newest". More sophisticated policies might change the event "level" (akin to a logging level) when the queue is full, e.g. stop emitting relatively less important kinds of events, and then return to the default level after some backoff time or once the queue becomes sufficiently empty.
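The two simple policies could be sketched as follows (hypothetical names; a sketch only, not a proposed implementation — a real one would need to be thread-safe, e.g. synchronized or built on a concurrent deque):

```java
import java.util.ArrayDeque;
import java.util.Deque;

public class ThrottlingQueueSketch {
    public enum Policy { DROP_OLDEST, DROP_NEWEST }

    private final Deque<String> queue = new ArrayDeque<>();
    private final int capacity;
    private final Policy policy;
    private long droppedCount = 0;

    public ThrottlingQueueSketch(int capacity, Policy policy) {
        this.capacity = capacity;
        this.policy = policy;
    }

    /** Push an event, applying the configured throttling policy when full. */
    public void push(String event) {
        if (queue.size() < capacity) {
            queue.addLast(event);
            return;
        }
        droppedCount++;  // count lost events either way, as KafkaEmitter does
        if (policy == Policy.DROP_OLDEST) {
            queue.pollFirst();     // evict the oldest event
            queue.addLast(event);  // admit the new one
        }
        // DROP_NEWEST: the incoming event is simply discarded
    }

    public long getDroppedCount() { return droppedCount; }
    public String peekOldest() { return queue.peekFirst(); }
}
```

With capacity 2 and DROP_OLDEST, pushing "a", "b", "c" leaves the queue holding "b" and "c" with one dropped event; with DROP_NEWEST it would still hold "a" and "b".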

@justinborromeo
Contributor

Looking at the code, I noticed that NonBlockingStatsDClient is a library class. Does it make sense to ignore it in the refactoring since we can't change it, or do you think a custom implementation should be written?

@justinborromeo
Contributor

Opened a proposal (#7075) since the change adds a config.

@leventov
Member Author

I've created an issue in the repo of that library: DataDog/java-dogstatsd-client#71

@justinborromeo
Contributor

Does that correspond to what we're using? NonBlockingStatsDClient isn't a Datadog library.

@justinborromeo
Contributor

Sorry, I misread something. You are correct.

@github-actions

github-actions bot commented Aug 2, 2023

This issue has been marked as stale due to 280 days of inactivity. It will be closed in 4 weeks if no further activity occurs. If this issue is still relevant, please simply write any comment. Even if closed, you can still revive the issue at any time or discuss it on the dev@druid.apache.org list. Thank you for your contributions.

github-actions bot added the stale label on Aug 2, 2023