
Refactor BatchLogProcessor #2494

Merged

Conversation

@utpilla (Contributor) commented Jan 3, 2025

Towards #2478

Changes

Performance

There is a significant performance improvement, mainly because the mpsc::Receiver is no longer excessively active (unlike on the main branch). It looks like there is a significant amount of synchronization overhead when senders and receivers are actively working on the channel all the time. With this PR, the channel receiver is only woken up when needed, so the logging threads incur less synchronization overhead.
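The worker-thread side of that wake-on-demand pattern could look roughly like the sketch below. The `BatchMessage` enum, its variant names, and the use of `recv_timeout` to double as the scheduled-delay timer are illustrative assumptions, not the PR's exact code:

```rust
use std::sync::mpsc;
use std::time::Duration;

// Hypothetical control messages; the PR's actual enum may differ.
enum BatchMessage {
    ExportLog, // a producer signaled that a batch is ready on the logs channel
    Shutdown,
}

fn worker(message_receiver: mpsc::Receiver<BatchMessage>, scheduled_delay: Duration) {
    loop {
        // The worker sleeps here and only wakes when a producer sends a
        // control message or the scheduled delay elapses, instead of
        // contending on the logs channel for every single record.
        match message_receiver.recv_timeout(scheduled_delay) {
            Ok(BatchMessage::ExportLog) => { /* drain the logs channel and export */ }
            Ok(BatchMessage::Shutdown) => break,
            Err(mpsc::RecvTimeoutError::Timeout) => { /* periodic export */ }
            Err(mpsc::RecvTimeoutError::Disconnected) => break,
        }
    }
}
```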

I updated the Logging_Comparable_To_Appender benchmark and the logs stress test to use a BatchProcessor with a no-op exporter and ran the tests. Here are the results:

  • 25% improvement in throughput, with stress test results going up from 12M (main) to 15M
  • ~46% improvement in benchmarks, with the latency measurement going down from 420ns (main) to 227ns

Merge requirement checklist

  • CONTRIBUTING guidelines followed
  • Unit tests added/updated (if applicable)
  • Appropriate CHANGELOG.md files updated for non-trivial, user-facing changes
  • Changes in public API reviewed (if applicable)

codecov bot commented Jan 3, 2025

Codecov Report

Attention: Patch coverage is 59.34066% with 37 lines in your changes missing coverage. Please review.

Project coverage is 77.9%. Comparing base (73859c3) to head (08687d1).
Report is 1 commit behind head on main.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| opentelemetry-sdk/src/logs/log_processor.rs | 59.3% | 37 Missing ⚠️ |
Additional details and impacted files
@@           Coverage Diff           @@
##            main   #2494     +/-   ##
=======================================
- Coverage   77.9%   77.9%   -0.1%     
=======================================
  Files        123     123             
  Lines      22881   22943     +62     
=======================================
+ Hits       17838   17875     +37     
- Misses      5043    5068     +25     


@lalitb (Member) commented Jan 3, 2025

Excellent approach! Please mark this ready for review once you feel it's complete :)

@utpilla utpilla marked this pull request as ready for review January 3, 2025 19:55
@utpilla utpilla requested a review from a team as a code owner January 3, 2025 19:55
@lalitb lalitb added the integration tests Run integration tests label Jan 6, 2025
```rust
// Check if a control message for exporting logs has already been sent to the worker thread.
// If not, send a control message to export logs.
// `export_log_message_sent` is set to false ONLY when the worker thread has processed the control message.
if !self.export_log_message_sent.swap(true, Ordering::Relaxed) {
```
Member commented:
Should we use compare_exchange instead of swap? That would avoid redundant writes if export_log_message_sent is already true.

@utpilla (author) replied on Jan 6, 2025:

I used swap because it's less verbose. Note that both swap and compare_exchange need exclusive access to the cache line. More importantly, a compare_exchange operation would need exclusive access even to perform the comparison.

I pushed some new changes. I have added a new if check using an atomic load to avoid aggressively calling swap. We now call swap only if the atomic load returned false. Let me know what you think.
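A minimal sketch of that check, with a hypothetical free function standing in for the method on the processor (`export_log_message_sent` is the PR's field; everything else here is illustrative):

```rust
use std::sync::atomic::{AtomicBool, Ordering};

// Hypothetical helper: only the thread that flips the flag from false to
// true gets to wake the worker.
fn maybe_notify_worker(export_log_message_sent: &AtomicBool, notify: impl FnOnce()) {
    // Cheap read first: if a wake-up is already pending, skip the
    // read-modify-write and avoid claiming the cache line exclusively.
    if !export_log_message_sent.load(Ordering::Relaxed)
        && !export_log_message_sent.swap(true, Ordering::Relaxed)
    {
        notify(); // e.g. send BatchMessage::ExportLog to the worker thread
    }
}
```

Only the thread that observes the false-to-true transition sends the message, so duplicate wake-ups are avoided without paying for a compare_exchange on every call.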


```rust
// This method gets up to `max_export_batch_size` logs from the channel and exports them.
// It returns the result of the export operation.
// It expects the `logs` vec to be empty when it's called.
```
Member commented:
Can we get rid of the logs vec and instead create a fresh array and pass its slice to the export method?

@utpilla (author) replied:
To use an array, we would need to know its size beforehand, which we don't, because we don't know how many items are present in the channel. It might be fewer than max_export_batch_size.

Member replied:
We can allocate an array of max_export_batch_size and slice it at the required length. Anyway, this is not a blocker.

@utpilla (author) replied:
Always stack-allocating a big array even to export very few logs seems wasteful.

What's the benefit of using an array instead of a vec here? We are reusing the same vec for each export anyway, so allocation isn't a concern.
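A sketch of that reuse, with a hypothetical signature for `get_logs_and_export` (the real function's parameters and return type may differ):

```rust
use std::sync::mpsc;

// Drain up to `max_export_batch_size` records into a buffer that the caller
// owns and reuses across exports, so no allocation happens on the hot path.
fn get_logs_and_export<T>(
    receiver: &mpsc::Receiver<T>,
    logs: &mut Vec<T>, // expected to be empty when called
    max_export_batch_size: usize,
) {
    while let Ok(log) = receiver.try_recv() {
        logs.push(log);
        if logs.len() == max_export_batch_size {
            break;
        }
    }
    // ... export `logs` here; omitted in this sketch ...
    logs.clear(); // keeps the allocated capacity for the next batch
}
```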

```diff
- let result = export_with_timeout_sync(
-     config.max_export_timeout,
-     &mut exporter,
+ let result = get_logs_and_export(
```
Member commented:
Same as flush: we need to ensure logs continue to be exported until all logs received up to the point shutdown was invoked have been processed.
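A sketch of the drain-until-empty behavior being asked for, under the assumption that the worker can keep pulling batches from the same logs channel (names are illustrative):

```rust
use std::sync::mpsc;

// On shutdown (and similarly on force-flush), keep exporting batches until
// the channel holds nothing that was received before the call.
fn drain_and_export<T>(
    receiver: &mpsc::Receiver<T>,
    logs: &mut Vec<T>,
    max_export_batch_size: usize,
) {
    loop {
        while let Ok(log) = receiver.try_recv() {
            logs.push(log);
            if logs.len() == max_export_batch_size {
                break;
            }
        }
        if logs.is_empty() {
            break; // channel fully drained
        }
        // ... export `logs` here; omitted in this sketch ...
        logs.clear();
    }
}
```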

@cijothomas (Member) left a comment:

Need to fix flush/shutdown, but good to merge and follow up.

Also, I am unsure about the perf testing, as any perf test would instantly fill up the channel, so the majority of items would take the "drop_due_to_full" path. Developing a testing strategy for the BatchProcessor is also a good candidate for another PR. Irrespective of that, the direction in this PR looks correct to me.

@utpilla (author) commented Jan 13, 2025

> Also, I am unsure about the perf testing, as any perf test would instantly fill up the channel, so the majority of items would take the "drop_due_to_full" path.

Since we are using a no-op exporter, the channel should be getting emptied really quickly as well. But yes, it would be nice to verify that objectively instead of relying on manual reasoning.
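One hypothetical way to verify it objectively (not part of this PR) would be a counter on the full-channel path that the stress test reports at the end:

```rust
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::mpsc::{SyncSender, TrySendError};

// Hypothetical instrumentation: count records that take the
// "drop_due_to_full" path so a stress run can report it afterwards.
struct InstrumentedProcessor<T> {
    log_sender: SyncSender<T>,
    dropped_due_to_full: AtomicUsize,
}

impl<T> InstrumentedProcessor<T> {
    fn emit(&self, log: T) {
        if let Err(TrySendError::Full(_)) = self.log_sender.try_send(log) {
            // Dropped because the channel was full; bump the counter
            // instead of dropping silently.
            self.dropped_due_to_full.fetch_add(1, Ordering::Relaxed);
        }
    }
}
```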

@cijothomas cijothomas merged commit 888d5a3 into open-telemetry:main Jan 14, 2025
20 of 21 checks passed
@utpilla utpilla mentioned this pull request Jan 14, 2025
Labels: integration tests
3 participants