-
Notifications
You must be signed in to change notification settings - Fork 2.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Leader Election Extension #34460
Comments
That sounds interesting. A similar approach is used in Metricbeat for covering the cluster level metrics collection when running as part of a Daemonset. II assume we want to achieve something similar here where the Collector will be running only as a Daemonset and the leader will be responsible for enabling the While this is useful from user experience perspective, from my past experience, it can be problematic when it comes to scale. |
Hey @ChrsMark! Thanks for the feedback. Yes, exactly the leader is responsible for enabling a sub-receiver (such as Yes, I fully agree with the concerns about resource limits/requests. However, as you said, it should be properly documented. Not sure if there is a way to workaround it. Regarding putting some extra load on the k8s API - do you mean querying/updating leases? |
@skhalash yes, but this is something that can also be properly documented along with some perf tests results so as users be aware of any possible impact to their clusters. |
Why does it have to be a separate receiver? I think this should be an extension providing an interface that any receiver can connect to to check if it's a leader (have to do the work) or not (do nothing). I would be happy to sponsor and review that |
Hey @dmitryax! Thanks so much for your response! We’d be happy to contribute such a component. I don’t have much experience with extensions—would this mean that every receiver needing this functionality would require code modifications? Implementing it as a delegating receiver, similar to receiver-creator, could avoid it. |
Modifying code is fine - better than having two similar receivers. It's better to write some code, but keep the user interface clean. Restricting it to receiver_creator, we won't be able to use leader election for receivers that don't work with receiver creator, for example /~https://github.com/open-telemetry/opentelemetry-collector-contrib/tree/main/receiver/k8seventsreceiver, or don't need to use it all, e.g. collecting data from a static endpoint. Also, future of receiver_creator is unclear. We might take a different approach to discovery and consolidate all scraping receivers in one, see open-telemetry/opentelemetry-collector#11238 |
Thanks for the response! We'll explore the idea of implementing leader election as an extension instead of a receiver and will get back to you. |
Hey @dmitryax, We're exploring a specific use case and wanted to get your thoughts. The scenario involves multiple receivers within a single running collector instance that may need to operate in singleton mode, using leader election. To support this, each receiver might need its own lease, allowing each to be managed by a different leader. We're considering two possible approaches:
receivers:
k8s_cluster:
singleton:
lease_name: foo
lease_namespace: default
k8s_events:
singleton:
lease_name: bar
lease_namespace: default
extensions:
singleton:
receivers:
k8s_cluster:
singleton:
name: singleton/foo
k8s_events:
singleton:
name: singleton/bar
extensions:
singleton/foo:
lease_name: foo
lease_namespace: default
singleton/bar:
lease_name: bar
lease_namespace: default The second approach seems simpler to implement. However, from what I’ve seen, there aren’t any examples in the wild of multiple instances of extensions being registered. Do you think this approach would be feasible? |
This is a very interesting feature, and many of the plugins in otel col contrib are currently restricted to running on a single instance. |
We did some investigation and agree that implementing leader election as an extension is indeed a cleaner solution. We will provide a PR with the implementation soon. |
hello @skhalash wondering what's the progress there ? |
Hi @chenlujjj I am actively working on this. I plan to create a PR with the implementation of the extension in this week. |
I sponsor this component |
As per review comments and guidelines the PR needs to be divided into several small PRs. |
The purpose and use-cases of the new component
There can be a use case where one needs to run components in an HA mode. But currently running multiple instances of components like
k8sclusterreceiver
wouldcause collection of duplicate metrics. In order to be able to run such receivers in an HA mode one needs to make sure only one receiver is active. This can be achieved by utilizing the leader election mechanism provided by Kubernetes client library.
The support for extension can be implemented as an extension so that any component that wants to leverage leader election can use it easily.
Example configuration for the component
Telemetry data types supported
traces, metrics, and logs
Is this a vendor-specific component?
Code Owner(s)
@skhalash @a-thaler @rakesh-garimella
Sponsor (optional)
No response
Additional context
I work at SAP on a project called Kyma: https://kyma-project.io/#/. In Kyma, we recently developed such a receiver and we are already testing it out in a production setup: /~https://github.com/kyma-project/opentelemetry-collector-components/tree/main/receiver/singletonreceivercreator.
We’ve noticed discussions in the community about introducing leader election in the k8sobjects and k8scluster receivers. That's why we believe that a generic mechanism could be quite beneficial. If there’s interest, we are ready to contribute it to the Open Telemetry project.
The text was updated successfully, but these errors were encountered: