Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Kafka Receiver does not include topic or receiver as a label #35336

Open
jhuizingh opened this issue Sep 20, 2024 · 7 comments · May be fixed by #36068
Open

Kafka Receiver does not include topic or receiver as a label #35336

jhuizingh opened this issue Sep 20, 2024 · 7 comments · May be fixed by #36068
Labels
enhancement New feature or request good first issue Good for newcomers receiver/kafka

Comments

@jhuizingh
Copy link
Contributor

Component(s)

receiver/kafka

What happened?

Description

The metrics emitted by the Kafka Receiver to not contain enough labels to determine which Kafka Receiver and topic they are referring to if you use multiple instances of the Kafka Receiver. Many other receivers include a receiver label which shows exactly which receiver the metric comes from by name, but the Kafka Receiver does not.

Steps to Reproduce

Create an otel collector with the following relevant pieces

  ... irrelevent parts omitted ...
  receivers:
    kafka/first:
      topic: kafka-topic-first
    kafka/second:
      topic: kafka-topic-second
  service:
    pipelines:
      metrics/kafka:
        receivers:
          - kafka/first
          - kafka/second
  ... irrelevent parts omitted ...

Expected Result

Metric in the form:

otelcol_kafka_receiver_offset_lag {
  cluster="mycluster",
  http_scheme="http",
  net_host_name="10.110.0.242",
  net_host_port="8888",
  partition="24",
  server_address="10.110.0.242",
  server_port="8888",
  service_name="otelcol-contrib",
  service_version="0.103.1",
  url_scheme="http",
  receiver="kafka/first",
  topic="kafka-topic-first"
}

Actual Result

Metric in the form:

otelcol_kafka_receiver_offset_lag {
  cluster="mycluster",
  http_scheme="http",
  net_host_name="10.110.0.242",
  net_host_port="8888",
  partition="24",
  server_address="10.110.0.242",
  server_port="8888",
  service_name="otelcol-contrib",
  service_version="0.103.1",
  url_scheme="http"
}

Collector version

0.103.1

Environment information

Environment

AWS EKS
Kubernetes v 1.30.3

Image: opentelemetry-collector-contrib
Helm Chart: opentelemetry-collector-0.95.0

OpenTelemetry Collector configuration

receivers:
    kafka/first:
      topic: kafka-topic-first
    kafka/second:
      topic: kafka-topic-second
  service:
    pipelines:
      metrics/kafka:
        receivers:
          - kafka/first
          - kafka/second

Log output

N/A

Additional context

No response

@jhuizingh jhuizingh added bug Something isn't working needs triage New item requiring triage labels Sep 20, 2024
Copy link
Contributor

Pinging code owners:

See Adding Labels via Comments if you do not have permissions to add labels yourself.

@StephanSalas
Copy link

Looks like this fix may be as simple as adding the missing attributes in a few places, such as:

@MovieStoreGuy
Copy link
Contributor

Hey,

I just wanted to clarify that you are referring to the internal metrics, correct?
I presume you don't want to mutate the data you're reading from the actual stream itself.

Then the fields you're looking to include are:

  • receiver
  • topic

Is that correct?

@jhuizingh
Copy link
Contributor Author

@MovieStoreGuy yes, exactly. Without those attributes, it can be challenging to understand how things are performing on a single deployment with multiple replicas and multiple Kafka receivers configured.

@StephanSalas
Copy link

StephanSalas commented Oct 29, 2024

@MovieStoreGuy, looks like this was partially fixed in v0.104.x version of OTEL Contrib with the "name" and "partition" attributes being added: https://github.com/open-telemetry/opentelemetry-collector-contrib/blob/release/v0.104.x/receiver/kafkareceiver/kafka_receiver.go. The problem is in the the v0.103.x version that @jhuizingh is referring to, so would recommend to them to upgrade to v0.104.x to at least obtain the new "name" and "partition" internal telemetry attributes.

As for the add topic attribute ask... I think adding topic attribute to a few of the relevant metrics makes a lot of sense because it will make it easier to observe what topic is actually being consumed rather than only having kafkareceiver/receiver name yaml as an item the troubleshooter has to backtrack to, just to obtain the topic name.

To address his other request to add topic to the attributes, I do have PR #36068 lined up. Please review :)

Copy link
Contributor

This issue has been inactive for 60 days. It will be closed in 60 days if there is no activity. To ping code owners by adding a component label, see Adding Labels via Comments, or if you are unsure of which component this issue relates to, please ping @open-telemetry/collector-contrib-triagers. If this issue is still relevant, please ping the code owners or leave a comment explaining why it is still relevant. Otherwise, please close it.

Pinging code owners:

See Adding Labels via Comments if you do not have permissions to add labels yourself.

@github-actions github-actions bot added the Stale label Dec 30, 2024
@jhuizingh
Copy link
Contributor Author

/label -stale

@github-actions github-actions bot removed the Stale label Dec 30, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request good first issue Good for newcomers receiver/kafka
Projects
None yet
3 participants