Open mx-psi opened 7 months ago
cc @open-telemetry/semconv-system-approvers
The challenge with any ID is that an ID is only truly usable in a specific context. Given the existence of a host in multiple contexts. Should we look at host.id as a graph and the variations of the ids as a node in a graph that has a graph id that is shared across the contexts.
This is something that the Entities SIG should look into before we make progress
host.id
is currently used as a catch-all convention for any sort of ID in cloud providers or machines alike, this makes it difficult to use by vendors to retrieve specific cloud provider IDs.Currently, a single host will have a single value for
host.id
; in certain environments you can rely on othercloud.
attributes likecloud.platform
to understand the specific value withinhost.id
. For example, ifcloud.platform
isaws_ec2
, then implictly this ensures thathost.id
, if present, will have the AWS EC2 instance id.Proposals like #576 make it so a single host may have multiple possible values for
host.id
; this makes it impossible for a vendor to identify the actual meaning ofhost.id
.Within the OpenTelemetry Github org, these are the current values for
host.id
other thanmachine-id
:WEBSITE_HOSTNAME
(Python contrib)A solution for this is introducing semantic conventions that are specific to a given cloud provider. For example, we currently have
gcp.gce.instance.name
and #600 proposes a similar convention for AWS EC2.