Closed icemanDD closed 3 weeks ago
Hi!,
If you have access rights to see the bucket, could you check if the file is there and it can be downloaded?
tempo_cluster_seed.json is available in the bucket, content:
{"UID":"...","created_at":"...","version":{"version":"main-...","revision":"...","branch":"main","buildUser":"","buildDate":"","goVersion":"..."}}
What is this file for? Does compactor need to frequently access and update the json file?
What is this file for? Does compactor need to frequently access and update the json file?
This file is used to report usage statistics. The problem could be a misconfiguration but it's hard to know from that error. If you don't need that feature (most likely you don't) just disable it:
https://grafana.com/docs/tempo/latest/configuration/#usage-report
Interesting, which components will default enable this and access the S3 file? Should I disable for all tempo micro services?
Interesting, which components will default enable this and access the S3 file? Should I disable for all tempo micro services?
This is not per component, it's a part of Tempo itself. It checks your config and sends the report back to Grafana
Then do we need to disable usage report in all Tempo components: distributor, ingester, compactor, metrics generator, querier and query frontend?
Then do we need to disable usage report in all Tempo components: distributor, ingester, compactor, metrics generator, querier and query frontend?
Hi, no, it's a global config:
Do you mean when I apply the global config in any of the tempo components, it will work for all? To clarify we are using separate yaml file to configure each component. After adding
usage_report:
reporting_enabled: false
to ingester and compactor, I still see high 4xx errors
Do you mean when I apply the global config in any of the tempo components, it will work for all? To clarify we are using separate yaml file to configure each component. After adding
usage_report: reporting_enabled: false
to ingester and compactor, I still see high 4xx errors
Please take a look at the documentation linked to some comments above: https://grafana.com/docs/tempo/latest/configuration/#usage-report
After setting this global config to multiple tempo component, I did not see 4xx errors go down, anything else I should try to stop Tempo from GET /tempo_cluster_seed.json
?
Can anyone else help take a look at this? We are getting extra cost because of the additional Get actions
So it seems like you are getting a lot of 4XX's and we believe it may be related to the usage reporting?
Are all 4xx's trying to get tempo_cluster_seed.json
?
Can you share rate of 4XX's by component?
Are there any logs in Tempo that might explain what is happening?
This issue has been automatically marked as stale because it has not had any activity in the past 60 days. The next time this stale check runs, the stale label will be removed if there is new activity. The issue will be closed after 15 days if there is no new activity. Please apply keepalive label to exempt this Issue.
Describe the bug Constantly see 4xx errors (~20% total requests) in AWS S3 buckets for tempo storage. Example error:
REST.GET.OBJECT tempo_cluster_seed.json "GET /tempo_cluster_seed.json HTTP/1.1" 400 InvalidArgument 432 - 17 - "-" "MinIO (linux; amd64) minio-go/v7.0.70"
To Reproduce Steps to reproduce the behavior:
Expected behavior No or minimal 4xx error rates for S3 requests.
Additional Context