PostHog / posthog

🦔 PostHog provides open-source product analytics, session recording, feature flagging and A/B testing that you can self-host.
https://posthog.com

feat: sample message too large to s3 #23374

Closed pauldambra closed 3 days ago

pauldambra commented 3 days ago

We get MessageTooLarge errors from Kafka, which means the event content exceeds the size Kafka will accept. Given how we process messages, we can be fairly sure these are caused by full snapshots, which can't be split into chunks.

This adds a new sample-rate config that dumps a selection of these messages to S3 so we can analyse them.

I anticipate manually setting the environment variable, monitoring the bucket until we have between 10 and 100 samples, and then turning it off so we can validate the contents.
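As a rough illustration of the idea (not the code in this PR), sampling behind a rate could look something like the sketch below. Every name here is hypothetical: `SampleStore` stands in for whatever wraps the S3 client, and `parseSampleRate`, `shouldSample` and `maybeSampleTooLargeMessage` are made up for the example. The rate would be read from an environment variable and default to 0, so nothing is written unless someone opts in.

```typescript
// Hypothetical sketch: none of these names are taken from the PR itself.

// Minimal "somewhere to put the sample" interface; a real version would wrap an S3 client.
interface SampleStore {
    put(key: string, body: string): Promise<void>
}

// Parse a sample rate from an environment-style string and clamp it to [0, 1].
// Missing, empty or unparseable values disable sampling.
function parseSampleRate(raw: string | undefined): number {
    if (raw === undefined || raw.trim() === '') {
        return 0
    }
    const rate = Number(raw)
    if (Number.isNaN(rate)) {
        return 0
    }
    return Math.min(1, Math.max(0, rate))
}

// Decide whether to sample this message. `random` is injectable so tests are deterministic.
function shouldSample(rate: number, random: () => number = Math.random): boolean {
    return rate > 0 && random() < rate
}

// Called from the place where the MessageTooLarge error is handled (assumed).
// Returns true only if a sample was actually written.
async function maybeSampleTooLargeMessage(
    store: SampleStore,
    rate: number,
    sessionId: string,
    payload: string,
    random: () => number = Math.random
): Promise<boolean> {
    if (!shouldSample(rate, random)) {
        return false
    }
    try {
        await store.put(`message-too-large/${sessionId}-${Date.now()}.json`, payload)
        return true
    } catch {
        // Sampling is diagnostic only, so a failed upload must never break ingestion.
        return false
    }
}
```

Keeping the default at 0 and swallowing upload failures means the sampling path can be switched on and off purely through configuration, which fits the plan of turning it on briefly and then removing it.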

Once we've gathered enough samples to take remedial steps in the SDK or during ingestion, we'll remove this code.

sentry-io[bot] commented 3 days ago

Suspect Issues

This pull request was deployed and Sentry observed the following issues:
