PostHog / posthog

🦔 PostHog provides open-source product analytics, session recording, feature flagging and A/B testing that you can self-host.
https://posthog.com
Other
21.23k stars 1.26k forks source link

pipeline 🤝 replay ingestion changes #23395

Open pauldambra opened 3 months ago

pauldambra commented 3 months ago

Feature request

pipeline and replay teams are going to trade time to improve heatmap_data and $exception event ingestion

 why change heatmap data ingestion?

  1. we get many heatmap_data items per event that carries them, so if we're under heavy load we automatically take a multiple of that load and it's hard to scale/react because the magnification is happening inside main event processing
  2. we want to make these changes without breaking main event ingestion. Also improves development speed by proxy.
  3. failure isolation, e.g. incident time we have more easier leavers we can pull
  4. not slowing down analytics ingestion
  5. cost we can optimize the independent clearly different work

TODO

why change $exception data ingestion

  1. we want to add more processing to these events, that will require changes to speed of processing, infra requirements, etc, we want to make these changes without breaking main event ingestion. Also improves development speed by proxy.
  2. failure isolation, e.g. incident time we have more easier leavers we can pull
  3. not slowing down analytics ingestion
  4. cost we can optimize the independent clearly different work

TODO

Debug info

No response

pauldambra commented 3 months ago

@tiina303 dumped my thoughts here since we've probably gone beyond slack

i'd be happy to discover i'm wrong so feel free to say what tasks i'm missing / or can delete / or shouldn't be trying to do etc etc