open-telemetry / otel-arrow

Protocol and libraries for sending and receiving OpenTelemetry data using Apache Arrow
Apache License 2.0
85 stars 16 forks source link

New arrow data types #85

Open djaglowski opened 1 year ago

djaglowski commented 1 year ago

Is your feature request related to a problem? Please describe. The discussion on https://github.com/open-telemetry/oteps/pull/171 included some comments (example) about potentially needing to define a new signal type in the collector, in order to support columnar encodings. I'd like to offer a more specific proposal for how this should be handled in the collector.

Describe the solution you'd like For each current data type supported in the collector, introduce a corresponding arrow data type. Names here are only meant as placeholders, but let's say atrace, ametric, and alog.

Introducing three distinct data types means that we could use the collector's pipeline system to our advantage.

jmacd commented 1 year ago

Thanks - I like this proposal.

lquerel commented 1 year ago

I really like this approach, as it brings us closer to optimizing the collector for transport and processing, enabling an optimized end-to-end Arrow representation without breaking the existing ecosystem.