Open djaglowski opened 1 year ago
Thanks - I like this proposal.
I really like this approach, as it brings us closer to optimizing the collector for transport and processing, enabling an optimized end-to-end Arrow representation without breaking the existing ecosystem.
Is your feature request related to a problem? Please describe. The discussion on https://github.com/open-telemetry/oteps/pull/171 included some comments (example) about potentially needing to define a new signal type in the collector, in order to support columnar encodings. I'd like to offer a more specific proposal for how this should be handled in the collector.
Describe the solution you'd like For each current data type supported in the collector, introduce a corresponding arrow data type. Names here are only meant as placeholders, but let's say
atrace
,ametric
, andalog
.Introducing three distinct data types means that we could use the collector's pipeline system to our advantage.
arrow receiver -> arrow processor -> to_pdata connector -> pdata processor -> to_arrow connector -> arrow exporter