eto-ai / rikai

Parquet-based ML data format optimized for working with unstructured data
https://rikai.readthedocs.io/en/latest/
Apache License 2.0
136 stars 19 forks

efficient way to deal with label attributes? #649

Open changhiskhan opened 2 years ago

changhiskhan commented 2 years ago

In the BDD100K dataset, each bounding box / label has attributes, e.g., whether the labeled object is occluded, the color of a traffic light, etc. These are easy to reason about in exploded form (one row per label), but running analytics on them is complicated in the nested format (one row per image with a list of labels).
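To make the nested vs. exploded distinction concrete, here is a minimal plain-Python sketch. It is not Rikai's API or schema: the record layout, the field names (`annotations`, `attributes`, `occluded`, `trafficLightColor`), and the `explode` helper are all assumptions loosely modeled on BDD100K-style labels.

```python
from collections import Counter

# Hypothetical nested records: one entry per image, labels nested inside.
# Field names are assumptions for illustration, not the Rikai schema.
images = [
    {
        "image_id": "a.jpg",
        "annotations": [
            {"category": "car", "attributes": {"occluded": True}},
            {
                "category": "traffic light",
                "attributes": {"occluded": False, "trafficLightColor": "red"},
            },
        ],
    },
    {
        "image_id": "b.jpg",
        "annotations": [
            {"category": "car", "attributes": {"occluded": False}},
            {
                "category": "traffic light",
                "attributes": {"occluded": False, "trafficLightColor": "green"},
            },
        ],
    },
]


def explode(records):
    """Yield one flat row per annotation, promoting attributes to top-level keys.

    Note: an attribute named "image_id" or "category" would overwrite
    the corresponding base key in this simple sketch.
    """
    for rec in records:
        for ann in rec["annotations"]:
            row = {"image_id": rec["image_id"], "category": ann["category"]}
            row.update(ann["attributes"])
            yield row


rows = list(explode(images))

# Once exploded, attribute analytics become simple filters and group-bys.
occluded_by_category = Counter(r["category"] for r in rows if r.get("occluded"))
light_colors = Counter(
    r["trafficLightColor"] for r in rows if "trafficLightColor" in r
)
```

In the exploded form each question is a one-line filter plus a count. Answering the same question directly against `images` requires walking the nested `annotations` list every time, which is the awkwardness described above.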
