A way to add a new column to datasets when they are being collected or when they are already exported. Maybe a python script to add tags to the datasets before being compressed?
Maybe while we do kafkacat -b ip:9092 -t winlogbeat -C -o end > file.json we can add a command to append a column named atttack ids to it with the specific technique IDs that we believe the dataset will contain?
kafkacat -b ip:9092 -t winlogbeat -C -o end > file.json
we can add a command to append a column namedatttack ids
to it with the specific technique IDs that we believe the dataset will contain?