Closed dhondta closed 1 year ago
The issue seems to come from pandas
while used when saving a dataset ;
>>> import pandas as pd
>>> data = pd.read_csv("/root/.packing-box/datasets/test-upx/data.csv", sep=";", parse_dates=['ctime', 'mtime'])
>>> data.label.value_counts().to_dict()
{'upx': 13}
While saving the dataset after making new samples, the counts
of the metadata show a ""
key with an inconsistent count.
This issue was fixed with a previous commit.
Steps to reproduce:
dataset make test-upx -n 5 -f PE -p upx
dataset make test-upx -n 20 -f PE -p upx
dataset show test-upx
Issue: While all the samples got labelled, the Labelled value is not 100%. While inspecting the
metadata.json
of the dataset, the total is not 25.