Geocene / trainset

A lightweight web application for brushing labels onto time series data; useful for building training sets.
https://trainset.geocene.com/
MIT License

I don't understand the file format #68

Closed by konradjurk 4 years ago

konradjurk commented 4 years ago

The required format is the following:

filename,timestamp,value,label
sample_trainset,2019-01-14T17:26:37+0100,29.4375,1
sample_trainset,2019-01-14T17:31:37+0100,28.5625,1
sample_trainset,2019-01-14T17:36:37+0100,21.8125,0
sample_trainset,2019-01-14T17:41:37+0100,19.875,0
sample_trainset,2019-01-14T17:46:37+0100,20.625,0
sample_trainset,2019-01-14T17:51:37+0100,19.6875,0
sample_trainset,2019-01-14T17:56:37+0100,20.125,0
sample_trainset,2019-01-14T18:01:37+0100,19.9375,0
sample_trainset,2019-01-14T18:06:37+0100,20.75,0
sample_trainset,2019-01-14T18:11:37+0100,21.0625,0
sample_trainset,2019-01-14T18:16:37+0100,20.3125,0
sample_trainset,2019-01-14T18:21:37+0100,20.625,0
sample_trainset,2019-01-14T18:26:37+0100,21,0
sample_trainset,2019-01-14T18:31:37+0100,21.3125,0
sample_trainset,2019-01-14T18:36:37+0100,21.3125,0
sample_trainset,2019-01-14T18:41:37+0100,21.75,0
sample_trainset,2019-01-14T18:46:37+0100,21.75,0

Why is the filename part of every line? Is it so that I can concatenate multiple CSV files into one? I don't understand :)

daterdots commented 4 years ago

Yeah, the idea is that later on it's easy to concatenate multiple outputs of TRAINSET into a single data set and still have the different filenames there. For example, if you read in all the output files using R or Python, you get one long data structure and you know the provenance of every timestamp+value pair.
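For anyone landing here later, a minimal Python sketch of that idea, using only the standard library. The two exports and their filenames (`sensor_a`, `sensor_b`) are made up for illustration and inlined as strings so the snippet runs as-is; the `read_trainset` helper is hypothetical and not part of TRAINSET. It assumes timestamps look like the sample above and keeps labels as plain strings.

```python
import csv
import io
from collections import Counter
from datetime import datetime

# Two hypothetical TRAINSET outputs, inlined so the sketch is self-contained.
# With real files you would use open(path, newline="") instead of io.StringIO.
EXPORT_A = """filename,timestamp,value,label
sensor_a,2019-01-14T17:26:37+0100,29.4375,1
sensor_a,2019-01-14T17:31:37+0100,28.5625,1
"""
EXPORT_B = """filename,timestamp,value,label
sensor_b,2019-01-14T17:36:37+0100,21.8125,0
"""

def read_trainset(handle):
    """Parse one export into a list of dicts (hypothetical helper)."""
    rows = []
    for row in csv.DictReader(handle):
        rows.append({
            "filename": row["filename"],
            # Assumes the timestamp layout shown in the sample: ISO date/time plus +HHMM offset.
            "timestamp": datetime.strptime(row["timestamp"], "%Y-%m-%dT%H:%M:%S%z"),
            "value": float(row["value"]),
            "label": row["label"],  # kept as a string
        })
    return rows

# Concatenate all exports into one long list of rows.
combined = []
for text in (EXPORT_A, EXPORT_B):
    combined.extend(read_trainset(io.StringIO(text)))

# Because every row carries its filename, provenance survives the concatenation.
rows_per_file = Counter(r["filename"] for r in combined)
print(dict(rows_per_file))
```

Since the filename travels with each row, the combined data can still be grouped or filtered per source series after the merge.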

konradjurk commented 4 years ago

@daterdots OK, thanks :)