Closed bensLine closed 7 years ago
Hi there, the .arff file will change many times later. For now we need to concentrate on collecting as many features as possible and incorporating them into it. Then we will do feature selection (based on Weka statistics we will throw some of the features out). Only after that does it make sense to train and test the model, not right now.
We moved past the binary classifiers.
According to the plots of the dummy and API data, most of the sightings in our data are of Pidgey. Therefore we want to modify the existing data set to classify whether a sighting is a Pidgey or not.
You can use the dummy data for this classifier, since it has about 600 entries whereas the apiData has about 2500. So if you want to save some time while building the classifier, stick to the small data set ;)
timestamp, latitude, longitude as attributes, all of which are numeric.
isPidgey as class label, which can be either true or false. The class label is true if the data entry has pokemonId == 16, otherwise it is false.
To create the .arff file just adapt one of the existing scripts we already have in the repo.
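For reference, here is a rough sketch of what such a script could produce. It is not one of the existing repo scripts. It assumes each data entry is available as a dict with the keys pokemonId, timestamp, latitude and longitude; the function name to_arff, the relation name and the sample values are made up for illustration.

```python
PIDGEY_ID = 16  # pokemonId of Pidgey, as described above


def to_arff(entries, relation="pidgey"):
    """Build the text of an .arff file from sighting dicts.

    Assumption: every entry has the keys pokemonId, timestamp,
    latitude and longitude. Adapt this to the real data layout.
    """
    lines = [
        f"@RELATION {relation}",
        "",
        "@ATTRIBUTE timestamp NUMERIC",
        "@ATTRIBUTE latitude NUMERIC",
        "@ATTRIBUTE longitude NUMERIC",
        "@ATTRIBUTE isPidgey {true,false}",
        "",
        "@DATA",
    ]
    for e in entries:
        # Class label: true only for Pidgey sightings.
        is_pidgey = "true" if int(e["pokemonId"]) == PIDGEY_ID else "false"
        lines.append(
            f'{e["timestamp"]},{e["latitude"]},{e["longitude"]},{is_pidgey}'
        )
    return "\n".join(lines) + "\n"


# Made-up sample entries, only to show the output format.
sample = [
    {"pokemonId": 16, "timestamp": 1469520187, "latitude": 48.1, "longitude": 11.5},
    {"pokemonId": 19, "timestamp": 1469520200, "latitude": 48.2, "longitude": 11.6},
]
arff_text = to_arff(sample)
```

Writing arff_text to a file with the .arff extension should give something Weka can load. The nominal class attribute {true,false} is what lets Weka treat isPidgey as the class label.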