Closed swiftCC closed 4 months ago
Hello, may I ask if you can share the other two datasets (.pkl format) besides the NYC dataset? Thanks
I have noted that each row in NYC.npy has the meaning of [user id, check-in location id, time in minutes]. Can you explain how to calculate the 'time in minutes' ? And when I want to use to dataset like Gowalla, the time is expressed as '2010-09-30T23:58:27Z'. How can I get the same format as the NYC does?
I have the same question. Have you solved it? and How to process the other three datasets mentioned in the article, apart from the NYC dataset?
I have noted that each row in NYC.npy has the meaning of [user id, check-in location id, time in minutes]. Can you explain how to calculate the 'time in minutes' ? And when I want to use to dataset like Gowalla, the time is expressed as '2010-09-30T23:58:27Z'. How can I get the same format as the NYC does?
Hi, the file "load.py" has taken care of it. Please refer to the code below (right after "# add the code below if you are using dividing time into minutes instead of hours"). For time in other slightly different formats, the key is just to transform time into second as the unit. One way is to set the first timestamp as 0 and then count how many seconds has passed since then for other timestamps.
data[:, -1] = np.array(data[:, -1]/60, dtype=np.int)
Hello, may I ask if you can share the other two datasets (.pkl format) besides the NYC dataset? Thanks
Hi, I do not keep the original dataset files. Please refer to the download link for other datasets. I also do not have the preprocessing code to get the raw data into the csv/npy format, unfortunately, and it was used by the 2016 STRNN paper.
I have noted that each row in NYC.npy has the meaning of [user id, check-in location id, time in minutes]. Can you explain how to calculate the 'time in minutes' ? And when I want to use to dataset like Gowalla, the time is expressed as '2010-09-30T23:58:27Z'. How can I get the same format as the NYC does?