Closed prateikarora closed 5 years ago
@prateikarora For 1 and 2, the link is a small piece of the road, which is a segment in math, while it is viewed as a point in this paper (we use one point to represent the link, not two points). The gps of the road segment is the middle point of the segment (mathematically). Similarly, the snode and enode are also segments in math, while they are viewed as points. For 3, the social attributes are described intime_feature_extraction_15min.py Thanks for your attention!
thank you
@bbliao I am sorry for so many questions, but I'm unfamiliar with deep learning, so I cannot get these answers through your code. I wish to develop a parametric model for traffic prediction for my thesis project and I wish to use this data. It would be very helpful of you if you could clear these queries.
@prateikarora I am sorry for your confusion. For the road network sub-dataset, the original dataset contains ∼450k road segments. however, we only release all the information (traffic speed, attributes, neighbors) of 15,073 central road segments because of disk capacity(the raw data is about 1TB), data incompleteness (not all the missing rate of the road segments are less than 1%), privacy and security. Besides, the Seq2Seq+NB method also needs the traffic speed of their neighbors, so we actually release the traffic speed of 44,172 (15,073 central road segments + their neighbors) road segments in the traffic speed sub-dataset. Given a central road segment, we construct its local directed connected road network from road_network_sub_dataset, and five predecessors and five successors in the local directed connected road network are selected based on PageRank score, resulted in the neighbours_1km.txt file. Note that the neighbors and the central road segment may not be adjacent. Of course, the traffic speed of these neighbors is provided as mentioned before. Therefore,
Yes, the snode and enode are the adjacent links/road sections of the central road segment, but the extracted neighbors (using PageRank) and the central road segment may not be adjacent;
You can just ignore the redundant road segments id;
Yes, we can not provide the complete road network due to data incompleteness (not all the missing rate of the road segments are less than 1%), privacy and security;
We only provide all the information (traffic speed, attributes, neighbors) of 15,073 central road segments and the traffic speed of 44,172 (15,073 central road segments + their neighbors) road segments. We do not care about the situations you mentioned because of data incompleteness;
Yes, the neighbors are all within 1km (physical distance using GPS coordinates) from the central road segments. For computation efficiency, given a central road segment, we select five predecessors and five successors from its local network.
Thanks for your attention!
@bbliao Thank you very much for your time and effort. This really cleared my confusion. I only have one final query, how to identify/locate/separate these 15,073 central segments from the total segments?
@prateikarora The first column of the neighbours_1km.txt is the 15,073 central road segments, the 2nd-6th and 7th-11th columns are the predecessors and successors, respectively.
Hi. My questions may seem a bit trivial but I could not find explicit explanation in the paper.