Closed 123yu456 closed 2 months ago
another question: you mentioned in your paper,"For a vehicle in the dataset, its original trajectory throughout the highway section, which is approximately 50 to 70 seconds in time length, is evenly partitioned into 50 short-term trajectories, each with 5 s length of time. Each trajectory represents a driving scene involving different situations and different kinds of interactions with the surrounding vehicles. 35 trajectories among them are randomly selected and serve as the training data for reward function learning. The rest 15 trajectories serve as the testing conditions, where the learned reward function is used to select the candidate trajectories. " Does this mean that the first 35 tracks used for training were not selected in chronological order of vehicle movement? The vehicle trajectory used for training may actually occur after the trajectory data used for testing?
Here are my answers to your questions:
Thank you very much for your reply, which is very helpful to me!