Open IranQin opened 2 weeks ago
on any given AD video
Certainly. I am not sure what kind of video format you want to use, but the easiest way would be to extract the initial frame and set the --dataset
argument of the sampling script as IMG.
using text conditions
Our model does not accept textual inputs, as text can be unintuitive and ambiguous when specifying actions in the task of autonomous driving.
Could you provide a mapping between the command and category of [0,1,2,3] ?
Hi, we translate trajectories into commands following ST-P3. Please refer to this issue https://github.com/OpenDriveLab/Vista/issues/3 for details.
Can it be tested on any given AD video using text conditions?