BraveGroup / LAW

Enhancing End-to-End Autonomous Driving with Latent World Model
MIT License
73 stars 0 forks source link

Question about latent supervision. #1

Closed synsin0 closed 2 months ago

synsin0 commented 3 months ago

Thanks for your amazing work! I'd like to ask about latent supervision. I think it is similar to PETR approach with a set of queries cross attention with 3D PE + image features. Then these queries are quantized and cross attention to decode waypoints only. Do you use pretrained PETR model or use intermediate supervision? Only using waypoint as supervision is similar to BEV-planner approach with a sparse paradigm. I'm really curious about it. Thanks again.

liyingyanUCAS commented 3 months ago

Our latent supervision is not related to PETR. We only utilize PETR's 3D PE to enhance the image feature, without using a pretrained PETR model or intermediate supervision. The latent supervision comes from the latent world model(Eq.7) and waypoints(Eq.4).

synsin0 commented 2 months ago

Thanks for your answers. Look forward to your code!