zhanghm1995 / Forge_VFM4AD

A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.
218 stars 8 forks source link
3dgs adaptation autonomous-driving diffusion end-to-end-autonomous-driving foundation-model large-language-models nerf pre-training survey world-models

Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities

Awesome GitHub license

This is the partner repository for the survey paper Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities. The repository will be continuously updated to track the progress of forging VFMs for AD. We hope this repository can act as a quick reference for researchers who wish to read the relevant papers and implement the associated methods.

Authors: Xu Yan, Haiming Zhang, Yingjie Cai, Jingming Guo, Weichao Qiu, Bin Gao, Kaiqiang Zhou, Yue Zhao, Huan Jin, Jiantao Gao, Zhen Li, Lihui Jiang, Wei Zhang, Hongbo Zhang, Dengxin Dai and Bingbing Liu.


Our survey at a glance.


Research tree of forging vision foundation models for autonomous driving.

NOTE: Here we have select a number of featured papers for each part, and almost for each paper we have included the abstract and a figure from the original paper, showing the main framework or motivations, to help us take a glance about these papers (You can expand the Abstract button to see them). More papers list and details can be found in our survey paper.

Welcome to star this repository!

We greatly appreciate any contributions via PRs, issues, emails, or other methods.

Citation

If this work is helpful for your research, please consider citing the following BibTeX entry :

@misc{yan2024forging,
      title={Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities}, 
      author={Xu Yan and Haiming Zhang and Yingjie Cai and Jingming Guo and Weichao Qiu and Bin Gao and Kaiqiang Zhou and Yue Zhao and Huan Jin and Jiantao Gao and Zhen Li and Lihui Jiang and Wei Zhang and Hongbo Zhang and Dengxin Dai and Bingbing Liu},
      year={2024},
      eprint={2401.08045},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

📰 News

Table of Content

Related Survey Papers

👆 Back to Top

Data Preparation

GAN

👆 Back to Top

Diffusion

👆 Back to Top

NeRF

👆 Back to Top

3D Gaussian Splatting

👆 Back to Top

Self-supervised Training

Contrastive

A survey paper of contrastive-based self-supervised learning: A survey on contrastive self-supervised learning.

👆 Back to Top

Reconstruction

👆 Back to Top

Distillation

👆 Back to Top

Rendering

👆 Back to Top

World Model

👆 Back to Top

Adaptation

Vision Foundation Models

👆 Back to Top

Large Language Models

👆 Back to Top

Multimodal Foundation Models

👆 Back to Top

Citation

If this work is helpful for your research, please consider citing the following BibTeX entry.

@misc{yan2024forging,
      title={Forging Vision Foundation Models for Autonomous Driving: Challenges, Methodologies, and Opportunities}, 
      author={Xu Yan and Haiming Zhang and Yingjie Cai and Jingming Guo and Weichao Qiu and Bin Gao and Kaiqiang Zhou and Yue Zhao and Huan Jin and Jiantao Gao and Zhen Li and Lihui Jiang and Wei Zhang and Hongbo Zhang and Dengxin Dai and Bingbing Liu},
      year={2024},
      eprint={2401.08045},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}