donydchen / mvsplat

🌊 [ECCV'24 Oral] MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images
https://donydchen.github.io/mvsplat
MIT License
750 stars 35 forks source link

What are the best practices for incorporating custom dataset inputs? #21

Closed Harshitavardhani closed 5 months ago

Harshitavardhani commented 5 months ago

Hi @donydchen, kudos to your work. Your work MVSplat has my keen interest and I'm particularly interested in using it with custom datasets, but I'm having some trouble with a few things.

1.Do i have to upload custom dataset to the YouTube and use the URL, like in your dataset? Is there another approach I can take? If yes, could you please tell that approach? 2.How do you generate timestamps, camera poses, images, and keys for a particular video?

Thank you in advance.

donydchen commented 5 months ago

Hi @Harshitavardhani, thanks for your interest in our work.

  1. No. It is unnecessary to upload the custom data to YouTube, that is merely the way RealEstate10K provides its data. Our dataloader loads data from local storage. For building a customized dataloader, kindly refer to https://github.com/donydchen/mvsplat/issues/1

  2. In this project, all datasets we use provide camera poses and extracted image frames. If you intend to work on your own video, you can check and follow how RealEstate10K or ACID preprocess their data.