smlab-niser / tirtha-public

Project Tirtha
GNU Affero General Public License v3.0
9 stars 7 forks source link

[EN] Faster & better (learned) feature extraction (SuperPoint / SuperGlue etc.) & matching (LoFTR etc.) #35

Open JeS24 opened 10 months ago

JeS24 commented 10 months ago

Feature request / Suggestion

Currently, we use the executables from AliceVision Meshroom (and COLMAP for Gaussian Splatting) to perform all the photogrammetry steps, including feature extraction and feature matching. While we have taken steps (batching and parallelization) to ensure high throughput and performance, these steps are still limited in several ways – primary ones include slower overall processing as compared to COLMAP and the feature extraction being reliant on traditional computer vision heuristics that could be improved by learning-based approaches, such as SuperPoint / SuperGlue / LightGlue (see: glue-factory), combine with LoFTR / DeepMatcher (also see: SuperCOLMAP). This project would involve a survey of the state-of-the-art for the most suitable methods, followed by integration of the same in the Tirtha pipeline (with tests).

Possible implementation

  1. Survey the state-of-the-art (check the next section for some starting points) and gather results using some methods on a set of small image sets from Tirtha's database.
  2. Check if the most well-performing models are a good fit for Tirtha (FOSS + integration-wise).
  3. Implement the best model(s) in Tirtha's pipeline.

Resources

Self-check

github-actions[bot] commented 8 months ago

This issue is stale because it has been open for 60 days with no activity. Remove stale label or comment to re-open.