Open tianyi-1349 opened 5 months ago
Sorry for the late reply. Could you clarify the second question?
Regarding the first question, it depends on the output resolution and batch size. When reproducing, e.g., our mega1500 test results, inference takes roughly 1 second per image pair. If you run in batched mode at a lower resolution, you might get down to about 0.1 seconds per pair.
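If you want to check the numbers on your own hardware, a rough timing harness along these lines might help. It is only a sketch: `seconds_per_pair` and `dummy_match` are hypothetical names, not part of this repository, and `dummy_match` is a stand-in that you would replace with a call into the actual matcher at your chosen resolution.

```python
import time


def seconds_per_pair(match_fn, pairs, batch_size=1, warmup=1):
    """Return the average wall-clock seconds spent per image pair."""
    # Split the pairs into batches of at most `batch_size`.
    batches = [pairs[i:i + batch_size] for i in range(0, len(pairs), batch_size)]

    # Warm-up batches are run but not timed, since first calls often
    # include one-off setup cost.
    for batch in batches[:warmup]:
        match_fn(batch)

    start = time.perf_counter()
    for batch in batches:
        match_fn(batch)
    # Note: for a GPU model, synchronize the device before reading the
    # timer (e.g. torch.cuda.synchronize() in PyTorch), otherwise
    # asynchronous kernels make the measurement too optimistic.
    elapsed = time.perf_counter() - start
    return elapsed / len(pairs)


def dummy_match(batch):
    """Hypothetical stand-in for the real matcher: fixed cost per call."""
    time.sleep(0.01)
    return [None] * len(batch)


pairs = list(range(8))  # placeholder for (image_a, image_b) tuples
print("batch_size=1:", seconds_per_pair(dummy_match, pairs, batch_size=1))
print("batch_size=4:", seconds_per_pair(dummy_match, pairs, batch_size=4))
```

With the dummy matcher, the per-pair time drops as the batch size grows because the fixed per-call cost is shared. The real speedup depends on your GPU, resolution, and memory.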
What is the overall inference speed? Is there any relevant test data? Thank you.