-
Ideas:
- ChatGPT-style search (definitions, source code)
- Input could be text (questions), images, video
- Output could be text and images
- Generate (VAE, GAN) Multimodal (Text, Images, Video, Audi…
-
### Prerequisite
- [X] I have searched [Issues](https://github.com/open-mmlab/mmocr/issues) and [Discussions](https://github.com/open-mmlab/mmocr/discussions) but cannot get the expected help.
- […
-
Good work. Are you planning to upload the dataset_nuscenes and train_nuscenes files? I really want to test it.
-
Hi, thank you for the great work! In the code I saw options to run on the nuScenes dataset. It seems that you converted the nuScenes dataset to the SemanticKITTI format. I did not find existing tools to do t…
-
DDPM is known to be time-consuming, and I am not sure whether it is suitable for video segmentation tasks. How much time is needed for video object segmentation?
-
Do you have a recommendation for an image parsing model?
-
The Mapillary dataset currently available on the official webpage is version 2.0.
The download instructions are based on v1.1.
-
Thanks for the work. If only detection is used instead of both seg and det, will the mAP decrease? Can you estimate the magnitude of the decline?
-
I want to get the panoptic segmentation results for a series of images. How can I run inference from a local terminal?