cvat-ai / cvat

Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
https://cvat.ai
MIT License
11.75k stars 2.88k forks

Voice ASR Transcription and Natural Language Entity Extraction NLP and 2-3D Point Cloud and Pure 3D Point Cloud Segmentation Open Source Tool Reference #6814

Open KTXKIKI opened 9 months ago

KTXKIKI commented 9 months ago

Voice ASR
https://github.com/PaddlePaddle/PaddleSpeech
https://github.com/nl8590687/ASRT_SpeechRecognition
https://github.com/nobody132/masr
https://github.com/espnet/espnet
https://github.com/wenet-e2e/wenet
https://github.com/mozilla/DeepSpeech

Natural language entity extraction (NLP)
https://github.com/hankcs/pyhanlp

2-3D point cloud and pure 3D point cloud segmentation
https://github.com/PJLab-ADG/SensorsCalibration
https://github.com/walzimmer/3d-bat
https://github.com/Hitachi-Automotive-And-Industry-Lab/semantic-segmentation-editor

bsekachev commented 9 months ago

@KTXKIKI

Thank you for the information you provided, but it is not clear what we are supposed to do with it. These features have not been implemented because they are not currently a priority for us, not because we lack references.

If you want to help, you could create an aggregated feature request for the 3D features you would like to have, describing examples, use cases, and concepts. What do you think?

As for voice ASR, I do not think it is relevant for now. It is an even lower priority than 3D and multi-modal CVAT data scenarios.

KTXKIKI commented 9 months ago

Oh, I do want to help you. The CVAT team hopes that CVAT will become number one in the field of computer vision annotation. My skills are not strong yet, but I am working hard to learn. Once my skills are sufficient, I will be very willing to develop and contribute code together with you. In the future, if I have enough funds, I will also be very willing to become a paying enterprise customer. I hope CVAT will become stronger and stronger @bsekachev