prannaykaul / mm-ovod

Official repo for our ICML 23 paper: "Multi-Modal Classifiers for Open-Vocabulary Object Detection"
84 stars 7 forks

Request for Code Release #1

Open rotem154154 opened 1 year ago

rotem154154 commented 1 year ago

Used AskYourPDF Sure, I can help you with that. Here is a draft of a polite request for code release:

Dear Authors,

I recently came across your paper titled "Open-Vocabulary Object Detection using Language and Vision" and found it to be extremely insightful. Your work on open-vocabulary object detection (OVOD) that can detect objects beyond the set of categories seen at training is truly innovative. I am particularly impressed by the three contributions you made: text-based classifiers, vision-based classifiers, and multi-modal classifiers.

I am currently working on a project that could greatly benefit from your approach. I believe that your model could significantly enhance our research and potentially lead to further advancements in this field.

I noticed that the code for your paper has not been released on GitHub. I understand that preparing code for release can be a time-consuming process, but I would greatly appreciate it if you could consider making it available. Having access to your code would allow me and others in the community to fully understand your methodology and potentially build upon your work.

Thank you for considering my request. I look forward to the possibility of further exploring your work.

Best regards, Rotem

prannaykaul commented 1 year ago

Hello, the timeframe for the code to be released is 2 weeks. Trying to prioritise clean code before the ICML conference. Thank you.

eternaldolphin commented 1 year ago

When will the training code for the visual aggregator be released?

yhxu022 commented 1 year ago

When will the training code for the visual aggregator be released?

Same question 😭

shaniaos commented 1 year ago

I also have the same question. I hope the training code can be released so that I can build on this nice work.