Open 545487677 opened 5 months ago
@AI-HPC-Research-Team I would also like to know how to get the pretrain data. Could you provide these data?
I apologize for not being able to upload the extensive pretrain image data to the platform. However, I can guide you through a simpler download process from PubChem:
For the pretrain text data, the Mol-Instruction dataset available at https://huggingface.co/datasets/zjunlp/Mol-Instructions offers a more comprehensive dataset that is homologous to ours but with higher standardization.
Hi, thank you for sharing such a great work! However, I can't find the image2d file in the dataset. Can you tell me how can I get the dataset? Thank you!!