HelloJocelynLu / t5chem

Transformer-based model for chemical reactions
MIT License
58 stars 14 forks source link

Dataset damage #19

Closed WangYitian123 closed 2 weeks ago

WangYitian123 commented 2 weeks ago

Hi, I downloaded the dataset from https://yzhang.hpc.nyu.edu/T5Chem/index.html. For USPTO_500_MT and USPTO_TPL dataset, I cannot open them correctly. However, for other dataset, such as USPTO-MIT, it worked.

Thanks a lot.

WangYitian123 commented 2 weeks ago

Could you please tell me the correct encoding format for these files and what should be used to open them?

HelloJocelynLu commented 2 weeks ago

Hi,

Thank you for the report. I suspect an issue with the webpage download links (which I will address soon! Appreciate the heads-up). Here's the usual process I followed:

Step 1: Copy the download link of the dataset you're interested in. Step 2: Download the dataset:

wget https://yzhang.hpc.nyu.edu/T5Chem/data/USPTO_500_MT.tar.bz2

Step 3: Unpack the compressed file:

tar -xvf USPTO_500_MT.tar.bz2

Here's what I found on MacOS:

image

I hope this information is useful!

WangYitian123 commented 2 weeks ago

Hi,

Thanks a lot. You are so nice. I have solved the problem and the webpage download links are correct.

------------------ 原始邮件 ------------------ 发件人: @.>; 发送时间: 2024年6月14日(星期五) 下午4:48 收件人: @.>; 抄送: @.>; "State @.>; 主题: Re: [HelloJocelynLu/t5chem] Dataset damage (Issue #19)

Hi,

Thank you for the report. I suspect an issue with the webpage download links (which I will address soon! Appreciate the heads-up). Here's the usual process I followed:

Step 1: Copy the download link of the dataset you're interested in. Step 2: Download the dataset: wget https://yzhang.hpc.nyu.edu/T5Chem/data/USPTO_500_MT.tar.bz2

Step 3: Unpack the compressed file: tar -xvf USPTO_500_MT.tar.bz2
Here's what I found on MacOS: image.png (view on web)

I hope this information is useful!

— Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you modified the open/close state.Message ID: @.***>