felix-schmitt / FormulaNet

FormulaNet is a new large-scale Mathematical Formula Detection dataset.
Creative Commons Attribution 4.0 International
14 stars 10 forks source link

Error #1

Closed NaivePawn closed 2 years ago

NaivePawn commented 2 years ago

download hep-th/0003025v2: 5%|██▌ | 122/2392 [00:11<03:28, 10.88it/s, download images] Traceback (most recent call last): File "download.py", line 88, in download("urls.txt") File "download.py", line 77, in download resize_image(image_list[page_number], {'w': 1447, 'h': 2048}, str(train_img / page)) IndexError: list index out of range

felix-schmitt commented 2 years ago

Hi Can you please give me more information about your setup as I could not reproduce your error.

Siri-2001 commented 2 years ago

Hi!I have the same error. My setup as follows: python 3.7.13 Linux version 5.15.0-52-generic (buildd@lcy02-amd64-045) (gcc (Ubuntu 9.4.0-1ubuntu1~20.04.1) 9.4.0, GNU ld (GNU Binutils for Ubuntu) 2.34) #58~20.04.1-Ubuntu SMP Thu Oct 13 13:09:46 UTC 2022 Cuda compilation tools, release 9.1, V9.1.85

felix-schmitt commented 2 years ago

Can you please check if the generated PDF of hep-th/0003025v2 has 30 pages? If not, can you please send me the log file (0003025.log).

felix-schmitt commented 2 years ago

please use the dockerfile