felix-schmitt / FormulaNet

FormulaNet is a new large-scale Mathematical Formula Detection dataset.
Creative Commons Attribution 4.0 International
14 stars 10 forks source link

download failed #2

Closed Siri-2001 closed 2 years ago

Siri-2001 commented 2 years ago

When I was downloading images,a lot of error as below occured.Is it because of the copyrights? Could not create PDF of 0001005. Try to increase the waiting_time and check the log to identify missing latex packages

felix-schmitt commented 2 years ago

Hi No, it could not compile the latex file. Do you have texlive-full installed? If yes, please increase the waiting_time https://github.com/felix-schmitt/FormulaNet/blob/ed8121b09c5db376355fc0b2b3b573a269986e4e/download.py#L71. If this does not help, please check the latex log file in the temp folder and try to compile it manually.

Siri-2001 commented 2 years ago

Hi No, it could not compile the latex file. Do you have texlive-full installed? If yes, please increase the waiting_time

https://github.com/felix-schmitt/FormulaNet/blob/ed8121b09c5db376355fc0b2b3b573a269986e4e/download.py#L71

. If this does not help, please check the latex log file in the temp folder and try to compile it manually.

Thank you! It helps. But I still have errors as below. Traceback (most recent call last): File "download.py", line 100, in download("urls.txt") File "download.py", line 88, in download resize_image(image_list[page_number], {'w': 1447, 'h': 2048}, str(train_img / page)) IndexError: list index out of range

felix-schmitt commented 2 years ago

see #1