tesseract-ocr / langdata_lstm

Data used for LSTM model training
Apache License 2.0
115 stars 152 forks source link

Alternative way to download langdata_lstm master file instead from github #38

Closed timjin520 closed 11 months ago

timjin520 commented 4 years ago

I'm trying to download the langdata_lstm from a work laptop. However, I couldnt download this file in github due to a firewall block which i have no control on. Is there another site that i can download this file from?

thank you

stweil commented 4 years ago

Does your firewall block GitHub, or what exactly is blocked?

timjin520 commented 4 years ago

Thank you for your reply. Our firewall blocks anything that download from github. Only file I want to get is langdata_lsmt master zip file which I couldn't find outside of github.

On Thu, Jun 25, 2020, 2:54 PM Stefan Weil notifications@github.com wrote:

Does your firewall block GitHub, or what exactly is blocked?

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/tesseract-ocr/langdata_lstm/issues/38#issuecomment-649758131, or unsubscribe https://github.com/notifications/unsubscribe-auth/AQCQBZECYN3HQWICNLJFNDTRYOMOTANCNFSM4OIUOLDQ .

stweil commented 4 years ago

If only downloads are blocked, git clone https://github.com/tesseract-ocr/langdata_lstm.git should work and would also get the desired files.

timjin520 commented 4 years ago

Sorry, that also blocks that. Is there a other site where I can download it?

On Thu, Jun 25, 2020, 4:34 PM Stefan Weil notifications@github.com wrote:

If only downloads are blocked, git clone should work and would also get the desired files.

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/tesseract-ocr/langdata_lstm/issues/38#issuecomment-649803327, or unsubscribe https://github.com/notifications/unsubscribe-auth/AQCQBZFHMXTDCJVBDBE3K5DRYOYEJANCNFSM4OIUOLDQ .

stweil commented 4 years ago

But you are able to write here, so you have access to https://github.com/. Then why does git clone not work?

timjin520 commented 4 years ago

I do and im able to access and view github. Sorry I work at a federal building. Their restrictions are little high. They have a firewall that blocks an outgoing connections when I use github app, python, maven, etc. will not work if I do cloning or download files from github. And cannot use external hd on federal machines. So I cannot modify the restrictions. Only thing I would need for my project is the langdata master zip. I want to know if theres any other site that have that zip file outside of github. I can't find it on google.

FYI, I cannot login gmail on my work machine (using my phone). So a download link that is not github would be helpful. Hopefully it works.

Thank you

On Fri, Jun 26, 2020, 1:06 AM Stefan Weil notifications@github.com wrote:

But you are able to write here, so you have access to https://github.com/. Then why does git clone not work?

— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/tesseract-ocr/langdata_lstm/issues/38#issuecomment-649969089, or unsubscribe https://github.com/notifications/unsubscribe-auth/AQCQBZEUPHZAIIRSIEICVH3RYQUFNANCNFSM4OIUOLDQ .

zdenop commented 4 years ago

As far as we know these data are not available anywhere else. IMO you try to solve problem on wrong end: if you need these data on your work computer, you have to cooperate with your IT support to make it available for you. Downloading&using data/programs from not official source is security risks.

stweil commented 4 years ago

@timjin520, you might try https://code.bib.uni-mannheim.de/UB-Mannheim/langdata_lstm.

timjin520 commented 4 years ago

Thank you so much!!! I got it downloaded. Thank you so much for your help!

On Fri, Jun 26, 2020, 12:45 PM Stefan Weil notifications@github.com wrote:

@timjin520 https://github.com/timjin520, you might try https://code.bib.uni-mannheim.de/UB-Mannheim/langdata_lstm.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/tesseract-ocr/langdata_lstm/issues/38#issuecomment-650280358, or unsubscribe https://github.com/notifications/unsubscribe-auth/AQCQBZEPCJRJ3GYNYAHLT3TRYTGBZANCNFSM4OIUOLDQ .

tfmorris commented 11 months ago

@stweil Sounds like this is resolved/answered.

stweil commented 11 months ago

Yes, I think so, too, and closed this issue.