openeventdata / mordecai

Full text geoparsing as a Python library
MIT License
742 stars 97 forks source link

Can't download geonames_index.tar.gz using wget #115

Closed jangboolee closed 1 year ago

jangboolee commented 1 year ago

I am trying to download geonames_index.tar.gz to get mordecai running on my Windows 10 PC. Following the instructions on the README file, I tried using the command wget https://andrewhalterman.com/files/geonames_index.tar.gz --output-file=wget_log.txt on Windows PowerShell, but this results in the following error message:

Invoke-WebRequest : A positional parameter cannot be found that accepts argument '--output-file=wget_log.txt'.
At line:1 char:1
+ wget https://andrewhalterman.com/files/geonames_index.tar.gz --output ...
+ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    + CategoryInfo          : InvalidArgument: (:) [Invoke-WebRequest], ParameterBindingException
    + FullyQualifiedErrorId : PositionalParameterNotFound,Microsoft.PowerShell.Commands.InvokeWebRequestCommand

I am able to stream the contents of the file by removing the "--output-file=wget_log.txt" part and running the following command:

wget https://andrewhalterman.com/files/geonames_index.tar.gz

However, the command retrieves the stream for over 90 minutes, and terminates with the following error message:

wget : Stream was too long.
At line:1 char:1
+ wget https://andrewhalterman.com/files/geonames_index.tar.gz
+ ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
    + CategoryInfo          : NotSpecified: (:) [Invoke-WebRequest], IOException
    + FullyQualifiedErrorId : System.IO.IOException,Microsoft.PowerShell.Commands.InvokeWebRequestCommand

I also referred to this post on SO, and tried modifying the command to wget https://andrewhalterman.com/files/geonames_index.tar.gz --output-document=wget_log.txt, but that also results in the same Invoke-WebRequest error.

I am using PSVersion 5.1.19041.3031 on Windows 10. Is there a way to get around this issue?

jangboolee commented 1 year ago

Update: I realized I can just enter the URL into the browser and manually download the geonames index file. Oops.