clulab / eidos

Machine reading system for World Modelers
Apache License 2.0

Connection timeout for geonames index and idea for cache location #731

Closed: bgyori closed this issue 4 years ago

bgyori commented 4 years ago

The URL for downloading the GeoNames index seems to be unreachable at the moment, and Eidos errors out at launch on the "Downloading the GeoNames index from http://clulab.cs.arizona.edu/models/geonames+woredas.zip" line. Related to this immediate issue (which is probably external to Eidos), I've been wondering whether it would be better to have a fixed location for the GeoNames cache on a given machine (e.g., a folder called ~/.eidos/) rather than always looking for the cache in the current working folder, which can be any path Eidos is run from and often changes. Any thoughts?

MihaiSurdeanu commented 4 years ago

You're right: the zip file is not accessible through HTTP, which makes me believe the CS dept. web server is down. Just FYI, the file is accessible through the local file system, once I log in. I will start a CS ticket for this issue.

I like the .eidos idea! @kwalcock, what do you think?

kwalcock commented 4 years ago

I'm not sure about the web servers. cs.arizona.edu seems to be OK, but clulab.cs.arizona.edu is not. Any idea where this is hosted? The downloading of geonames at runtime is slated to be replaced with a compile-time dependency, so that problem should eventually go away.

The cache directory is set in eidos.conf and can be changed to an absolute rather than relative path. I'm not sure whether ~ is allowed, but suspect it is not. It doesn't seem outlandish to insist that the program be called from the same working directory in order for it to reuse a cache. It is possible to have multiple caches that were built using different options. Is there a use case in which eidos needs to be called from different working folders?
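Since ~ may not be expanded when eidos.conf is read, one way around it is to expand the path yourself and put the resulting absolute path into the config. The snippet below is only a minimal sketch: `resolve_cache_dir` is a hypothetical helper (not part of Eidos), and `~/.eidos/cache` is just an example location based on the suggestion above.

```python
import os


def resolve_cache_dir(path):
    """Expand a leading ~ and return an absolute, normalized path.

    Hypothetical helper, not part of Eidos: the returned string is what
    one would write into the cache directory setting in eidos.conf.
    """
    expanded = os.path.expanduser(path)  # "~/x" -> "<home>/x"
    return os.path.abspath(expanded)     # relative paths become absolute


if __name__ == "__main__":
    # Example location only; any directory would do.
    print(resolve_cache_dir("~/.eidos/cache"))
```

A relative path such as `./cache` is resolved against the current working directory at the time the helper runs, so it would need to be run once from the directory that should hold the shared cache.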

bgyori commented 4 years ago

I see. Given all these factors, I guess it's not worth changing anything now. By the way, the use case in which this comes up is when we use Eidos as a dependency of INDRA, so Eidos is invoked from application-specific folders for different reading/assembly runs.

kwalcock commented 4 years ago

Another option would be to make a symbolic link from ./cache in each of these directories to ~/cache.
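For runs that are launched from a script, the link could be created programmatically before Eidos starts. This is a rough sketch under a few assumptions: `link_cache` is a hypothetical helper, the relative cache directory is literally named `cache`, and the working directory already exists.

```python
from pathlib import Path


def link_cache(work_dir, shared_cache="~/cache"):
    """Make <work_dir>/cache a symlink to one shared cache directory.

    Hypothetical helper, not part of Eidos or INDRA. It refuses to
    overwrite an existing cache directory or a link pointing elsewhere.
    """
    shared = Path(shared_cache).expanduser()
    shared.mkdir(parents=True, exist_ok=True)  # create the target once

    link = Path(work_dir) / "cache"
    if link.is_symlink():
        if link.resolve() == shared.resolve():
            return link  # already linked to the shared cache
        raise FileExistsError(f"{link} points to a different location")
    if link.exists():
        raise FileExistsError(f"{link} exists and is not a symlink")

    link.symlink_to(shared, target_is_directory=True)
    return link
```

Calling `link_cache(".")` in each application-specific folder before starting Eidos would make every run reuse the same cache. Caches built with different options would still need separate target directories.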

bgyori commented 4 years ago

The server has since been fixed so I'm closing this.