stanfordnlp / CoreNLP

CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
http://stanfordnlp.github.io/CoreNLP/
GNU General Public License v3.0
9.66k stars 2.7k forks source link

Is downloads.cs.stanford.edu down? #1434

Open snc-mana opened 6 months ago

snc-mana commented 6 months ago

Cannot download http://www-nlp.stanford.edu/pubs/crosswikis-data.tar.bz2

AngledLuffa commented 6 months ago

Where was this linked from? It would seem that the basic problem is not that the download site is not available, but that the directory doesn't have an index.html in it and there's no default action to list the files when presented with a link to the data directory rather than a specific file. The solution is either to ask our IT to make browsing the directory available, or to put a boilerplate index.html on it which links all the files. The question I would need answered, though, is how did you get there? I think that would inform my figuring out how to make it available to you via download

On Tue, Mar 19, 2024 at 10:34 AM snc-mana @.***> wrote:

Cannot download http://www-nlp.stanford.edu/pubs/crosswikis-data.tar.bz2

— Reply to this email directly, view it on GitHub https://github.com/stanfordnlp/CoreNLP/issues/1434, or unsubscribe https://github.com/notifications/unsubscribe-auth/AA2AYWLVLBNGAW7GCZXQNL3YZBZJBAVCNFSM6AAAAABE6AVMCGVHI2DSMVQWIX3LMV43ASLTON2WKOZSGE4TKNJXGQ2TCMQ . You are receiving this because you are subscribed to this thread.Message ID: @.***>

snc-mana commented 6 months ago

It was found here: A Cross-Lingual Dictionary for English Wikipedia Concepts

It's the 4th footnote.

AngledLuffa commented 6 months ago

Gotcha. In that case, it seems like the easiest thing to do is just compress the entire directory into a new .bz2 file, which I've done. I'm not sure why it was expanded in the first place. You should be able to download it now... LMK if it's what you're looking for.