dkpro / dkpro-uby

Framework for creating and accessing UBY resources – sense-linked lexical resources in standard UBY-LMF format
https://dkpro.github.io/dkpro-uby
Other
22 stars 3 forks source link

Links to tutorials are dead #177

Closed guilherme-salome closed 2 years ago

guilherme-salome commented 5 years ago

On the page https://dkpro.github.io/dkpro-uby/ under Learning UBY the link to the tutorials are dead. Specifically:

guilherme-salome commented 5 years ago

The link to database dumps http://uby.ukp.informatik.tu-darmstadt.de/uby is also dead

reckart commented 5 years ago

Unfortunately, the university had us migrate to a different content management system which killed all the existing "nice" URLs.

Some of the material is linked here: https://www.informatik.tu-darmstadt.de/ukp/research_6/data/lexical_resources/uby/index.en.jsp

Although it seems that even here, some links are presently dead :(

guilherme-salome commented 5 years ago

I was also searching that website for a tutorial on how to start using UBY but can't find anything. There is a tutorial that links to this page but following it is not possible because the dumps are not available. Do you know if there is any easy to use API available somewhere? Thanks!

reckart commented 5 years ago

Here are some URLs I found:

@chmeyer Anything to comment here, maybe regarding the an API?

@chmeyer I found the pre-build resources on the VM as well, but I don't know why they were moved off the public page. Do you know if they are incompatible with the latest Uby versions?

randomnoob commented 2 years ago

Hi, I know the project is now abandoned, but does anyone know where I can get the DB dump? I've already exhausted every resources that I can.

reckart commented 2 years ago

I'll check... no promises...

reckart commented 2 years ago

@randomnoob Do you know which dumps you are looking for exactly?

randomnoob commented 2 years ago

@randomnoob Do you know which dumps you are looking for exactly?

Hi, I'm looking for one of these :

The filenames comes from https://web.archive.org/web/20160122132248/http://uby.ukp.informatik.tu-darmstadt.de/uby/ Thank you!

logological commented 2 years ago

I can provide the following dumps if needed:

Note that the first two of these contain the proprietary resource GermaNet, and so I can give them to you only if you have a GermaNet licence. (If you do, you should send me your licence details and I'll verify these with the GermaNet folks.)

reckart commented 2 years ago

I have been able to salvage these from the archived Uby VM:

uby_open_0_7_0_20150507.sql.xz
uby_open_0_6_0.sql.xz
uby_open_0_6_0.sql.7z

But they lived in an "old" folder, so I do not know if they are compatible with the latest version of the code. Let me check where I can put them online for you...

reckart commented 2 years ago

@logological these" uby_open" files I mentioned above do not contain GermaNet, do they?

logological commented 2 years ago

@reckart: I can't say for sure without examining them. But you could do so yourself: try grepping for GN_Synset on the decompressed SQL files. If you get a lot of output, then the dump contains GermaNet.

reckart commented 2 years ago
$ xzgrep "GN_Synset" uby_open_0_7_0_20150507.sql.xz
$ xzgrep "GN_Synset" uby_open_0_6_0.sql.xz
$

Looks safe.

reckart commented 2 years ago
logological commented 2 years ago

@reckart Would you also like me to send you uby_release_1_0.sql to post there?

Come to think of it, shouldn't all these go on TUdatalib? Most of the other UKP data sets and resources are already there.

reckart commented 2 years ago

It would be good if there were on TUdatalib. Could you provide descriptions for the datasets?

I have a uby_release_1_0.sql.7z also that I could put on the server. I assume that's the same you have.

logological commented 2 years ago

Yes, that's probably the same version I have. Below is a generic description that can be used for all the databases; unfortunately, I don't know which versions of the resources are included in which versions of the UBY database.

UBY is a large-scale lexical-semantic resource for natural language processing (NLP) based on the ISO standard Lexical Markup Framework (LMF).  The UBY database distributed here integrates the English- and German-language lexical-semantic resources listed below.  (UBY is also capable of integrating GermaNet, but we do not include it here as it is subject to a proprietary licence.)  We also separately distribute [software for accessing the database](https://dkpro.github.io/dkpro-uby/).

Integrated resources (only including resources with open licences):

- English WordNet, Wiktionary, Wikipedia, FrameNet and the syntactically rich VerbNet
- German Wikipedia, Wiktionary, the syntactically rich IMSLex-Subcat, as well as multilingual OmegaWiki

Integrated alignment resources:

External alignments:
- WordFrameNet
- Semlink VerbNet - FrameNet mapping

Alignments created by UKP Lab:
- WordNet 3.0 - Wiktionary English alignment
- WordNet 3.0 - OmegaWiki German alignment
- Wiktionary English - FrameNet alignment
- Wiktionary English - OmegaWiki English alignment
- WordNet - Wikipedia English alignment

If you need further metadata for the TUdatalib record (papers to cite, licence, etc.), you can probably copy them from the DKPro UBY page and/or the UBY database licence of 2015-05-22.

reckart commented 2 years ago

Thanks... looking into it...

reckart commented 2 years ago

Here is the TUdatalib page: https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/2941

@logological Does that look good to you?

logological commented 2 years ago

Yep.

reckart commented 2 years ago

Ok, then I believe we can close this issue. 🎉