lapps / vocabulary-pages

DSL files and templates used to generate the LAPPS WS-EV pages.
Apache License 2.0

Adding tagsets #55

Closed: ksuderman closed this issue 6 years ago

ksuderman commented 7 years ago

From @marcverhagen on March 16, 2017 21:23

We do not have discriminators for tagsets yet. There are two ways to add them, both by editing discriminators.config.

One way is to keep it in house and add a postagset URI to the discriminators:

def meta = { "http://vocab.lappsgrid.org/ns/media/meta/$it" }

postagset {
   uri meta('postagset')
   description "POS tagset used."
}

brown_tagset {
   uri meta('tagset#brown')
   description "The Brown tag set, see <SOME_URL>"
}

This probably requires resolution of issue https://github.com/lapps/vocabulary-pages/issues/39.

The other way is to make the URI for the discriminator the external link:

def dkpro = { "https://dkpro.github.io/dkpro-core/releases/1.8.0/docs/tagset-reference.html#$it" }

ptb_tagset {
   uri dkpro('tagset-en-ptb-pos')
   description 'Tag set used in the Penn Tree Bank'
}

Copied from original issue: lappsgrid-incubator/discriminator-dsl#6

ksuderman commented 7 years ago

I think I prefer the first method, as it is consistent with what we have so far. However, the base URI for tag sets should be something like http://vocab.lappsgrid.org/ns/tagsets, since http://vocab.lappsgrid.org/ns/media is intended for media types.

def tagset = { "http://vocab.lappsgrid.org/ns/tagset/$it" }

'pos-tagset' {
    uri tagset('pos')
    description "Part-Of-Speech tag set used."
}
'brown-tagset' {
    uri tagset('pos#brown')
    description "See <some uri>"
}
'dkpro-tagset-en-ptb-pos' {
    uri tagset('pos#dkpro-en-ptb')
    description 'See https://dkpro.github.io/dkpro-core/releases/1.8.0/docs/tagset-reference.html#tagset-en-ptb-pos'
}

The other tag set types would use http://vocab.lappsgrid.org/ns/tagset/ner etc.
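As a rough illustration of how the helper closure expands, here is a plain Groovy snippet (not the discriminator DSL itself) that builds the URIs from the example above. The `ner#example` fragment is a made-up placeholder, not an agreed name.

```groovy
// Helper closure from the example above: appends a path to the tag set root.
def tagset = { "http://vocab.lappsgrid.org/ns/tagset/$it" }

// String interpolation yields a GString, so convert to String before comparing.
def posRoot = tagset('pos').toString()
def brown = tagset('pos#brown').toString()

// Hypothetical NER entry; 'example' is a placeholder, not an agreed tag set name.
def ner = tagset('ner#example').toString()

assert posRoot == 'http://vocab.lappsgrid.org/ns/tagset/pos'
assert brown == 'http://vocab.lappsgrid.org/ns/tagset/pos#brown'
assert ner.startsWith(tagset('ner').toString())

println([posRoot, brown, ner].join('\n'))
```

Each tag set type gets its own path segment, and the individual tag sets are distinguished only by the fragment after `#`.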

marcverhagen commented 7 years ago

Yes, http://vocab.lappsgrid.org/ns/media was a typo or me being daft. Question: do we have a namespace for metadata information? If so, then instead of http://vocab.lappsgrid.org/ns/tagsets we could use http://vocab.lappsgrid.org/ns/meta/tagsets.

The advantage is that there is a clear mapping between metadata properties in the vocab, the metadata content of a LIF object, and a location in the discriminator section. The disadvantage is a conflict with the meta discriminator, which points to http://vocab.lappsgrid.org/ns/meta (which really is http://vocab.lappsgrid.org/ns/meta.html).

ksuderman commented 7 years ago

It is unfortunate about the name collision with the http://vocab.lappsgrid.org/ns/meta discriminator, but that shouldn't prevent us from using a URI like http://vocab.lappsgrid.org/ns/meta/tagsets/pos#ptb as a feature value.

Another alternative is to use a completely different namespace like http://vocab.lappsgrid.org/meta (no /ns in the path).

marcverhagen commented 7 years ago

Yes, as long as there is no need for http://vocab.lappsgrid.org/ns/meta to point at the directory rather than the html file, we should be okay. I guess one option would be to have a special subdirectory of ns/ for things like meta, error, ok, etc.

ksuderman commented 7 years ago

I think we agreed to forgo the /ns/meta URL and use http://vocab.lappsgrid.org/ns/tagset/pos and http://vocab.lappsgrid.org/ns/tagset/dependency etc. as the roots for tag set URLs.
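
For reference, a sketch of what the discriminators.config entries might look like with these roots, following the style of the earlier examples. The discriminator names and the `#brown` fragment are illustrative assumptions, not names that were agreed on in this thread.

```groovy
// Sketch only: discriminator names and '#...' fragments are assumptions.
def tagset = { "http://vocab.lappsgrid.org/ns/tagset/$it" }

'tagset-pos' {
    uri tagset('pos')
    description "Root for part-of-speech tag sets."
}
'tagset-pos-brown' {
    uri tagset('pos#brown')
    description "The Brown tag set, see <some uri>"
}
'tagset-dependency' {
    uri tagset('dependency')
    description "Root for dependency tag sets."
}
```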