clarin-eric / generate_language_info_pages

Generates language info HTML pages for the CLARIN Virtual Language Observatory.
0 stars 1 forks source link

synchronize language info list with VLO #1

Open dietervu opened 8 years ago

dietervu commented 8 years ago

Report from Jörg Knappen:

_I tried the link to the Arabic langauge ( https://infra.clarin.eu/content/language_info/data/ara.html ) getting a 404 -- page not found._

As far as I can see there is no page generated for ara since that is considered by glottolog as a group of languages:

http://glottolog.org/resource/languoid/id/arab1395

This is fine, but then we should make sure the VLO does not feature a link to the ara page. The source for the language info script is:

http://glottolog.org/resourcemap.json?rsc=language

Might also relate to the update of the CMDI ISO-639-3 language code component.

menzowindhouwer commented 8 years ago

ara is a ISO 639-3 code for the macro language Arabic, which is indeed a group of languages.

Some questions:

  1. wouldn't it be better to have pages for all ISO-639-3 codes, as some users will expect them, and than discourage the use of some codes, e.g., for macrolanguages
  2. how to deal with updates of the codeset, i.e., keep old codes around? (not all the linked resources will be continuously up-to-date)
  3. do we really want to exclude marco languages from the CMDI ISO-639-3 language code component?

Do we want to exclude marco languages from the CMDI ISO-639-3 language code component?