sign-language-processing / sign-language-processing.github.io

Documentation and background of sign language processing

Update AUSLAN dataset #81

cleong110 closed 2 days ago

cleong110 commented 1 week ago

https://aclanthology.org/Y08-1002.pdf https://www.elararchive.org/uncategorized/SO_a93b67cc-7339-4f08-8f09-8648791d0c3d/

TODO:

cleong110 commented 1 week ago

The Auslan corpus annotations that have been created to date are intended primarily for investigations of grammar and discourse, rather than a basic phonological or lexical analysis of the language. The investigation centres on the modification of indicating verbs in terms of frequency of types/tokens, and their environments of occurrence (e.g., during periods of constructed action, with or without contiguous pointing signs, or with reference to the sequential order of related nominal arguments). The focus is on the analysis of the grammatical use of space in Auslan in terms of semantic roles and grammatical relations.

OK, so linguistic annotations of some sort are in there. Are there glosses?

cleong110 commented 1 week ago

[screenshot] AUSLAN itself

cleong110 commented 1 week ago

Poking around on the website https://elararchive.org/collections/ led me to http://hdl.handle.net/2196/d8a991a5-d8cc-4f85-a5ff-c37279ebb625

cleong110 commented 1 week ago

http://hdl.handle.net/2196/d8a991a5-d8cc-4f85-a5ff-c37279ebb625 leads to https://www.elararchive.org/dk0001

cleong110 commented 1 week ago

https://auslan.org.au/about/corpus/ has more info.

cleong110 commented 1 week ago

https://auslan.org.au/about/annotations/ says: [screenshot]

cleong110 commented 1 week ago

Let's focus on the 2008 version, aka https://www.elararchive.org/dk0001

OK, so for "Features" I think we can safely say video and gloss. We don't have a category for "linguistic/grammatical".

cleong110 commented 1 week ago

Now for licensing:

https://auslan.org.au/ says it's CC BY-NC-ND: https://creativecommons.org/licenses/by-nc-nd/4.0/

However, the 2008 version hosted on ELAR Collections has this to say: [screenshot]

cleong110 commented 1 week ago

According to the paper, they used ELAN for annotation.
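Since ELAN stores annotations as XML (.eaf files), glosses could in principle be pulled out with the standard library. The following is only a minimal sketch: the tier name "gloss" and the sample annotation values are made up for illustration and are not taken from the Auslan corpus, whose actual tier names would need to be checked against its annotation guidelines.

```python
import xml.etree.ElementTree as ET


def read_tier(eaf_xml: str, tier_id: str):
    """Return (start_ms, end_ms, value) tuples for one tier of an EAF document."""
    root = ET.fromstring(eaf_xml)
    # Map each time slot id to its millisecond value; unaligned slots have no TIME_VALUE.
    slots = {}
    for ts in root.iter("TIME_SLOT"):
        raw = ts.get("TIME_VALUE")
        slots[ts.get("TIME_SLOT_ID")] = int(raw) if raw is not None else None
    rows = []
    for tier in root.iter("TIER"):
        if tier.get("TIER_ID") != tier_id:
            continue
        for ann in tier.iter("ALIGNABLE_ANNOTATION"):
            value = ann.findtext("ANNOTATION_VALUE", default="")
            start = slots.get(ann.get("TIME_SLOT_REF1"))
            end = slots.get(ann.get("TIME_SLOT_REF2"))
            rows.append((start, end, value))
    return rows


# Hypothetical, hand-written EAF fragment (not real corpus data).
SAMPLE = """<ANNOTATION_DOCUMENT>
  <TIME_ORDER>
    <TIME_SLOT TIME_SLOT_ID="ts1" TIME_VALUE="0"/>
    <TIME_SLOT TIME_SLOT_ID="ts2" TIME_VALUE="500"/>
    <TIME_SLOT TIME_SLOT_ID="ts3" TIME_VALUE="900"/>
  </TIME_ORDER>
  <TIER TIER_ID="gloss">
    <ANNOTATION>
      <ALIGNABLE_ANNOTATION ANNOTATION_ID="a1" TIME_SLOT_REF1="ts1" TIME_SLOT_REF2="ts2">
        <ANNOTATION_VALUE>HOUSE</ANNOTATION_VALUE>
      </ALIGNABLE_ANNOTATION>
    </ANNOTATION>
    <ANNOTATION>
      <ALIGNABLE_ANNOTATION ANNOTATION_ID="a2" TIME_SLOT_REF1="ts2" TIME_SLOT_REF2="ts3">
        <ANNOTATION_VALUE>BIG</ANNOTATION_VALUE>
      </ALIGNABLE_ANNOTATION>
    </ANNOTATION>
  </TIER>
</ANNOTATION_DOCUMENT>"""

glosses = read_tier(SAMPLE, "gloss")
```

This only handles time-aligned annotations; tiers that use reference annotations (REF_ANNOTATION) would need their parent annotation resolved first.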

cleong110 commented 2 days ago

Also, the citation key had "2010" in it but points to https://aclanthology.org/Y08-1002, which is from 2008. Fixed this as well.
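A mismatch like this could be caught with a small check. The sketch below is not part of the repo: the function names and the example key are hypothetical, and it assumes old-style Anthology IDs (a letter, a two-digit year, a dash, a paper number), with two-digit years of 60 and above read as 19xx.

```python
import re


def anthology_year(anthology_id: str):
    """Year implied by an old-style ACL Anthology ID such as 'Y08-1002', else None."""
    m = re.fullmatch(r"[A-Z](\d{2})-\d+", anthology_id)
    if m is None:
        return None
    yy = int(m.group(1))
    # Assumed pivot: 60-99 -> 1960-1999, 00-59 -> 2000-2059.
    return 1900 + yy if yy >= 60 else 2000 + yy


def key_year(citation_key: str):
    """First four-digit year (19xx or 20xx) found in a citation key, else None."""
    m = re.search(r"(19|20)\d{2}", citation_key)
    return int(m.group(0)) if m else None


def years_agree(citation_key: str, anthology_id: str) -> bool:
    """True when both years are present and equal."""
    a, k = anthology_year(anthology_id), key_year(citation_key)
    return a is not None and k is not None and a == k


# Hypothetical key, used only to illustrate the check.
print(years_agree("johnston2010corpus", "Y08-1002"))
```

For the hypothetical key above the check reports a mismatch, since "Y08" implies 2008 while the key says 2010.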