PhyloStar / UDTelugu

Universal Dependency Tagging for Telugu
Apache License 2.0
1 stars 1 forks source link

Marking Wh-questions #4

Open PhyloStar opened 7 years ago

PhyloStar commented 7 years ago

How to mark Emiti? Is it DET or NOUN? I mark it as Determiner.

nishkalavallabhi commented 7 years ago

I started marking them as Pronoun and then adding PronType as int (interrogative). This will make it easy for us to resolve pronoun vs det issues of this kind (since both have PronType as interrogative)

nishkalavallabhi commented 7 years ago

More on this: In examples with questions, I think the question word should be the head word, following English examples.

e.g., What is his name becomes: root(ROOT-0, What-1) cop(What-1, is-2) nmod:poss(name-4, his-3) nsubj(What-1, name-4)

vaadi pEru EmiTi ? should perhaps become: nmod(pEru,vaaDi) nsubj(EmiTi, pEru) punct(EmiTi, ?)

This is how I am now redoing my annotations... EmiTi is a interrogative Determiner (Evaru - I tagged as a interrogative Pronoun, not determiner). EmiTi - may have a subject but Evaru/Evari can have a subj, and obj.

PhyloStar commented 7 years ago

The dependency analysis is more reasonable than what I did. Evaru is a personal pronoun. I suppose that these kind of personal pronouns should be always tagged as pronoun. I need to change the annotation throughout. :)

PhyloStar commented 7 years ago

Okay. Issue can be closed.

nishkalavallabhi commented 7 years ago

I am a bit confused with Bh K's definition of adverbial pronouns. In page 101 (Chapter on Adverbial Nouns), he considers "ekkada" as a Adverbial noun.

"miiru ekkaDa uNTaaru?"

We need to decide what to tag these question words as. Some of them refer to a noun, so I was tagging them as pronoun. But by that token, ekkaDa would have been a pronoun too.

At another place, in the chapter on pronouns, he listed: ఎవరు, ఏమిటి, ఏం, ఎందుకు, ఎవతె, ఏది, ఏమి as interrogative pronouns. I think the reason why he considers ekkaDa as a adverbial noun is that he says words related to time, place etc are adverbial nouns. We should reach some kind of agreement on how to tag all the question words, and what their relation to other nouns/verbs will be.

nishkalavallabhi commented 7 years ago

in the example on page 114, eMdaru is classified as determinative noun. It seems like all these question words are either pronouns or adverbial/determinative nouns for BhK.

nishkalavallabhi commented 7 years ago

"When the words anta [anta], anni; [anni] and andaru [andaru] are used as pronouns (i.e. as substitutes for nouns), their final vowels are lengthened to emphasize the meaning of ‘all’," - okay, so they are nouns used as pronouns in some contexts.

PhyloStar commented 7 years ago

At another place, in the chapter on pronouns, he listed: ఎవరు, ఏమిటి, ఏం, ఎందుకు, ఎవతె, ఏది, ఏమి as interrogative pronouns. I think the reason why he considers ekkaDa as a adverbial noun is that he says words related to time, place etc are adverbial nouns. We should reach some kind of agreement on how to tag all the question words, and what their relation to other nouns/verbs will be.

I think markers of place, manner, and time need to be adverbs. The main reason is that we will be following UD guidelines then of parallelism. I am following English translation and checking for the POS of the word.

PhyloStar commented 7 years ago

లోపల, బయట, మీద, కింద, దగ్గర, వెనక, ముందు, పక్కన, వైపు, ప్రకారం, తర్వాత, పాటు, దాకా, వరకు, etc. can be classified as adverbs when they modify the verb and as ADP when they modify a nominal

PhyloStar commented 7 years ago

"When the words anta [anta], anni; [anni] and andaru [andaru] are used as pronouns (i.e. as substitutes for nouns), their final vowels are lengthened to emphasize the meaning of ‘all’," - okay, so they are nouns used as pronouns in some contexts.

These are marked ad determiners since these words are quantifiers following UD guidelines. When a vowel lengthening happens, they are PRON as you say.

PhyloStar commented 7 years ago

What BhK calls as adverbial nouns: ninna, monna, ivala, indaaka These can inflect for genitive case and behave as adjectives ninnaTi can take up locative case as in taruwaata I think they are adverbials but also inflect for case markers. I suppose we can tag them as ADV but in morphology the case marker can show up.