codeaudit / dkpro-core-asl

Automatically exported from code.google.com/p/dkpro-core-asl
0 stars 0 forks source link

Use "universal" categories for part-of-speech tags #76

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
A set of universal POS tags very similar to the existing DKPro Core POS tags 
has been presented at LREC recently:

http://code.google.com/p/universal-pos-tags/

It has been discussed that these will be adopted for DKPro Core. Only a few 
changes need to
be made:

Universal       DKPro (old) DKPro (new) Change      Description
---------------------------------------------------------------------
ADJ         ADJ         ADJ         no          adjetives
ADV         ADV         ADV         no          adverbs
DET         ART         DET         rename      determiners
NUM         CARD        NUM         rename      cardinal numbers
CONJ                CONJ                CONJ                no          conjunctions
NOUN        NP, NN, N       NOUN        merge       nouns
X           O           X           rename      other
ADP         PP          ADP         rename      adjectives
PRON        PR          PRON        rename      pronouns
VERB                V           VERB                rename      verbs
PRT                     PRT         new         particles
.           PUNC        PUNC        no          punctuation

Original issue reported on code.google.com by richard.eckart on 27 Jun 2012 at 8:02

GoogleCodeExporter commented 9 years ago
I'd also suggest to remove the special "tweek" POS tags - or - alternatively 
move them to the arktweet module.

Original comment by richard.eckart on 27 Jun 2012 at 8:02

GoogleCodeExporter commented 9 years ago
They need to be moved - not deleted.

Original comment by torsten....@gmail.com on 28 Jun 2012 at 8:40

GoogleCodeExporter commented 9 years ago
Should we have a api.tweets module or just move them to the arktweet for now?

Original comment by richard.eckart on 28 Jun 2012 at 8:45

GoogleCodeExporter commented 9 years ago
api.tweets would be more generic, but adding a new module is maybe too much 
overhead for now.
Whoever ends up fixing this issue can decide on his/her own.

Original comment by torsten....@gmail.com on 28 Jun 2012 at 8:54

GoogleCodeExporter commented 9 years ago

Original comment by richard.eckart on 13 Oct 2012 at 6:31

GoogleCodeExporter commented 9 years ago

Original comment by richard.eckart on 21 Feb 2013 at 9:50

GoogleCodeExporter commented 9 years ago

Original comment by richard.eckart on 25 Jun 2013 at 10:52

GoogleCodeExporter commented 9 years ago

Original comment by richard.eckart on 25 Jun 2013 at 10:58

GoogleCodeExporter commented 9 years ago
Mind that NP and NN could remain as sub-classes of Noun.

Original comment by richard.eckart on 5 Aug 2013 at 7:54

GoogleCodeExporter commented 9 years ago

Original comment by richard.eckart on 29 Sep 2013 at 2:59

GoogleCodeExporter commented 9 years ago

Original comment by richard.eckart on 29 Sep 2013 at 3:00

GoogleCodeExporter commented 9 years ago

Original comment by richard.eckart on 26 Mar 2014 at 10:51

GoogleCodeExporter commented 9 years ago

Original comment by richard.eckart on 31 Mar 2014 at 10:49

GoogleCodeExporter commented 9 years ago

Original comment by richard.eckart on 12 Nov 2014 at 10:10