FoLiA: Format for Linguistic Annotation - FoLiA is a rich XML-based annotation format for the representation of language resources (including corpora) with linguistic annotations. A wide variety of linguistic annotations are supported, making FoLiA a useful format for NLP tasks and data interchange. Note that the actual Python library for processing FoLiA is implemented as part of PyNLPl, this contains higher-level tools that use the library as well as the full documentation, validation schemas, and set definitions
FoLiA currently has the token annotation subjectivity for limited sentiment analysis or other subjectivity annotation, it is used by the VU-DNC corpus for instance. This, however, is not sufficient for more complex expressions of sentiment. A strong span annotation element is needed. The following proposal is inspired on NAF's opinion layer:
FoLiA currently has the token annotation
subjectivity
for limited sentiment analysis or other subjectivity annotation, it is used by the VU-DNC corpus for instance. This, however, is not sufficient for more complex expressions of sentiment. A strong span annotation element is needed. The following proposal is inspired on NAF's opinion layer:This predefines the following feature subsets, whether they are actually used and the class values they take are defined by the set.
polarity
strength
The following span role elements are introduced and used (will be reused in another upcoming proposal as well):
source
- The source/holder of the sentiment (optional)target
- The target/recipient of the sentiment (optional)hd
- The head contains the sentiment itself (required)