Closed AngledLuffa closed 2 years ago
There are also a few instances of la/lo
or la/il
, which again is not standard in this dataset or others. Most of the time, that one is lemmatized as la
le/lo
occurs a few times instead of le/le
li/lo
instead of li/li
lo/il
occurs twice, usually it is lo/lo
se/si
occurs twice instead of `se/se
I can provide a PR for these items if that will help
Excellent, thanks!
There are several instances where
gli
as aPRON
is lemmatized aslo
, which I believe to be incorrect. The standard used in most of this dataset, along with VIT and ISDT, is to lemmatize it asgli
.For example:
vs
There are three other
gli/lo
in the train section and onegli/il