Closed AngledLuffa closed 3 months ago
"What about X" is a way to ask a question, where X may be anything—no approximation involved. So it makes sense that that has a different structure from "at about TIME". Is that the main difference you noticed?
Ok, that makes sense. However, in "I came here about 12 years ago", `about` has xpos `IN` and upos `ADV`. Normally I would expect `ADP` as the upos for `IN`. There are 16 total cases between train/dev/test with `IN` and `ADV`, though.
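For anyone who wants to reproduce that kind of count, here is a minimal sketch of scanning CoNLL-U text for a given upos/xpos combination. The function name `find_tag_pairs` and the sample sentences are made up for illustration (the sample is hand-written, not copied from EWT, and its head/deprel columns are left blank), so treat it as a starting point rather than the query actually used above.

```python
def find_tag_pairs(conllu_text, upos, xpos):
    """Return (sentence text, word form) for every token with the given upos and xpos."""
    hits = []
    sent_text = None
    for line in conllu_text.splitlines():
        if not line.strip():
            sent_text = None  # a blank line ends the current sentence
            continue
        if line.startswith("#"):
            if line.startswith("# text = "):
                sent_text = line[len("# text = "):]
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # not a well-formed token line
        if "-" in cols[0] or "." in cols[0]:
            continue  # skip multiword token ranges and empty nodes
        # CoNLL-U columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
        if cols[3] == upos and cols[4] == xpos:
            hits.append((sent_text, cols[1]))
    return hits


def make_sentence(text, tagged_tokens):
    """Build a CoNLL-U block from (form, upos, xpos) triples; other columns stay '_'."""
    lines = ["# text = " + text]
    for i, (form, upos, xpos) in enumerate(tagged_tokens, start=1):
        lines.append("\t".join([str(i), form, "_", upos, xpos, "_", "_", "_", "_", "_"]))
    return "\n".join(lines) + "\n\n"


# Hypothetical sample data, hand-tagged for illustration only.
SAMPLE = make_sentence(
    "I came here about 12 years ago",
    [("I", "PRON", "PRP"), ("came", "VERB", "VBD"), ("here", "ADV", "RB"),
     ("about", "ADV", "IN"), ("12", "NUM", "CD"), ("years", "NOUN", "NNS"),
     ("ago", "ADV", "RB")],
) + make_sentence(
    "What about lunch",
    [("What", "PRON", "WP"), ("about", "ADP", "IN"), ("lunch", "NOUN", "NN")],
)

if __name__ == "__main__":
    for text, form in find_tag_pairs(SAMPLE, "ADV", "IN"):
        print(form, "in:", text)
```

Running the same function over the train/dev/test files (read in as text) would list the candidate tokens to review by hand.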
Is this a case where it'd be fine for the CoreNLP converter to editorialize the tags to be `ADV` even though there's an `IN` xpos (https://github.com/UniversalDependencies/docs/issues/717), or would it make more sense to unify the tags in the EWT treebank? Personally I would go with the latter and accept that there will be a few unfixable validation errors in the converter.
EWT has just 4 of these ADV/IN combos: https://universal.grew.fr/?custom=6609ff40eada0
I think they're errors, will fix.
Made a more general issue: #516
Came across the following:
vs
This also looks different:
Mostly this came up because I was trying to figure out how to convert this constituent to dependencies
but I'm not figuring out a proper pattern from flipping through EWT. It's possible the `about_IN` in PTB is not the same standard used in EWT, though.