Hello! Firstly I have to say that I love this project. Really helping me exploring syntax of different kinds of text. So thank you so much!
I have a question regarding tagsets. I am using swedish model, and few years back, I remember it used to be based on Swedish treebank tagset called Mamba. But it seems like it has been changed in the new version (benepar-sv2).
I tried to print what kind of labels have been used to train the core model, and I got these results.
What is the difference between NP::NP::S and S::NP::NP ?
In this example ( In English: Hello, I am a banana)
There is a S (simple declarative clause) which has 2 NPs as children. Would this be NP::NP::S or S::NP::NP ? And what is happening with AUX? Because, for me it is hard to think about any structure where S has only 2 NPs. Because at least one VP is required to become a S.
Also, general question: I saw from #30 that you are using this for training: http://surdeanu.cs.arizona.edu//mihai/teaching/ista555-fall13/readings/PennTreebankConstituents.html Is it same for Swedish model and other language's models? For example unlike English model, I see there is no FRAG in labels for Swedish models. Is this because of the nature of the language itself? Or did you use different label set for different languages?
Hello! Firstly I have to say that I love this project. Really helping me exploring syntax of different kinds of text. So thank you so much!
I have a question regarding tagsets. I am using swedish model, and few years back, I remember it used to be based on Swedish treebank tagset called Mamba. But it seems like it has been changed in the new version (benepar-sv2).
I tried to print what kind of labels have been used to train the core model, and I got these results.
What is the difference between
NP::NP::S
andS::NP::NP
?In this example ( In English: Hello, I am a banana) There is a
S
(simple declarative clause) which has 2NP
s as children. Would this beNP::NP::S
orS::NP::NP
? And what is happening withAUX
? Because, for me it is hard to think about any structure whereS
has only 2NP
s. Because at least oneVP
is required to become aS
.Also, general question: I saw from #30 that you are using this for training: http://surdeanu.cs.arizona.edu//mihai/teaching/ista555-fall13/readings/PennTreebankConstituents.html Is it same for Swedish model and other language's models? For example unlike English model, I see there is no
FRAG
in labels for Swedish models. Is this because of the nature of the language itself? Or did you use different label set for different languages?