quadrama / DramaAnalysis

An R package for analysis of dramatic texts
GNU General Public License v3.0
15 stars 2 forks source link

Scene position counting #151

Closed nilsreiter closed 5 years ago

nilsreiter commented 5 years ago

If file ElFaetonte.xml is parsed, some utterance positions are outside of segments (i.e., text$utteranceBegin is larger than the largest segments$end.Scene). This leads to issues in the segment() function, apparently the counting of the positions for scenes doesn't count everything.

nilsreiter commented 5 years ago

It seems that in every act, there is one scene (the last?) missing. type="scene" is 97 times in the XML file, but length(unique(d$segments$begin.Scene)) is 94 (and there are three acts).