Closed AbrahamSanders closed 3 years ago
Implemented basic functionality in this commit co-authored with @Erikellerx
Left to do:
Implemented the to do items in the previous comment, tested for backward compatibility, and merged to master in this PR.
Many times a narrative segment will be in the same paragraph as a dialog turn. We should support extracting these just as we do for narrative segments in their own paragraphs:
Source excerpt from
Sister Carrie
byTheodore Dreiser
:What is currently extracted:
What should be extracted:
The order in which the dialog and narrative are positioned in the paragraph should dictate the order in which they appear in the corpus. For example, the narrative can come before the dialog turn:
Source excerpt, also from
Sister Carrie
byTheodore Dreiser
:What is currently extracted:
What should be extracted: