RJP43 / CitySlaveGirls

The Restoration of Nell Nelson
http://nelson.newtfire.org
5 stars 4 forks source link

The tagging saga continues ... Grammatical Markup #50

Closed RJP43 closed 8 years ago

RJP43 commented 8 years ago
RJP43 commented 8 years ago

@spadafour anything else we decided to edit for the grammatical markup?

RJP43 commented 8 years ago

<seg>It occupies only the three upper floors of a four-story building, but the stairways are so dark and narrow that one must grope <w type="adj" subtype="poss">his</w> <w type="noun">way</w> from somewhere to a suppositious somewhere else</seg>, which resembles nowhere when he gets there, because <seg>the rooms are so overcrowded with material that one <choice><sic>employe</sic><reg resp="#rjp">employee</reg></choice> cannot in many instances see <w type="adj" subtype="poss" ana="#workingGirl">her</w> nearest <w type="noun" ana="#workingGirl">neighbor</w> two yards away</seg>.

@spadafour I want to point out a couple of things here about the new grammatical markup. Okay I think we should use the <seg> to grab the context of the possessive adjectives and nouns so you extend the <seg> over as much of the line that is necessary to make it clear who the noun part of the two is in reference to. As we can see in the example above the first <seg> is a more general reference to any person navigating the building so we have the seg wrapping enough to make that clear and then we don't need to make any new archetypes to mark as an @ana on either of the grabbed words ( <w> ). Then if we output this in the HTML as a table every seg can have a context column. Now looking at the next <seg> we can see that the <seg> is being used to grab as much of the sentence as necessary to make it clear who the @ana for both grabbed words (<w>) is referring to. I think this will work instead of adding a note explaining the @ana on either or both parts of the word as we brefly discussed Wednesday evening. Why try to justify in our own words when the text can do it for us is my theory on this.

Here is another example: <!-- rjp: seg holding two sets of words, but both referring to the same context in the adj archetype declaraction --><seg><persName ref="#employer">Mr. <app><rdg wit="#CT021">Schultz</rdg><rdg wit="#WSGC23">S.</rdg></app></persName> had the usual explanation to make as to the employment of young boys and girls, and no doubt made it truthfully. <!-- rjp: should we have a child employee archetype??? --><w type="adj" subtype="poss" ana="#employee">Their</w> <w type="noun">parents</w> needed <w type="adj" subtype="poss" ana="#employee">their</w> <w type="noun">assistance</w> and would sign any sort of certificate as to age, and supplement it with personal solicitation asking employment as a charity.</seg><seg> In extreme cases he negotiated a compromise with <w type="adj" subtype="poss" ana="#employer">his</w> <w type="noun" subtype="refTo" ana="#employee">judgment</w>, as most manufacturers do, and gave the child employment.

In this example we can see how two sets of the word elements come one right after the other with the same context for the possessive adjective archetype. I think it is fine to use one seg here and then when we are grabbing them just be aware that we need to use the position to determine which adjective goes with which noun (unless we can think of a better way to do this???). In the next seg we see our use of the @subtype on the noun with the refTo value that indicates there is reference to an archetype even if it isn't as direct as above in the first example was (<w type="adj" subtype="poss" ana="#workingGirl">her</w> nearest <w type="noun" ana="#workingGirl">neighbor</w>). Here the seg allows us to see that the employer's judgment is in relation to employing the child employee. @spadafour I also bring up the question here about adding a child employee archetype.

By the way these both come from article 1888-08-19

RJP43 commented 8 years ago

@spadafour scratch some of that we should continue with the seg around just the two words but grab the sentences with a s that hold any segs and use that as context so then when multiple segs fall in a single sentence the sentence is the context so we can grab segs individually and not complicate the markup by grabbing the words on position.

RJP43 commented 8 years ago

<s>It occupies only the three upper floors of a four-story building, but the stairways are so dark and narrow that one must grope <seg><w type="adj" subtype="poss">his</w> <w type="noun">way</w></seg> from somewhere to a suppositious somewhere else, which resembles nowhere when he gets there, because the rooms are so overcrowded with material that one <choice><sic>employe</sic><reg resp="#rjp">employee</reg></choice> cannot in many instances see <seg><w type="adj" subtype="poss" ana="#workingGirl">her</w> nearest <w type="noun" ana="#workingGirl">neighbor</w></seg> two yards away.</s>

RJP43 commented 8 years ago

We have successfully transitioned our grammatical markup. Please refer to our codebook