Closed seanredmond closed 6 years ago
After some offline discussion we're going to handle this according to a few of principles:
For this particular issue then, we will go with the last option:
<copies date="1951-05-03" num="2">2c 3May51</copies>
which both preserves the original text, but adds some derived attributes that will make the data easier to work with.
Some bits of the data are so far going to be converted to attributes, meaning they'll be taken out of the text representation of the XML though the data is preserved. Can we decide on a principal to help guide when that occurs. To take the
copies
element as an example:it could (1) just be text
The current proposal (2) from DCL is to regularize the date (see #8)
But we could go further (3) and just parse out the number of copies, too, so that it's an empty tag
Or combine the first and third (4)
I think either the first or the last (and really, I think the last is the best option). They both preserve the original information. The second (currently proposed) version does some of the processing up front and makes later processing easier but leaves out an important piece. The last option will be the easiest do deal with for both human and machine.