WaxCylinderRevival / frus-dates-project

Project repository for FRUS date extraction and normalization initiative
https://history.state.gov
GNU General Public License v3.0
1 stars 0 forks source link

Add @calendar attribute to <date> #2

Closed WaxCylinderRevival closed 6 years ago

WaxCylinderRevival commented 7 years ago

While ISO-8601 requires use of the Gregorian calendar, it is best practice to explicitly designate calendar used. For most if not all of FRUS, the @calendar attribute should be populated with #Gregorian.

TEI Construction (See #6)

http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-att.datable.html#tei_att.calendar

See also #14

joewiz commented 7 years ago

Great, we've never declared anything about our calendars, and you make a good argument for explicitly designating this.

WaxCylinderRevival commented 7 years ago

Note: If we have any documents that were written in, say, Russia prior to the Revolution and originally written using Julian calendar dates, we'll need to add a calendar declaration on the date and TEI/teiheader/profiledesc. Given our corpus, such instances are likely rare outlier cases that could be flagged for second- or third-phase investigation and enhancement.

joewiz commented 7 years ago

Good point, definitely relevant to #10 too.

WaxCylinderRevival commented 7 years ago

In looking for multiple date elements in a single dateline, I found a Russian Julian/Gregorian calendar use case already present in FRUS: https://history.state.gov/historicaldocuments/frus1914-20v02/d156

Using an online Julian-Gregorian converter (http://stevemorse.org/jcal/julian.html), I confirmed that "January 29/February 11, 1917" (as well as "January 14/27" in the text) are most likely written as parallel dates in the communiqué, rather than a date range.

Current encoding:

    <opener>
      <dateline rendition="#right">
        <placeName>
          <hi rend="smallcaps">Petrograd</hi>
        </placeName>,<date when="1917-01-29">
          <hi rend="italic">January 29</hi>
        </date>/<date when="1917-02-11">
          <hi rend="italic">February 11, 1917</hi>
        </date>.<lb/>[Received April 10.]</dateline>
    </opener>

The same applies to https://history.state.gov/historicaldocuments/frus1914-20v02/d167

joewiz commented 7 years ago

Absolutely fascinating!

WaxCylinderRevival commented 7 years ago

How timely (groan), a cataloger just posted this calendar conversion resource to a FB group: https://www.fourmilab.ch/documents/calendar/

...complete source code is embedded in or linked to this page, and you're free to download these files to your own computer and use them even when not connected to the Internet.

joewiz commented 7 years ago

It's about time! (Groan.)

Go catalogers!