PerseusDL / canonical-latinLit

XML Canonical resources for Latin Literature
https://scaife.perseus.org
Creative Commons Attribution Share Alike 4.0 International

phi0914 #94

Open PonteIneptique opened 8 years ago

PonteIneptique commented 8 years ago

CC @balmas @lcerrato

balmas commented 8 years ago

Unfortunately it looks to me like Livy is a complete mess, and I'm not sure I want to merge PR #96 until we can straighten it out. We might have to go back to the original Perseus 4 source and start over.

It looks like various editions might have been merged during the migration, because the URNs for these in classics.xml are completely screwed up: duplicated and crossed between editions.

PonteIneptique commented 8 years ago

I do not think we need to start over. From what I have seen, the texts are pretty consistent from one edition to the other (in the way they are introduced, for example).

I would love to be able to add the missing stuff, I just do not know where to get it...

As far as the PR goes, it is just as "wrong" as the original data (and by that I mean we are missing some; the data themselves are okay). I understand the principle, but I'm not sure I agree with the decision :')

PonteIneptique commented 8 years ago

I might actually be able to do it given the zip from Perseus. Is there a more recent version of it?

balmas commented 8 years ago

The P4 source files are here: https://www.dropbox.com/s/y6bkfy7fjrj6q6e/Livy.tgz?dl=0 There are 2 separate directories, Classics and sdl. In theory the data in sdl supersedes what's in Classics, but what is currently live in Perseus 4 is a mixture of data from sdl and Classics.

The attached file contains the data from the Perseus 4 catalog file that drove the migration. The format is: Perseus 4 document id, CTS URN, source text path.

(The source text paths are slightly longer than in the P4 tar file linked here, but I think you can figure it out. I just removed the Livy and Latin/Livy from the path when zipping it up for you.)

livy.txt
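
For anyone wanting to check the catalog data for the duplicated URNs mentioned earlier, here is a minimal sketch. It assumes the three fields are comma-separated, one entry per line, which may not match the real livy.txt exactly. The function name `find_duplicate_urns` and the sample rows are made up for illustration; they are not taken from the attached file.

```python
import csv
import io
from collections import defaultdict


def find_duplicate_urns(lines):
    """Return the CTS URNs that are attached to more than one catalog entry.

    Assumes each row is: Perseus 4 document id, CTS URN, source text path,
    separated by commas (an assumption about the file layout).
    """
    by_urn = defaultdict(set)
    for row in csv.reader(lines):
        if len(row) != 3:
            continue  # skip blank or malformed rows
        doc_id, urn, path = (field.strip() for field in row)
        by_urn[urn].add((doc_id, path))
    # Keep only URNs that map to two or more distinct (id, path) pairs
    return {urn: sorted(entries) for urn, entries in by_urn.items() if len(entries) > 1}


# Invented sample rows, only to show the expected shape of the input
sample = io.StringIO(
    "example-doc-1,urn:cts:latinLit:phi0914.phi001.example-lat1,Classics/example_a.xml\n"
    "example-doc-2,urn:cts:latinLit:phi0914.phi001.example-lat1,sdl/example_b.xml\n"
    "example-doc-3,urn:cts:latinLit:phi0914.phi001.example-lat2,sdl/example_c.xml\n"
)

duplicates = find_duplicate_urns(sample)
for urn, entries in duplicates.items():
    print(urn)
    for doc_id, path in entries:
        print("   ", doc_id, path)
```

To run it against the real file, something like `find_duplicate_urns(open("livy.txt", newline=""))` should work, with the delimiter adjusted if the fields turn out to be separated differently.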

lcerrato commented 8 years ago

see also #100