direct-phonology / jdsw

Parsing the "Jingdian Shiwen" with spaCy
MIT License
2 stars 0 forks source link

add zhengwen versions of missing texts #9

Closed thatbudakguy closed 2 years ago

thatbudakguy commented 2 years ago

we're still looking for clean 正文 versions of these texts:

note that, for the 公羊 at least, Kanripo suggests a 正文 is available: https://www.kanripo.org/titlesearch?query=%E6%98%A5%E7%A7%8B%E5%85%AC%E7%BE%8A

but, the link to it is broken: https://www.kanripo.org/text/KR1e0005/

maybe Christian Wittern can help with this.

thatbudakguy commented 2 years ago

TLS version of the Zhouli is https://github.com/kanripo/KR1d0001.

GDRom commented 2 years ago

note: TLS lists both:

CH1e0877 春秋公羊傳 (this appears correct) CH1e0873 儀禮 (link leads to an error)

@thatbudakguy can you reproduce this?

thatbudakguy commented 2 years ago

excellent work — I can reproduce that error, but it doesn't matter because i was able to use those CHANT IDs to find the XML source of the texts in the tls-kr repo.

https://github.com/tls-kr/tls-chant/blob/master/chant/CH1/CH1e0877.xml https://github.com/tls-kr/tls-chant/blob/master/chant/CH1/CH1e0873.xml

i can convert these into workable Zhengwen versions.