TheStanfordDaily / archives-text

Archives text for the Stanford Daily since 1892. Help us improve by submitting a pull request!
https://archives.stanforddaily.com/
0 stars 0 forks source link

File name too long #4

Open epicfaace opened 5 years ago

epicfaace commented 5 years ago

CI script fails here:

Traceback (most recent call last):
  File "/opt/hostedtoolcache/Python/3.7.2/x64/lib/python3.7/shutil.py", line 563, in move
    os.rename(src, real_dst)
OSError: [Errno 36] File name too long: '1984/05/24/MODSMD_ARTICLE28.article.txt' -> '19xx/198x/1984y/05m/24d/MODSMD_ARTICLE28.RVZFTlRTIEVWRU5UUyBFVkVOVFMgRVZFTlRTIEVWRU5UUyBFVkVOVFMgRVZFTlRTIEVWRU5UUyBFVkVOVFMgRVZFTlRTIEVWRU5UUyBFVkVOVFMgRVZFTlRTIEVWRU5UUyBFVkVOVFMgRVZFTlRTIEVWRU5UUyBFVkVOVFMgRVZFTlRTIEVWRU5UUyBFVkVOVFMgRVZFTlRTIEVWRU5UUyBFVkVOVFMgRVZFTlRTIEVWRU5UUyBFVkVOVFMgRVZFTlRTIEVWRU5UUyBFVkVOVFMgRVZFTlRTIEVWRU5UUyBFVkVOVFMgRVZFTlRTIEVWRU5UUyBFVkVOVFM=.article.txt'

Possible solutions:

epicfaace commented 5 years ago

Turns out that the appropriate METS file (https://s3.amazonaws.com/stanforddailyarchive/data.2013-oct/data/stanford/1984/05/24_01/Stanford_Daily_19840524_0001-METS.xml) has this wrong title here:

image