clamsproject / aapb-brandeis-datahousing

Apache License 2.0
shorten index keys (GUIDs) #4

Closed (keighrim closed 1 year ago)

keighrim commented 1 year ago

At the moment, the GUID strings used as keys for the index are the full file "stem". Since the cpb-aacip- part is common to all GUIDs, we can trim it. Some files also have a cpb-aacip_ prefix (an underscore instead of a dash), which is another reason to simply chop the first 10 characters whenever the file name starts with cpb.
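
A minimal sketch of the proposed trimming, not the code in this repository. The function name `shorten_guid` and the example stems are hypothetical. It relies on `cpb-aacip-` and `cpb-aacip_` both being exactly 10 characters long, so one slice handles either separator.

```python
# Hypothetical helper illustrating the proposal; names are assumptions.
PREFIX_LEN = len("cpb-aacip-")  # 10 characters, same length as "cpb-aacip_"


def shorten_guid(stem: str) -> str:
    """Return the index key for a file stem.

    If the stem starts with "cpb", drop its first 10 characters, which
    covers both the "cpb-aacip-" and "cpb-aacip_" prefixes. Any other
    stem is returned unchanged.
    """
    if stem.startswith("cpb"):
        return stem[PREFIX_LEN:]
    return stem


# Made-up stems, for illustration only.
dash_key = shorten_guid("cpb-aacip-111-22334455")
underscore_key = shorten_guid("cpb-aacip_111-22334455")
other_key = shorten_guid("some-other-file")
```

Checking only for `cpb` follows the wording of the proposal. A stricter variant could match the full prefix with either separator, so that an unrelated stem that merely starts with `cpb` is not truncated.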