uga-libraries / format-report

Aggregate and analyze csv files with file format information generated by the UGA Libraries' digital preservation system (ARCHive).
Creative Commons Attribution Share Alike 4.0 International
0 stars 0 forks source link

Add groups(s) to new_formats.txt #25

Open amhanson9 opened 1 year ago

amhanson9 commented 1 year ago

It would be helpful to know what groups the formats are from in case I need to look at the format information for context in making choices about standardization.

amhanson9 commented 1 year ago

Right now, the script iterates on each ARCHive format report, so I could get the group from the format report name. It stores the data in a dictionary, with the format name as the key and if it matched or not as the value. If I stored the value as (match, group), it would only store the first group that had the format. Is that enough? Or would I need to update group when there are more than one?

amhanson9 commented 1 year ago

This may be simpler (or at least just different) once I use pandas for the comparison (Issue 13), so decided not to do this one right now.