usnationalarchives / digital-preservation

NARA digital preservation file format risk analysis and preservation plans

Consider using IETF "MIME types" to describe document formats #35

Closed by masinter 3 months ago

masinter commented 3 months ago

MIME types (media types) are defined by an IETF standard, and IANA maintains a public registry of them. MIME provides a mechanism for labelling content that is stored or transmitted by email or HTTP.
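As a rough illustration of this kind of labelling, the sketch below uses Python's standard `mimetypes` module to derive a MIME type label from a file name. The helper name `label` and the sample file names are hypothetical, and the extension-to-type table is the one shipped with Python (possibly extended by the host system), not the IANA registry itself.

```python
import mimetypes


def label(filename):
    """Return a MIME type label guessed from the file extension.

    Falls back to the generic 'application/octet-stream' when the
    extension is unknown. This is a guess from the name only; the
    file contents are never inspected.
    """
    media_type, _encoding = mimetypes.guess_type(filename)
    return media_type or "application/octet-stream"


# Hypothetical file names, for illustration only.
for name in ("report.pdf", "scan.png", "notes.zzzunknown"):
    print(name, "->", label(name))
```

Guessing from the extension is the weakest form of identification. It shows what the label looks like, not how a preservation workflow should assign it.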

MIME is widely used and understood. MIME type labels offer a set of categories, a mechanism for extension, and a greater likelihood of adoption and of software being available for interpretation and conversion.

In addition, if finer granularity of format description is needed, there are IETF specifications for additional labelling attributes for file formats.
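One way such additional attributes can appear is as parameters appended to the media type. The sketch below assumes the `codecs` parameter from RFC 6381 as the example; the comment above does not say which specifications are meant, so this choice is an assumption. The helper name `parse_media_type` is hypothetical. It uses the standard library's `email.message.Message` to split a label into its type and parameters.

```python
from email.message import Message


def parse_media_type(value):
    """Split a media type label into (type, parameters).

    Relies on the standard library's Content-Type parsing, which
    handles quoted parameter values.
    """
    msg = Message()
    msg["Content-Type"] = value
    media_type = msg.get_content_type()
    # get_params() returns the type itself as the first entry; skip it.
    params = dict(msg.get_params()[1:])
    return media_type, params


# Illustrative label: an MP4 file with its codecs spelled out.
label = 'video/mp4; codecs="avc1.42E01E, mp4a.40.2"'
print(parse_media_type(label))
```

The base type identifies the container, and the parameter narrows it to the specific encodings inside. That is the finer granularity a format risk analysis might care about.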

The current NARA descriptions of file formats could use a framework for categorization of media types.

hannahlwang commented 3 months ago

Thanks for your comment @masinter. Currently, we do map to MIME types in the Preservation Action Plans for individual file formats. In those plans, we also map to other available standards and frameworks for identifying and characterizing file formats, including PRONOM and the Library of Congress FDDs. As you note, MIME types are widely used, and it is important to us to identify direct mappings between our framework and other resources wherever possible.

masinter commented 3 months ago

@hannahlwang I see now, on further examination, that there is a column for MIME type in the spreadsheet. I had only looked at the summary documents, such as the pages about Digital Audio records.

And there are some inadequacies