hyperspy / rosettasciio

Python library for reading and writing scientific data format
https://hyperspy.org/rosettasciio
GNU General Public License v3.0
49 stars 28 forks source link

RosettaSciIO Format Naming #34

Closed jlaehne closed 2 years ago

jlaehne commented 2 years ago

The naming of formats in RosettaSciIO is historically very inconsistent.

  1. We have formats with overly complicated names, e.g.: Digital Micrograph dm3, SEMPER UNF (unformatted), Electron Microscopy Data (EMD), ...

  2. We have formats with capitalized names (either file extensions, acronyms or shorter manufacturer names), e.g.: HSPY, DENS, MRCZ, TIFF, EMPAD, JEOL, ...

  3. And we have formats with concise, but not capitalized names, e.g.: ZSpy, Nexus, Blockfile, PantaRhei, ...

In case 2&3, the format_name often, but not always, corresponds to the module name.

With the first release, we should consider making the naming more consistent. To allow backwards compatibility for HyperSpy users, we could allow aliases for the format_name. I would propose to add a field format_alias to the .yaml dictionary defining a format, similar to the aliases of signal_types in HyperSpy.

I would propose to use a combination of case 2&3 with the following rules for the names:

ericpre commented 2 years ago

This sounds good!