I've added a (GitHub Markdown) file of definitions as a starting point for that discussion.
I've added a "census-data" directory with the SAIPE data and readme, as well as the (Haskell) code that pulls that data and adds the state abbreviations and 5-digit FIPS codes so it matches up with the VERA data.
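For reviewers who don't want to read the submodule, the relabeling step amounts to roughly the following. This is a minimal sketch, not the actual code: it assumes the source rows carry separate numeric state and county codes, and the names `stateAbbrevs`, `abbrevFor`, and `fips5` are hypothetical. The lookup table is deliberately truncated to three states.

```haskell
module Main where

import Text.Printf (printf)

-- Hypothetical, truncated lookup from numeric state FIPS code to postal
-- abbreviation. A real table would cover every state and DC.
stateAbbrevs :: [(Int, String)]
stateAbbrevs = [(1, "AL"), (6, "CA"), (36, "NY")]

-- Look up the postal abbreviation for a state code, if we know it.
abbrevFor :: Int -> Maybe String
abbrevFor st = lookup st stateAbbrevs

-- Build a 5-digit FIPS code: the 2-digit state code followed by the
-- 3-digit county code, each zero-padded. Out-of-range input gives Nothing.
fips5 :: Int -> Int -> Maybe String
fips5 st co
  | st < 0 || st > 99 || co < 0 || co > 999 = Nothing
  | otherwise = Just (printf "%02d%03d" st co)

main :: IO ()
main = do
  print (fips5 1 1)     -- Just "01001"
  print (fips5 6 37)    -- Just "06037"
  print (abbrevFor 36)  -- Just "NY"
  print (fips5 6 1000)  -- Nothing
```

Keeping the FIPS code as a zero-padded string rather than a number matters for the join: states with codes below 10 would otherwise lose their leading zero.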
A note about the code: I'm adding it as a submodule for now rather than as a Dockerfile or something more portable. I don't know how to do that yet, and this at least serves as a marker for doing it later if we decide that matters even for tools that just pull data and re-label it.
I also haven't documented how to get the code running for anyone who isn't already fairly familiar with Haskell. When we settle on levels of documentation for different sorts of tools, I'll bring this piece into compliance with whatever we decide.