darenasc / presupuesto-2015

0 stars 0 forks source link

Budget Data Package or Data Package? #1

Open trickvi opened 9 years ago

trickvi commented 9 years ago

After looking through the data package (the descriptor file datapackage.json and the data file data/Presupuesto 2015.csv) I found that it conforms to a normal data package but not to the budget data package. The difference lies in required fields. Budget data packages must have special headers in the csv file and then some additional metadata in the descriptor file while a data package can have any headers and just a schema to describe it.

So a budget data package is a special case of data packages, with predefined schema components.

Was it your intention to have this as a budget data package or just a data package?

Either way, this is really great and I'm wondering if we can use this to play around with data package support in OpenSpending (we are building a validator and hopefully in the future a mapping tool from a data package to a budget data package). The problem is I don't speak Spanish so I would appreciate if you could provide translation for at least the header row (just so I know what information we have in the file).

darenasc commented 9 years ago

My intention was to have the data as a budget data package to share it in a standard format. I will look into the specification to make it compliant with the budget data package specification.

The original dataset is in http://datos.gob.cl/datasets/ver/31132 http://datos.gob.cl/datasets/ver/31132 it's in a hierarchical way where each level of aggregation is a row with the corresponding aggregated amount per row and does not have a separation between expenses and revenues.

I did put the data in a tabular way and add a variable TIPO (type) to separate GASTOS (expenses) and INGRESOS (revenues).

I don't see any problem to play with the data.

About the translation the dataset has nine fields, only TIPO was added by me to separate expenses and revenues. The fields are PARTIDA (Batch), CAPITULO (Chapter), PROGRAMA (Program), SUBTITULO (Subtitle), ITEM (Item), ASIGNACION (Assignment), MONTO_PESOS (Chilean pesos amount), MONTO_DOLAR (Dollar amount) and TIPO (Type: GASTO = Expense or INGRESO = Revenue).

I'm looking forward to cooperate and share and use the data so let me know if there is anything I can do to improve the data or any further comments about it.