datopian / datahub

[OpenSpending] 2014 EU budget data is broken #947

Closed mpolidori closed 1 year ago

mpolidori commented 1 year ago

The Budget of the European Union 2014 dataset is broken. It currently lists 2 data files, c!SEC1_E!en!0.csv and c!SEC1_E!en!10.csv, both of which link to raw GitHub files. The first is missing data/columns in some places (or at least the trailing comma where EpNote is empty). Here are the first 8 lines; look at the last 2 columns:

Alias,ID,Heading,Expenditure,Language,PoliticalCategory,Year,Amount,EpNote
1 0 0 0,NIV-47483218314141915_47483218314322839@dd2a8986-adea-456d-b71b-b3c9bb23dabe-N10C71,Salaries,exp,en,5.2.11,2014,73 643 70
1 0 0 0,NIV-47483218314141915_47483218314322839@dd2a8986-adea-456d-b71b-b3c9bb23dabe-N10C71,Salaries,exp,en,5.2.11,2013,71 393 07
1 0 0 0,NIV-47483218314141915_47483218314322839@dd2a8986-adea-456d-b71b-b3c9bb23dabe-N10C71,Salaries,exp,en,5.2.11,2013,71 393 074,AM Rapp / Above DB / Reserv
1 0 0 4,NIV-47483218314141915_47483218314322840@dd2a8986-adea-456d-b71b-b3c9bb23dabe-N10C94,Ordinary travel expenses,exp,en,5.2.11,2014,65 400 00
1 0 0 4,NIV-47483218314141915_47483218314322840@dd2a8986-adea-456d-b71b-b3c9bb23dabe-N10C94,Ordinary travel expenses,exp,en,5.2.11,2013,72 343 14
1 0 0 4,NIV-47483218314141915_47483218314322840@dd2a8986-adea-456d-b71b-b3c9bb23dabe-N10C94,Ordinary travel expenses,exp,en,5.2.11,2012,72 103 309.0
1 0 0 4,NIV-47483218314141915_47483218314322840@dd2a8986-adea-456d-b71b-b3c9bb23dabe-N10C94,Ordinary travel expenses,exp,en,5.2.11,2012,72 103 309.00,AM Rapp / Below D

My guess is that special characters caused some rows to lose the last column (EpNote) when the file was parsed during the migration, but I have not confirmed this.
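To find the affected rows, you can compare each row's field count against the header. Below is a minimal sketch, assuming the file is plain comma-delimited CSV. The function name `find_short_rows` is hypothetical, and the sample rows are abridged from the excerpt above (IDs shortened to placeholders), not the full file.

```python
import csv
import io


def find_short_rows(text):
    """Return (line_number, field_count) for rows with fewer fields than the header."""
    reader = csv.reader(io.StringIO(text))
    header = next(reader)
    short = []
    for row in reader:
        # Skip completely blank lines; flag rows that have lost trailing fields.
        if row and len(row) < len(header):
            short.append((reader.line_num, len(row)))
    return short


# Abridged sample modelled on the excerpt; "ID-A" is a placeholder, not a real ID.
sample = (
    "Alias,ID,Heading,Expenditure,Language,PoliticalCategory,Year,Amount,EpNote\n"
    "1 0 0 0,ID-A,Salaries,exp,en,5.2.11,2014,73 643 70\n"
    "1 0 0 0,ID-A,Salaries,exp,en,5.2.11,2013,71 393 074,AM Rapp\n"
)

print(find_short_rows(sample))  # [(2, 8)]
```

Running this over the full file should show whether the short rows are exactly the ones with an empty EpNote, which would point to a dropped trailing comma rather than lost data.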

Full file: https://raw.githubusercontent.com/os-data/eu-budget-2014/main/data/c!SEC1_E!en!0.csv

The second file returns a 404: https://raw.githubusercontent.com/os-data/eu-budget-2014/main/data/c!SEC1_E!en!10.csv
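To re-check both links after a fix, a small status probe is enough. This is a sketch using only the Python standard library. The helper name `http_status` and its `opener` parameter are hypothetical, added here so the function can be exercised without network access.

```python
import urllib.error
import urllib.request


def http_status(url, opener=urllib.request.urlopen):
    """Return the HTTP status code for a HEAD request to `url`."""
    request = urllib.request.Request(url, method="HEAD")
    try:
        with opener(request, timeout=10) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # urllib raises HTTPError for 4xx/5xx responses; the code is on the exception.
        return err.code


# Example call (needs network access, so it is left commented out):
# http_status("https://raw.githubusercontent.com/os-data/eu-budget-2014/main/data/c!SEC1_E!en!10.csv")
```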

anuveyatsu commented 1 year ago

@mpolidori could you please move OpenSpending-related issues over here: https://github.com/os-data/registry/issues

mpolidori commented 1 year ago

Closing. Moved to os-data repo issue https://github.com/os-data/registry/issues/80