hasadna / knesset-data-pipelines

Main repository for Open Knesset project - contains the knesset data scrapers and processing pipelines
https://oknesset.org/
MIT License

Duplicate and lack of data in 'members_mk_individual' table #217

Closed bobiboMC closed 1 year ago

bobiboMC commented 1 year ago

This table has duplicate columns: "FirstName" is the same as "mk_individual_first_name", and "LastName" is the same as "mk_individual_name". Also, there is no data in the "mk_individual_first_name_eng" column and "mk_individual_first_name_eng" for any row. I'm not sure what the "mk_status_id" column is or whether it's necessary, because its value is 0 for every row. The "mk_individual_photo" column holds the same placeholder photo link for every row; real photos need to be added in the future.
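For anyone who wants to reproduce these checks, here is a minimal sketch of how duplicate, empty, and constant columns could be detected. It is not part of the pipelines; the `audit_columns` helper and the sample rows (including the names and the placeholder URL) are made up for illustration, and only the column names come from the table described above.

```python
from itertools import combinations


def audit_columns(rows):
    """Report duplicate, empty, and constant columns in a list of row dicts.

    Hypothetical helper for illustration; assumes every row has the same keys
    and that all values are hashable (strings, numbers, None).
    """
    if not rows:
        return {"duplicates": [], "empty": [], "constant": []}

    columns = list(rows[0].keys())
    values = {col: [row.get(col) for row in rows] for col in columns}

    def is_blank(value):
        return value is None or (isinstance(value, str) and value.strip() == "")

    # Columns where every row is None or an empty string.
    empty = [c for c in columns if all(is_blank(v) for v in values[c])]

    # Non-empty columns that hold one single value in every row.
    constant = [c for c in columns if c not in empty and len(set(values[c])) == 1]

    # Pairs of non-empty columns whose values match row by row.
    duplicates = [
        (a, b)
        for a, b in combinations(columns, 2)
        if a not in empty and b not in empty and values[a] == values[b]
    ]

    return {"duplicates": duplicates, "empty": empty, "constant": constant}


# Made-up sample rows, NOT real data from the table.
sample_rows = [
    {
        "FirstName": "Aleph",
        "mk_individual_first_name": "Aleph",
        "LastName": "Bet",
        "mk_individual_name": "Bet",
        "mk_individual_first_name_eng": "",
        "mk_status_id": 0,
        "mk_individual_photo": "https://example.org/placeholder.png",
    },
    {
        "FirstName": "Gimel",
        "mk_individual_first_name": "Gimel",
        "LastName": "Dalet",
        "mk_individual_name": "Dalet",
        "mk_individual_first_name_eng": "",
        "mk_status_id": 0,
        "mk_individual_photo": "https://example.org/placeholder.png",
    },
]

report = audit_columns(sample_rows)
print(report)
```

On the real table, the rows could be loaded with `csv.DictReader` from an export and passed to the same function. Note that CSV values arrive as strings, so a constant `mk_status_id` would show up as `"0"` rather than `0`.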

OriHoch commented 1 year ago

thanks, nice work, that's very useful

this table combines data from a few different sources, and tries to get a single row per MK, so I assume that's the source of these problems, but anyway, I'll look into it

OriHoch commented 1 year ago

regarding english names, I opened a new issue - #220

OriHoch commented 1 year ago

regarding mk photos - opened an issue - #221

OriHoch commented 1 year ago

other issues are probably due to old data or changes in schema, but we don't want to remove fields because some other pipelines or users might depend on them