padejski / veritzaproto

2 stars 2 forks source link

Serbia app - data integrations #23

Open padejski opened 8 years ago

padejski commented 8 years ago

Public Official Companies - Serbia

    • List of all the Serbia companies that have a public official name in the list of founders, directors or board members. Take “names” column from the public officials table and match all the companies from the companies table that has that name in founders column and “other individuals” column - (note: I can’t actually see “other individuals” column, there is only “directors” column, but you said you made that fix - ref: https://github.com/padejski/veritzaproto/issues/22 )

Public Official Companies in Procurement

    • List of all the government procurements in Serbia where the winning company has a public official as a founder, director or board member. Take company ID number from the previous integration (Public official companies) and cross-check it with the Vendor / Supplier ID from procurement table.

not for now 3. and 4. - data is not available anymore - I was surprised to find out this.

  1. Family Member Companies List of all the Serbian companies that have a family member of a public in the list of founders, directors or board members. Take “spouse” and “children” columns from the public officials table and match all the companies from the companies table that has that name in founders column and “other individuals” column
  2. Family Member Companies in Procurement List of all the government procurements in Serbia where the winning company has a family member of a public official as a founder, director or board member. Take company ID number from the previous integration (Family member companies) and cross-check it with the Vendor / Supplier ID from procurement table.

Political funders companies

Political funders companies in Procurement

Companies political funders in procurement

mattgathu commented 8 years ago

thank you for this

mattgathu commented 8 years ago

I noticed there is a limitation in the number of companies that can be scraped. The script uses a list of ids from a csv file and at the moment there are only about a hundred of them.

padejski commented 8 years ago

hm, we were supposed to figure out this problem somehow - is there any way to auto-generate them?

mattgathu commented 8 years ago

Initially you had given me a csv document with the information.

It has the following headers: Тип субјекта,Матични број,Пословно име,Статус,Детаљније

Maybe you can recall where you obtained the data.

Meanwhile I will add an auto-generator and see if it works.

padejski commented 8 years ago

I did a basic keyword search on the website itself (search form) and got list of results from that - and then I copied them for you into the file

mattgathu commented 8 years ago

Apologies, I overlooked the “other individuals” data when making the fixes. I have been trying to include it but it seems the companies data website is down at the moment. I can't access http://pretraga2.apr.gov.rs/ or http://pretraga2.apr.gov.rs/ObjedinjenePretrage/Search/Search

padejski commented 8 years ago

http://pretraga2.apr.gov.rs/ObjedinjenePretrage/Search/Search works for me

mattgathu commented 8 years ago

seems like my IP is banned, I just reloaded it via a vpn and it worked fine.

padejski commented 8 years ago

maybe it's because of your scripts

mattgathu commented 8 years ago

Yes. I suspect as much. I will switch to a different IP.

mattgathu commented 8 years ago

I have implemented all the data integrations. We can review and discuss the outstanding issues on:

the use of names search doesn't cover all the bases and it's hard to find companies that are political funders.