ror-community / ror-roadmap

Central information about what is happening at ROR and how to contribute feedback
10 stars 2 forks source link

[EXTERNAL ID] IČO – identification number of a legal person (Identifikační číslo osoby) #260

Open hana-her opened 2 months ago

hana-her commented 2 months ago
  1. Name of External ID system: IČO – Identifikační číslo osoby – Identification number of a legal person

  2. Organization maintaining the ID system: Czech Statistical Office, https://csu.gov.cz/home

  3. Brief description: Unique eight-digit identification number in Czechia of a legal entity, a natural person doing business or an organizational unit of the state. Key identifier of organizations used within Czechia.

  4. Is the data associated with the external ID openly accessible via a web interface, API or data file that is not behind a paywall?

    • [x] Yes
    • [ ] No
  5. Provide the URL where the data can be accessed. Include examples of individual record access, if available: IČO data can be accessed the Registry of Economic Subjects / Business Register by the the Czech Statistical Office which can be downloaded in CSV format (basic info: https://csu.gov.cz/business_register; information on Business Register provided as open data: https://data.gov.cz/dataset?iri=https%3A%2F%2Fdata.gov.cz%2Fzdroj%2Fdatov%C3%A9-sady%2F00025593%2F7bad26fdd8554ce715b81b5b416d75f0) It can also be accessed via the ARES system (Administrative Register of Economic Subjects) managed by the Czech Ministry of Finance https://ares.gov.cz/stranky/open-data which might be better documented in English and is available via API.

  6. Describe the available data formats (e.g., JSON, CSV, XML): CSV, XML

  7. What is the license for the data associated with the external ID? Provide a link to the license, where available. See https://data.gov.cz/dataset?iri=https%3A%2F%2Fdata.gov.cz%2Fzdroj%2Fdatov%C3%A9-sady%2F00025593%2F7bad26fdd8554ce715b81b5b416d75f0 obrazek

  8. Is the license compatible with open use and redistribution?

    • [x] Yes
    • [ ] No
    • [ ] Uncertain (please explain)
  9. How frequently is the data updated? the open data set is updated twice per month

  10. Is versioning information available for the data?

    • [x] Yes
    • [ ] No
  11. Is there an existing mapping between this ID system and ROR?

    • [ ] Yes
    • [ ] No
    • [x] Partial
  12. If yes or partial, provide details or link to the mapping: Via wikidata – partial only and likely needs to be checked for errors: https://query.wikidata.org/#SELECT%20%3Fentity%20%3FentityLabel%20%3FrorIdentifier%20%3Fico%20WHERE%20%7B%0A%20%20%23%20Find%20entities%20located%20in%20Czech%20Republic%0A%20%20%3Fentity%20wdt%3AP17%20wd%3AQ213%20%3B%0A%20%20%20%20%20%20%20%20%20%20%23%20Ensure%20they%20have%20a%20ROR%20identifier%0A%20%20%20%20%20%20%20%20%20%20wdt%3AP6782%20%3FrorIdentifier%20.%0A%0A%20%20%23%20Optionally%20get%20the%20ICO%20value%0A%20%20OPTIONAL%20%7B%0A%20%20%20%20%3Fentity%20wdt%3AP4156%20%3Fico%20.%0A%20%20%7D%0A%20%20%23%20Get%20the%20entity%20name%0A%20%20%3Fentity%20rdfs%3Alabel%20%3FentityLabel%20.%0A%20%20FILTER%28LANG%28%3FentityLabel%29%20%3D%20%22en%22%29%0A%7D%0A

  13. Describe the adoption of this ID system in its relevant community or domain, including any major research infrastructure systems that integrate these identifiers: IČO is the primary identifier of organizations used in Czechia, it is used in any system requiring identification of organizations.

  14. Who would benefit from including this external identifier? The Czech R&D community since the inclusion / mapping of IČO with ROR would alleviate any type of issues or concerns the local community would have around using ROR in systems as an organization identifier instead of IČO.

  15. Provide any additional information or context that supports this request: There is currently a fairly strong will to adopt ROR in the Czech Republic at the Office of the Czech Government that manages the Czech national CRIS system (which is now using IČO).

adambuttrick commented 2 months ago

Relative to the comment on point 15, it would be useful to identify the subset of IČO records for organizations currently in use by the Czech national CRIS system. Since this data is very broad, includes both persons and organizations, as well as non-research entities, this proactive filtering would help us to be able to reconcile and integrate.

hana-her commented 1 month ago

Hi Adam, please find attached the list of the IČO numbers extracted from the few last years worth of data in the Czech CRIS system, there is about 700 of them.

riv-instituce.xlsx