ga-group / bsym

Bloomberg open symbology datasets

Unicode issue when importing data into neo4j with neosemantics plugin #2

Open JasonPad19 opened 2 years ago

JasonPad19 commented 2 years ago

Hi

I am having some issues importing the data. Do you have any suggestions, please?

The issues went away after I manually edited the raw data (fimp.nq).

image

1 - But I have encountered more issues in this file (lerd.nq). Are you planning to put a fix in the source?

image

2 - Also, I have an issue when importing the fird.nq file, with the errors shown below. Any ideas?

image

3 - One more question: I am using Legal Entity data from https://data.world/gleif/lei-data.

Is it compatible with the lerd files you define?

Thanks, Jason

hroptatyr commented 2 years ago

Hi, thanks for the report.

#1: I added fixes to the dumping process.

#2: Those are column headers that accidentally made it into the dump. You can read from line 2 onwards.

#3: It looks like it. I haven't checked any of the files provided, but I'm using the gleif ontology too, just not their new data namespaces.
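For the header line in fird.nq, a minimal Python sketch of "read from line 2 onwards" could look like the following. The helper name `skip_header` and the sample content are made up for illustration; the real file would be opened with `open(...)` instead of the in-memory buffers used here.

```python
import io

def skip_header(src, dst, header_lines=1):
    """Copy src to dst line by line, dropping the first `header_lines` lines."""
    for i, line in enumerate(src):
        if i >= header_lines:
            dst.write(line)

# In-memory stand-in for the dump; the header and the quad are invented examples.
sample = io.StringIO(
    "col_a\tcol_b\n"
    '<http://example.org/s> <http://example.org/p> "o" <http://example.org/g> .\n'
)
out = io.StringIO()
skip_header(sample, out)
```

Streaming line by line avoids loading the whole dump into memory, which matters if the files are large.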

JasonPad19 commented 2 years ago

Thanks for your reply, @hroptatyr.

Re #1, after your fix, do you know when I can download the latest file?

Re #2, I will try removing line 1 and importing again.

Re #3, since I have already loaded the Global LEI data into neo4j, I can check this once I manage to load figi too.

hroptatyr commented 2 years ago

The next dump will start at 10:00 UTC on Sunday, 2021-10-03. Since this process takes some 48 hours, you should be able to download the new dump on Tuesday.

JasonPad19 commented 2 years ago

Awesome! I have downloaded the latest file and it has been imported successfully.

Now I am testing the search with Apple INC. Looking at the result (attached below), it finds an ISIN for Apple, but there isn't a figi code.

A question about the data: do we actually have a figi code for Apple INC in the dataset?

image

hroptatyr commented 2 years ago

I don't know what I'm looking at. The figi for Apple Inc (listed on XNGS) is BBG000B9Y5X2. The figi for the US composite is BBG000B9XRY4.
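One way to check whether these two identifiers are present in a downloaded dump is a plain text scan before importing. This is only a sketch, assuming the FIGI appears literally somewhere in the N-Quads lines; the function name `find_figis`, the sample lines, and the example.org IRIs are made up for illustration and are not taken from the actual dump.

```python
import io

# The two identifiers quoted in the comment above.
FIGIS = ("BBG000B9Y5X2", "BBG000B9XRY4")

def find_figis(lines, figis=FIGIS):
    """Return {figi: [matching lines]} using a simple substring scan."""
    hits = {f: [] for f in figis}
    for line in lines:
        for f in figis:
            if f in line:
                hits[f].append(line.rstrip("\n"))
    return hits

# Invented stand-in for a dump file; replace with open("<dump>.nq") in practice.
sample = io.StringIO(
    '<http://example.org/BBG000B9Y5X2> <http://example.org/label> "APPLE INC" .\n'
    '<http://example.org/other> <http://example.org/label> "OTHER" .\n'
)
hits = find_figis(sample)
```

If a FIGI shows up in the raw file but not in the graph query, the gap is more likely in the import or in the query than in the data.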