SDM-TIB / SDM-RDFizer

An Efficient RML-Compliant Engine for Knowledge Graph Construction
https://doi.org/10.5281/zenodo.3872103
Apache License 2.0
107 stars 25 forks source link

SDM-RDFizer not able to semantifying rml mapping rules files with join conditions for accents characters #82

Closed rohit252 closed 2 years ago

rohit252 commented 2 years ago

snap 1 snap1 snap 2 snap2 snap 3 Capture

SDM-RDFizer not creating triples for the above rml file with the join condition, here author_1.csv file contains first, last and institution name with the accents characters but when I create another csv file author_2.csv without the accent characters but with the same content as author_1.csv it is working then.

I have created a test case:

SDM-RDFizer.zip

  1. After unzipping the folder, Inside the SDM-RDFizer folder there is a folder named "exam" , in the exam folder we have three csv files, rml mapping rule file and config file for the SDM-RDFizer.

  2. author_reference.csv file contains id and author_id, I have used this csv file in the triplemap 1 as we can see in the above snap1.

  3. author_1.csv file contains first, last and institution names with the accents characters and I am using join conditions to join triple map1 and triple map2 ( please see snap 2 )

  4. After running the config file for the SDM-RdFizer, I am getting the error, please see snap3

  5. But using author_2.csv if I do the same steps, then it is able to generate n triples.

Please see pics: snap 4 snap 5 snap 6

eiglesias34 commented 2 years ago

Hello @rohit252,

First of all thank you for using the SDM-RDFizer. Coincidentally, I was working on the reading of special characters. I fixed the problem. I ran the example you sent me I ran with no problem. Here is a portion of the resulting nt file.

<http://example.com/ns/author_9> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.w3.org/ns/sosa/Author>.
<http://example.com/ns/author_9> <http://www.w3.org/2000/01/rdf-schema#institution> "Institute of Physics, Bijenička".
<http://example.com/ns/author_9> <http://www.w3.org/2000/01/rdf-schema#first_name> "Osor".
<http://example.com/ns/author_9> <http://www.w3.org/2000/01/rdf-schema#last_name> "Barišić".
<http://example.com/ns/author_10> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.w3.org/ns/sosa/Author>.
<http://example.com/ns/author_10> <http://www.w3.org/2000/01/rdf-schema#institution> "EPFL".
<http://example.com/ns/author_10> <http://www.w3.org/2000/01/rdf-schema#first_name> "László".
<http://example.com/ns/author_10> <http://www.w3.org/2000/01/rdf-schema#last_name> "Forró".
<http://example.com/ns/d_1> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.w3.org/ns/sosa/FeatureOfInterest>.
<http://example.com/ns/d_1> <https://materials.hybrid3.duke.edu/materials/has_author_id> <http://example.com/ns/author_1>.
<http://example.com/ns/d_2> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.w3.org/ns/sosa/FeatureOfInterest>.
<http://example.com/ns/d_2> <https://materials.hybrid3.duke.edu/materials/has_author_id> <http://example.com/ns/author_2>.
<http://example.com/ns/d_3> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.w3.org/ns/sosa/FeatureOfInterest>.
<http://example.com/ns/d_3> <https://materials.hybrid3.duke.edu/materials/has_author_id> <http://example.com/ns/author_9>.
<http://example.com/ns/d_4> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://www.w3.org/ns/sosa/FeatureOfInterest>.
<http://example.com/ns/d_4> <https://materials.hybrid3.duke.edu/materials/has_author_id> <http://example.com/ns/author_2>.

Keep in mind that I only updated the code, not the library. Please test it out and tell me if the problem was solved on your end. Afterwards, I will update the library.

Cheers, Enrique

rohit252 commented 2 years ago

Hi @eiglesias34 ,

Thank you for the reply, I have used the updated code and it is working on my end also :))))))

Best, Rohit

eiglesias34 commented 2 years ago

Hello again,

I'm going to close the issue now.

Thank you for using the SDM-RDFizer. Cheers, Enrique