ddavisqa / google-refine

Automatically exported from code.google.com/p/google-refine

Wrong record index when using XML Importer with certain XML structures #393

Closed by GoogleCodeExporter 8 years ago

GoogleCodeExporter commented 8 years ago
In the screenshot (highlighted in the yellow box), I think aubrey-da-venne and 
tamar should actually be record numbers 4 and 5, respectively.

I specified the <topic id> elements as the record source in the XML selector 
preview. Refine imported the file without errors.

The other screenshot, for the topic ID, shows the first few records in 
Notepad++ and the order I was expecting them to have in Refine.

(By the way, opera characters are not mapped to the source opera or play, so 
don't worry about that, David. This XML is just a flat list of topics with no 
linking between the topics, unfortunately.)

Original issue reported on code.google.com by thadguidry on 26 May 2011 at 3:07

Attachments:

GoogleCodeExporter commented 8 years ago
Does this problem still exist when using Refine built from the SVN trunk?  If 
so, can you provide a copy of the data file?

Original comment by tfmorris on 8 Oct 2011 at 7:43

GoogleCodeExporter commented 8 years ago
I'm experiencing the same issue here (with SVN r2362). I have sent you the XML 
file through e-mail. Only 8 out of 10 records are correctly identified; the 
other 2 do appear but have no record ID assigned.

Original comment by raoulwis...@gmail.com on 6 Nov 2011 at 4:26

GoogleCodeExporter commented 8 years ago
My original issue with the index is resolved now in r2363 for my Italian Opera 
data test file (attached).  Aubrey Da Venne is now index 4 and Tamar is 5, 
correctly so.

Original comment by thadguidry on 6 Nov 2011 at 5:28

Attachments:

GoogleCodeExporter commented 8 years ago
Thad - Thanks for the file. At a glance, it appears that there may still be a 
problem, since I got 1341 records but a text facet on the topic ID says there 
are 1937 choices. As an example, record 819 has four topic IDs: Type, 
son-pochi-fiori, roberto, and dorotea.
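
A mismatch like this can be cross-checked outside Refine by counting the topic elements in the source file directly. The following is only a minimal sketch: the sample XML, the `topic` tag with an `id` attribute, and the helper name `topic_id_stats` are assumptions made for illustration, since the real file's layout is not shown in this thread.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample data; the real file's structure is an assumption.
SAMPLE = """<topics>
  <topic id="aubrey-da-venne"><name>Aubrey Da Venne</name></topic>
  <topic id="tamar"><name>Tamar</name></topic>
  <topic id="roberto"><name>Roberto</name></topic>
  <topic id="tamar"><name>Tamar</name></topic>
</topics>"""


def topic_id_stats(xml_text, tag="topic", attr="id"):
    """Return (number of <tag> elements, number of distinct attr values)."""
    root = ET.fromstring(xml_text)
    # Collect the attribute from every matching element, at any depth.
    ids = [el.get(attr) for el in root.iter(tag)]
    return len(ids), len(set(ids))


total, distinct = topic_id_stats(SAMPLE)
print(total, distinct)
```

If the number of distinct IDs found this way is larger than the record count Refine reports, that would suggest several topics are being grouped into a single record, as with record 819.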

Raoul - I received your file and it looks like your file is also affected by 
the bug in issue 137.

I'm going to close this as a duplicate.  Please follow issue 137 for updates.

Original comment by tfmorris on 6 Nov 2011 at 9:26