The cross_reference-mysql.sql contains duplicates of each record. I discovered the issue when I loaded the script in MySQL and found twice the number of records then in the sqlite version. I ran the following distinct query to confirm if each record had been duplicated: select count(*) from (select distinct vid, sv from cross_reference) a;
I initially raised the concern in my pull request #10 and another user PaulBrownMagic verified the finding against the original source file by using the following bash command: sort cross_references.txt | uniq | wc -l
I would have happily supplied a fix by now but I'm not familiar with MySQL. I'm happy to fix if anybody has some tips.
The
cross_reference-mysql.sql
contains duplicates of each record. I discovered the issue when I loaded the script in MySQL and found twice the number of records then in the sqlite version. I ran the following distinct query to confirm if each record had been duplicated:select count(*) from (select distinct vid, sv from cross_reference) a;
I initially raised the concern in my pull request #10 and another user PaulBrownMagic verified the finding against the original source file by using the following bash command:
sort cross_references.txt | uniq | wc -l
I would have happily supplied a fix by now but I'm not familiar with MySQL. I'm happy to fix if anybody has some tips.