scrollmapper / bible_databases

Bible versions and cross-reference databases.

MySQL cross reference script contains duplicates #11

Closed: gonzigonz closed 7 years ago

gonzigonz commented 7 years ago

The cross_reference-mysql.sql script contains a duplicate of every record. I discovered the issue when I loaded the script into MySQL and found twice as many records as in the SQLite version. I ran the following distinct query to confirm that each record had been duplicated:
select count(*) from (select distinct vid, sv from cross_reference) a;

I initially raised the concern in my pull request #10, and another user, PaulBrownMagic, verified the finding against the original source file using the following bash command:
sort cross_references.txt | uniq | wc -l
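
As a rough sketch of how that pipeline can be read, the snippet below runs the same `sort | uniq | wc -l` check next to a plain line count, so the two numbers can be compared directly. The temporary file and its `row-A`/`row-B`/`row-C` contents are made-up placeholders, not data from cross_references.txt, and the `uniq -d` line is an extra added here for illustration rather than part of the check quoted above.

```shell
# Hypothetical stand-in for the real source file: three rows, each present twice.
sample=$(mktemp)
printf '%s\n' row-A row-B row-C row-A row-B row-C > "$sample"

# $(( ... )) strips the padding some wc implementations put around the count.
total=$(( $(wc -l < "$sample") ))                     # every line, duplicates included
unique=$(( $(sort "$sample" | uniq | wc -l) ))        # same pipeline as the command above
repeated=$(( $(sort "$sample" | uniq -d | wc -l) ))   # distinct lines that occur more than once

echo "total=$total unique=$unique repeated=$repeated"
rm -f "$sample"
```

If the total is exactly twice the unique count and every unique line is also reported by `uniq -d`, each record appears twice, which is the pattern described for the MySQL script.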

I would have happily supplied a fix by now, but I'm not familiar with MySQL. I'm happy to fix it if anybody has some tips.

PaulBrownMagic commented 7 years ago

I've created a fix in a fork and opened a matching pull request, #12.