Not clear why this happens. In principle all records already in cache are identified upfront, and only records not in cache are looked for... so at this stage, there should be no overlap. By checking some of the entries, I could not find the one actually already in cache.
`File "", line 33, in
matches = scientist1.find_matches(stacked=True, verbose=True, information=False)
File "C:\anaconda3\lib\site-packages\sosia\sosia.py", line 401, in find_matches
cache_authors(res)
File "C:\anaconda3\lib\site-packages\sosia\utils\cache.py", line 148, in cache_authors
conn.executemany(query, df.to_records(index=False))
Not clear why this happens. In principle all records already in cache are identified upfront, and only records not in cache are looked for... so at this stage, there should be no overlap. By checking some of the entries, I could not find the one actually already in cache.
`File "", line 33, in
matches = scientist1.find_matches(stacked=True, verbose=True, information=False)
File "C:\anaconda3\lib\site-packages\sosia\sosia.py", line 401, in find_matches cache_authors(res)
File "C:\anaconda3\lib\site-packages\sosia\utils\cache.py", line 148, in cache_authors conn.executemany(query, df.to_records(index=False))
IntegrityError: UNIQUE constraint failed: authors.auth_id`