EOL / tramea

A lightweight server for denormalized EOL data
Other
2 stars 1 forks source link

pages/3346542 - Explain Bad Merges (Obis and BioLib) #260

Open JRice opened 8 years ago

JRice commented 8 years ago

I've written a bunch of code to help with this stuff. You can now do something like this:

concept = Concept.find 3346542
concept.explain_entries

and get this:

Initial BioLib.cz Import (394):
  Ctenoides (Ctenoides) => 30270963
  Ctenoides Mörch 1853 (Ctenoides) => 30270964
  Strophomorphus ctenoides Pelletier, 1999 (Strophomorphus ctenoides) => 30493749
  Ctenoides concentrica G. B. Sowerby III, 1888 (Ctenoides concentrica) => 30694479
  Ctenoides japonica R. W. Dunker (Ctenoides japonica) => 30694480
  Ctenoides philippinarum Masahito & Habe, 1972 (Ctenoides philippinarum) => 30694482
  Ctenoides suavis P. Masahito, T. Kuroda & T. Habe 1971 (Ctenoides suavis) => 30694484
  Ctenoides scabra glacialis J. F. Gmelin, 1791 (Ctenoides scabra glacialis) => 30854093
AquaMaps Resource (467):
  Ctenoides (Ctenoides) => 32584692
OBIS depth range resource (556):
  Ctenoides (Ctenoides) => 35370596
  Ctenoides barnardi (Ctenoides barnardi) => 35370598
  Ctenoides divaricata (Ctenoides divaricata) => 35370599
  Ctenoides miamiensis (Ctenoides miamiensis) => 35370600
  Ctenoides nimbifer (Ctenoides nimbifer) => 35370601
  Ctenoides sanctipauli (Ctenoides sanctipauli) => 35370604
  Ctenoides suavis (Ctenoides suavis) => 35370605
DiscoverLife resource (647):
  Ctenoides concentricus (Ctenoides concentricus) => 37555364
  Ctenoides miamiensis (Ctenoides miamiensis) => 37555365
  Ctenoides sanctipauli (Ctenoides sanctipauli) => 37555369
  Ctenoides concentrica (Ctenoides concentrica) => 53477373
NMNH image collection resource (464):
  Ctenoides (Ctenoides) => 45965456
  Ctenoides Miamiensis Mikkelsen & Bieler, 2003 (Ctenoides) => 45993162
BOLD Systems Resource (428):
  Ctenoides (Ctenoides) => 48768213
Moorea Biocode (821):
  Ctenoides (Ctenoides) => 50176946
Taxonomic Hierarchy of COL-China 2012 (1139):
  Ctenoides (Ctenoides) => 50300721
NCBI Taxonomy (1172):
  Ctenoides (Ctenoides) => 51308914
Species 2000 & ITIS Catalogue of Life: April 2013 (1188):
  Ctenoides (Ctenoides) => 52565441
iNaturalist Images (987):
  Ctenoides (Ctenoides) => 54479872
OBIS Environmental Information (1307):
  Ctenoides miamiensis Mikkelsen & Bieler (Ctenoides miamiensis) => 55115931
  Ctenoides nimbifer Iredale, 1924 (Ctenoides nimbifer) => 55148084
  Ctenoides Mikkelsen & Bieler (Ctenoides) => 55192166
  Ctenoides barnardi Kilburn (Ctenoides barnardi) => 55197780
  Ctenoides suavis Masahito, 1971 (Ctenoides suavis) => 55209139
  Ctenoides divaricata (Barnard) (Ctenoides divaricata) => 55227116
  Ctenoides sanctipauli Stuardo (Ctenoides sanctipauli) => 55233421
McClain Bivalve Sizes (1314):
  Ctenoides (Ctenoides) => 55276324
  Ctenoides miamiensis (Ctenoides miamiensis) => 55276325
Wikipedia (431):
  Ctenoides (Ctenoides) => 55966813
Inventaire National du Patrimoine Naturel (1388):
  Ctenoides sanctipauli Stuardo, 1968 (Ctenoides sanctipauli) => 57690427
Femorale resource (1395):
  Ctenoides concentricus (Sowerby, 1888) (Ctenoides concentricus) => 59025594
  Ctenoides philippinarum (Masahito & Habe, 1972) (Ctenoides philippinarum) => 59025596
  Ctenoides suavis Masahito, Kuroda & Habe, 1971 (Ctenoides suavis) => 59025598
  Ctenoides (Ctenoides) => 59048885
Paleobiology Database (967):
  Ctenoides Mörch 1853 (Ctenoides) => 59571671
Global Biotic Interactions (1321):
  Ctenoides (Ctenoides) => 60338375
  Ctenoides (Ctenoides) => 61305361
Smithsonian type specimen data (1484):
  Ctenoides (Ctenoides) => 61681667
  Ctenoides miamiensis Mikkelsen & Bieler (Ctenoides miamiensis) => 61681670
EOL Group on Flickr (114):
  Ctenoides (Ctenoides) => 61862030

...which is just for convenience. You can then call obis_entries = concept.entries_from(1307) and run something like

explanation = [] ; obis_entries.each { |entry| explanation << concept.explain_relationships(entry) } ; puts explanation.join("\n") ; 1

to get a long report of how the merges might go. ...Snippet:

It looks like the entry from `OBIS Environmental Information` named `Ctenoides nimbifer Iredale, 1924` (canonical: Ctenoides nimbifer) would NOT match the entry from `OBIS Environmental Information` named `Ctenoides suavis Masahito, 1971` (canonical: Ctenoides suavis) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides nimbifer Iredale, 1924` (canonical: Ctenoides nimbifer) would NOT match the entry from `OBIS Environmental Information` named `Ctenoides divaricata (Barnard)` (canonical: Ctenoides divaricata) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides nimbifer Iredale, 1924` (canonical: Ctenoides nimbifer) would NOT match the entry from `OBIS Environmental Information` named `Ctenoides sanctipauli Stuardo` (canonical: Ctenoides sanctipauli) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides nimbifer Iredale, 1924` (canonical: Ctenoides nimbifer) would NOT match the entry from `McClain Bivalve Sizes` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides nimbifer Iredale, 1924` (canonical: Ctenoides nimbifer) would NOT match the entry from `McClain Bivalve Sizes` named `Ctenoides miamiensis` (canonical: Ctenoides miamiensis) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides nimbifer Iredale, 1924` (canonical: Ctenoides nimbifer) would NOT match the entry from `Wikipedia` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides nimbifer Iredale, 1924` (canonical: Ctenoides nimbifer) would NOT match the entry from `Inventaire National du Patrimoine Naturel` named `Ctenoides sanctipauli Stuardo, 1968` (canonical: Ctenoides sanctipauli) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides nimbifer Iredale, 1924` (canonical: Ctenoides nimbifer) would NOT match the entry from `Femorale resource` named `Ctenoides concentricus (Sowerby, 1888)` (canonical: Ctenoides concentricus) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.

...which you can then filter as needed, e.g.:

puts explanation.select { |e| e =~ /would match/ }.join "\n" ; 1

In this case, it's blank, which suggests that none of these entries would have matched this page. ...Which is ... interesting...

JRice commented 8 years ago

So the result that surprises me is that Ctenoides Mikkelsen & Bieler (Ctenoides) => 55192166 from OBIS doesn't match here.

obis_entry = concept.entry(55192166)
exp = concept.explain_relationships(obis_entry) ; 1
puts exp.join("\n") ; 1

yields:

It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Initial BioLib.cz Import` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Initial BioLib.cz Import` named `Ctenoides Mörch 1853` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Initial BioLib.cz Import` named `Strophomorphus ctenoides Pelletier, 1999` (canonical: Strophomorphus ctenoides) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Initial BioLib.cz Import` named `Ctenoides concentrica G. B. Sowerby III, 1888` (canonical: Ctenoides concentrica) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Initial BioLib.cz Import` named `Ctenoides japonica R. W. Dunker` (canonical: Ctenoides japonica) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Initial BioLib.cz Import` named `Ctenoides philippinarum Masahito & Habe, 1972` (canonical: Ctenoides philippinarum) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Initial BioLib.cz Import` named `Ctenoides suavis P. Masahito, T. Kuroda & T. Habe 1971` (canonical: Ctenoides suavis) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Initial BioLib.cz Import` named `Ctenoides scabra glacialis J. F. Gmelin, 1791` (canonical: Ctenoides scabra glacialis) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `AquaMaps Resource` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `OBIS depth range resource` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `OBIS depth range resource` named `Ctenoides barnardi` (canonical: Ctenoides barnardi) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `OBIS depth range resource` named `Ctenoides divaricata` (canonical: Ctenoides divaricata) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `OBIS depth range resource` named `Ctenoides miamiensis` (canonical: Ctenoides miamiensis) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `OBIS depth range resource` named `Ctenoides nimbifer` (canonical: Ctenoides nimbifer) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `OBIS depth range resource` named `Ctenoides sanctipauli` (canonical: Ctenoides sanctipauli) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `OBIS depth range resource` named `Ctenoides suavis` (canonical: Ctenoides suavis) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `DiscoverLife resource` named `Ctenoides concentricus` (canonical: Ctenoides concentricus) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `DiscoverLife resource` named `Ctenoides miamiensis` (canonical: Ctenoides miamiensis) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `DiscoverLife resource` named `Ctenoides sanctipauli` (canonical: Ctenoides sanctipauli) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `NMNH image collection resource` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `NMNH image collection resource` named `Ctenoides Miamiensis Mikkelsen & Bieler, 2003` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `BOLD Systems Resource` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. The ancestry was empty. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Moorea Biocode` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. The ancestry was empty. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Taxonomic Hierarchy of COL-China 2012` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `NCBI Taxonomy` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Species 2000 & ITIS Catalogue of Life: April 2013` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `DiscoverLife resource` named `Ctenoides concentrica` (canonical: Ctenoides concentrica) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `iNaturalist Images` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `OBIS Environmental Information` named `Ctenoides miamiensis Mikkelsen & Bieler` (canonical: Ctenoides miamiensis) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `OBIS Environmental Information` named `Ctenoides nimbifer Iredale, 1924` (canonical: Ctenoides nimbifer) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `OBIS Environmental Information` named `Ctenoides barnardi Kilburn` (canonical: Ctenoides barnardi) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `OBIS Environmental Information` named `Ctenoides suavis Masahito, 1971` (canonical: Ctenoides suavis) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `OBIS Environmental Information` named `Ctenoides divaricata (Barnard)` (canonical: Ctenoides divaricata) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `OBIS Environmental Information` named `Ctenoides sanctipauli Stuardo` (canonical: Ctenoides sanctipauli) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `McClain Bivalve Sizes` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `McClain Bivalve Sizes` named `Ctenoides miamiensis` (canonical: Ctenoides miamiensis) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Wikipedia` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Inventaire National du Patrimoine Naturel` named `Ctenoides sanctipauli Stuardo, 1968` (canonical: Ctenoides sanctipauli) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Femorale resource` named `Ctenoides concentricus (Sowerby, 1888)` (canonical: Ctenoides concentricus) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Femorale resource` named `Ctenoides philippinarum (Masahito & Habe, 1972)` (canonical: Ctenoides philippinarum) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Femorale resource` named `Ctenoides suavis Masahito, Kuroda & Habe, 1971` (canonical: Ctenoides suavis) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Femorale resource` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Paleobiology Database` named `Ctenoides Mörch 1853` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Global Biotic Interactions` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Global Biotic Interactions` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Smithsonian type specimen data` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. Ancestry was scored with a multiplier of 0.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `Smithsonian type specimen data` named `Ctenoides miamiensis Mikkelsen & Bieler` (canonical: Ctenoides miamiensis) with a confidence of 0. The names did not match. The synonyms were checked, but did not match.
It looks like the entry from `OBIS Environmental Information` named `Ctenoides Mikkelsen & Bieler` (canonical: Ctenoides) would NOT match the entry from `EOL Group on Flickr` named `Ctenoides` (canonical: Ctenoides) with a confidence of 0. The canonical names matched. The kingdoms did NOT match. Non-kingdom ancestors did NOT match. The ancestry was empty. Ancestry was scored with a multiplier of 0.
JRice commented 8 years ago

Ahhh... that explains it: the kingdoms don't match. The kingdom on Obis is nil...that's why (in fact, the entry has NO ancestors). The new code doesn't like these "floating" entries and won't match them. ...I'm not sure that's desirable... we should talk about it.

jhammock commented 8 years ago

Sorry I missed you! I have read this. It is friggin awesome :) The question of merging sad orphan names without ancestors but with perfect string matches is a good one. @KatjaSchulz should weigh in. Factors to consider:

Can we orphan in the dubious cases that are merged already but don't qualify any more? How many are there? How easy would it be for us (and/or partners ) to fill in ancestry? <- real solution