GlobalNamesArchitecture / gni

Global Names Index
http://wiki.github.com/GlobalNamesArchitecture/gni

Figure out how to do multiple datasource name resolution #26

Open dimus opened 12 years ago

dimus commented 12 years ago

Right now we search all selected data sources at every step. For example, if we search 5 data sources and a name has an exact string match in only one of them, it drops out of the search pipeline and is not checked by canonical or fuzzy search, even if it could be found that way in the other 4 data sources. We have to figure out whether this is what we want or whether we want a different approach.
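To make the current behavior concrete, here is a minimal sketch. It is not the GNI code: the toy `index`, the crude `canonical` helper (first two words of a name), and `resolve_current` are all hypothetical names made up for illustration, and the fuzzy step is left out to keep it short.

```python
from typing import Dict, List, Set

# Hypothetical toy index: data source id -> name strings it contains.
Index = Dict[int, Set[str]]


def canonical(name: str) -> str:
    """Crude stand-in for a canonical form: keep the first two words."""
    return " ".join(name.split()[:2])


def exact_match(index: Index, source_id: int, name: str) -> bool:
    return name in index[source_id]


def canonical_match(index: Index, source_id: int, name: str) -> bool:
    return any(canonical(n) == canonical(name) for n in index[source_id])


# Steps run in order; a fuzzy step would be appended here.
STEPS = [("exact", exact_match), ("canonical", canonical_match)]


def resolve_current(index: Index, source_ids: List[int], name: str) -> Dict[int, str]:
    """Run each step across all sources; stop at the first step with any hit."""
    for step_name, matcher in STEPS:
        hits = {sid: step_name for sid in source_ids if matcher(index, sid, name)}
        if hits:
            # The name leaves the pipeline here, so later steps never run
            # for the sources that had no hit at this step.
            return hits
    return {}


index = {
    1: {"Homo sapiens Linnaeus, 1758"},
    2: {"Homo sapiens L."},
}
result = resolve_current(index, [1, 2], "Homo sapiens Linnaeus, 1758")
print(result)
```

Source 2 holds a name with the same canonical form, but it is never reported because the exact match in source 1 ends the search.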

Least surprise behavior.

If I submitted names and several data sources, I would expect to get back the union of the results for all of these data sources, the same as I would get by resolving each name against each data source separately. However, this significantly increases the load on the system. With limited resources it might make sense to settle on a more economical approach.

Possible solutions:

  1. Bite the bullet and search thoroughly in each and every data source
  2. Do what we do now
  3. Allow searching only 1 data source at a time

An independent option for all 3 would be to use the whole GNI data to find names that were not found in the selected data sources.
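A rough sketch of option 1 combined with the whole-GNI fallback might look like the following. Everything here is an assumption for illustration, not existing GNI code: the toy `index`, the simplistic `canonical` helper, and the `resolve_thorough` and `resolve_with_fallback` names are invented, and the fuzzy step is again omitted.

```python
from typing import Dict, List, Optional, Set

# Hypothetical toy index: data source id -> name strings it contains.
Index = Dict[int, Set[str]]


def canonical(name: str) -> str:
    """Crude stand-in for a canonical form: keep the first two words."""
    return " ".join(name.split()[:2])


def best_match(names: Set[str], name: str) -> Optional[str]:
    """Return the strongest match type for one data source, or None."""
    if name in names:
        return "exact"
    if any(canonical(n) == canonical(name) for n in names):
        return "canonical"
    return None


def resolve_thorough(index: Index, source_ids: List[int], name: str) -> Dict[int, str]:
    """Option 1: run the full cascade against every source independently."""
    results = {}
    for sid in source_ids:
        kind = best_match(index[sid], name)
        if kind is not None:
            results[sid] = kind
    return results


def resolve_with_fallback(index: Index, source_ids: List[int], name: str) -> Dict[int, str]:
    """If nothing is found in the selected sources, try the rest of the index."""
    results = resolve_thorough(index, source_ids, name)
    if not results:
        others = [sid for sid in index if sid not in source_ids]
        results = resolve_thorough(index, others, name)
    return results


index = {
    1: {"Homo sapiens Linnaeus, 1758"},
    2: {"Homo sapiens L."},
    3: {"Pan troglodytes"},
}
thorough = resolve_thorough(index, [1, 2], "Homo sapiens Linnaeus, 1758")
fallback = resolve_with_fallback(index, [3], "Homo sapiens L.")
print(thorough)
print(fallback)
```

The cost is one full cascade per data source, which is the extra load mentioned above. The fallback only runs for names that found nothing in the selected sources.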