Super-confident predictions

Wikidata / soweego

Link Wikidata items to large catalogs

GNU General Public License v3.0

It seems that ensembles are commonly used in record linkage (RL) pipelines to predict whether two entities match. Ensembles are usually composed of diverse models (e.g., SVM + decision trees). A couple of examples:

In [1], a final ensemble is used to self-learn on a partially labelled dataset. In [2], multiple classifiers are used to predict a match, and the final classifier is selected based on its score and on how interpretable the model is.

[1] A novel ensemble learning approach to unsupervised record linkage (2017)
[2] Magellan: Toward Building Entity Matching Management Systems (2016)
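
To make the ensemble idea concrete, here is a minimal sketch of how the outputs of several diverse classifiers could be combined for a single candidate pair. It is not taken from soweego or from the papers above: the function names, the model names, and the scores are all hypothetical, and it only assumes that each model returns a match probability between 0 and 1.

```python
from statistics import mean


def soft_vote(probabilities):
    """Average the per-model match probabilities for one record pair."""
    return mean(probabilities)


def hard_vote(probabilities, threshold=0.5):
    """Return True when a strict majority of models predicts a match."""
    votes = sum(p >= threshold for p in probabilities)
    return votes * 2 > len(probabilities)


# Hypothetical scores from three different classifiers for the same pair:
# one model is very confident, the other two lean towards "no match".
pair_scores = {"svm": 0.99, "decision_tree": 0.40, "naive_bayes": 0.45}
scores = list(pair_scores.values())

soft = soft_vote(scores)  # mean probability across the three models
hard = hard_vote(scores)  # majority decision across the three models

print(f"soft vote: {soft:.2f}, hard vote: {hard}")
```

With these made-up numbers the two strategies disagree: the averaged probability is pulled above 0.5 by the single very confident model, while the majority vote rejects the match. Which behaviour is preferable would have to be checked on real data.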

Super-confident predictions #305