zinggAI / zingg

Scalable identity resolution, entity resolution, data mastering and deduplication using ML
GNU Affero General Public License v3.0
957 stars 120 forks source link

Performance testing with stop words #273

Open sonalgoyal opened 2 years ago

sonalgoyal commented 2 years ago

We need to assess if performance has been impacted by adding stop words.

sonalgoyal commented 2 years ago

let us run the febrl120k example first without stop words, then add stopwords to some of the fields and run again. please generate documentation using https://docs.zingg.ai/zingg/generatingdocumentation which will give you the stop words.