Closed: elizagrames closed this issue 5 years ago
Token-based matching is really slow; there has to be an existing matching algorithm that can deduplicate the results much more quickly.
Deduplication should also be reciprocal if a token approach is used (if record A matches record B, then B must also match A), though tokens are probably not a good approach in the first place; see the sketch below.
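A minimal sketch of a faster, symmetric alternative: compute pairwise edit distances on normalized titles with base R's `adist()` and drop any record whose title falls within a small distance threshold of an earlier kept record. The data frame `refs`, the `title` column, and the `0.1` threshold are illustrative assumptions, not part of this thread.

```r
# Sketch of symmetric, distance-based deduplication (assumptions:
# a data frame `refs` with a `title` column; threshold is illustrative).
dedupe_titles <- function(refs, threshold = 0.1) {
  titles <- tolower(gsub("[[:punct:]]", "", refs$title))
  # adist() returns a symmetric matrix of edit distances, so matching
  # is reciprocal by construction (d[i, j] == d[j, i]).
  d <- adist(titles) / outer(nchar(titles), nchar(titles), pmax)
  keep <- rep(TRUE, length(titles))
  for (i in seq_along(titles)[-1]) {
    # Drop record i if it is within the threshold of any earlier kept record.
    earlier <- seq_len(i - 1)
    if (any(d[i, earlier][keep[earlier]] < threshold)) {
      keep[i] <- FALSE
    }
  }
  refs[keep, , drop = FALSE]
}
```

Because the distance matrix is symmetric, reciprocity comes for free. The full n × n matrix is still quadratic in memory, so for very large result sets a blocking or sorted-neighborhood pass before comparison would be the next step.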
This is now covered by the synthesisr package.