ajenhl / tacl

Tool for performing basic text analysis on the CBETA corpus
GNU General Public License v3.0
30 stars 9 forks source link

Improve performance when re-adding texts #26

Closed ajenhl closed 9 years ago

ajenhl commented 10 years ago

When adding n-grams to a database, a text that already has its n-grams in the database still has its n-grams generated. Move the check to before the n-gram generation to improve performance.