TutteInstitute / vectorizers

Vectorizers for a range of different data types
BSD 3-Clause "New" or "Revised" License

Fixed SkipGramVectorizer to filter tokens properly. #21

Status: Closed (closed by cjweir 4 years ago)

cjweir commented 4 years ago

I don't think SkipGramVectorizer was updated properly when token preprocessing switched to representing documents as sequences of ints. As a result it did not filter tokens correctly, which caused an issue when using it to vectorize documents.
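For readers unfamiliar with the failure mode, here is a minimal sketch of filtering tokens once documents are stored as sequences of ints. It is not the code from this PR or the library's actual implementation. The helper names `build_vocabulary` and `encode_and_filter` and the `min_occurrences` parameter are hypothetical, chosen only for illustration. The point is that filtering has to happen against the same vocabulary that defines the integer ids, so dropped tokens never leak into the encoded sequences.

```python
from collections import Counter


def build_vocabulary(token_sequences, min_occurrences=2):
    """Map each sufficiently frequent token to an integer id.

    Hypothetical helper: tokens seen fewer than `min_occurrences` times
    are left out of the vocabulary entirely.
    """
    counts = Counter(token for seq in token_sequences for token in seq)
    kept = sorted(tok for tok, count in counts.items() if count >= min_occurrences)
    return {tok: idx for idx, tok in enumerate(kept)}


def encode_and_filter(token_sequences, vocabulary):
    """Convert token sequences to int sequences, dropping unknown tokens."""
    return [
        [vocabulary[tok] for tok in seq if tok in vocabulary]
        for seq in token_sequences
    ]


docs = [
    ["the", "cat", "sat"],
    ["the", "dog", "sat"],
    ["a", "cat", "ran"],
]
vocab = build_vocabulary(docs)
encoded = encode_and_filter(docs, vocab)
print(vocab)
print(encoded)
```

Any downstream step that consumes `encoded` (for example, counting co-occurrences within a window) then only ever sees ids that exist in `vocab`.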

codecov-io commented 4 years ago

Codecov Report

Merging #21 into master will decrease coverage by 0.02%. The diff coverage is n/a.


@@            Coverage Diff             @@
##           master      #21      +/-   ##
==========================================
- Coverage   64.46%   64.44%   -0.03%     
==========================================
  Files           9        9              
  Lines        1694     1693       -1     
==========================================
- Hits         1092     1091       -1     
  Misses        602      602              
Impacted Files                 Coverage Δ
vectorizers/_vectorizers.py    88.59% <ø> (-0.03%)


Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data. Last update 54da3d6...420a0ed.