TutteInstitute / vectorizers

Vectorizers for a range of different data types
BSD 3-Clause "New" or "Revised" License
93 stars 23 forks source link

Document_context EM #65

Closed cjweir closed 3 years ago

cjweir commented 3 years ago

This includes document context EM, document context token masking, and the ability to normalize windows via L_1 or Bayesian normalization.

codecov-commenter commented 3 years ago

Codecov Report

Merging #65 (6400af9) into master (1d40f9d) will decrease coverage by 0.48%. The diff coverage is 32.55%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master      #65      +/-   ##
==========================================
- Coverage   64.03%   63.55%   -0.49%     
==========================================
  Files          19       19              
  Lines        2981     3054      +73     
==========================================
+ Hits         1909     1941      +32     
- Misses       1072     1113      +41     
Impacted Files Coverage Δ
vectorizers/coo_utils.py 7.77% <0.00%> (ø)
vectorizers/token_cooccurrence_vectorizer.py 51.84% <7.01%> (-4.26%) :arrow_down:
vectorizers/utils.py 49.09% <50.00%> (+3.17%) :arrow_up:
vectorizers/tests/test_common.py 99.79% <100.00%> (+<0.01%) :arrow_up:

Continue to review full report at Codecov.

Legend - Click here to learn more Δ = absolute <relative> (impact), ø = not affected, ? = missing data Powered by Codecov. Last update 1d40f9d...6400af9. Read the comment docs.