CouncilDataProject / cdp-data

Data Utilities and Processing Generalized for All CDP Instances
https://councildataproject.org/cdp-data
MIT License
5 stars 4 forks source link

feature/semantic-sim-history #14

Closed evamaxfield closed 2 years ago

evamaxfield commented 2 years ago

Resolves #11

Adds two functions:

  1. Compute the max, min, and mean semantic similarity of all sentences in each event given a datetime range
  2. Plot this data over time with the roughly the same specification as the previous ngrams plotting.

The computation function requires extra dependencies. These can be installed with pip install cdp-data[transformers]

codecov[bot] commented 2 years ago

Codecov Report

Merging #14 (617f12b) into main (9de461c) will increase coverage by 7.23%. The diff coverage is 94.11%.

@@            Coverage Diff             @@
##             main      #14      +/-   ##
==========================================
+ Coverage   63.10%   70.34%   +7.23%     
==========================================
  Files          15       18       +3     
  Lines         515      671     +156     
==========================================
+ Hits          325      472     +147     
- Misses        190      199       +9     
Impacted Files Coverage Δ
cdp_data/datasets.py 89.43% <ø> (+0.70%) :arrow_up:
cdp_data/utils/db_utils.py 100.00% <ø> (ø)
cdp_data/keywords.py 83.24% <91.66%> (+2.81%) :arrow_up:
cdp_data/utils/incremental_average.py 93.44% <93.44%> (ø)
cdp_data/tests/test_keywords.py 100.00% <100.00%> (ø)
cdp_data/tests/utils/__init__.py 100.00% <100.00%> (ø)
cdp_data/tests/utils/test_incremental_average.py 100.00% <100.00%> (ø)

Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.