neomatrix369 / nlp_profiler

A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data, NLP Profiler will return either high-level insights or low-level/granular statistical information about the text in that column.
Other
243 stars 37 forks source link

Add granular feature: syllables count applied to text #61

Closed neomatrix369 closed 2 years ago

neomatrix369 commented 3 years ago

To be able to merge a pull request, there are a few checks:

Checklist

Please check the options that you have completed and strike-out the options that do not apply via this pull request:

Goal or purpose of the PR

Added new feature(s) to the granular features groups:

Changes implemented in the PR

codecov[bot] commented 3 years ago

Codecov Report

Merging #61 (7a01d6f) into master (a1c112d) will not change coverage. The diff coverage is 100.00%.

:exclamation: Current head 7a01d6f differs from pull request most recent head 3203597. Consider uploading reports for the commit 3203597 to get more accurate results Impacted file tree graph

@@            Coverage Diff            @@
##            master       #61   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files           25        26    +1     
  Lines          490       498    +8     
  Branches        73        74    +1     
=========================================
+ Hits           490       498    +8     
Impacted Files Coverage Δ
nlp_profiler/constants.py 100.00% <100.00%> (ø)
nlp_profiler/granular_features/__init__.py 100.00% <100.00%> (ø)
nlp_profiler/granular_features/syllables.py 100.00% <100.00%> (ø)

Continue to review full report at Codecov.

Legend - Click here to learn more Δ = absolute <relative> (impact), ø = not affected, ? = missing data Powered by Codecov. Last update db094b9...3203597. Read the comment docs.

sourcery-ai[bot] commented 3 years ago

Sourcery Code Quality Report

❌  Merging this PR will decrease code quality in the affected files by 0.31%.

Quality metrics Before After Change
Complexity 0.00 ⭐ 0.00 ⭐ 0.00
Method Length 108.33 🙂 111.67 🙂 3.34 👎
Working memory 5.88 ⭐ 5.94 ⭐ 0.06 👎
Quality 78.75% 78.44% -0.31% 👎
Other metrics Before After Change
Lines 131 135 4
Changed files Quality Before Quality After Quality Change
nlp_profiler/constants.py 80.50% ⭐ 80.17% ⭐ -0.33% 👎
nlp_profiler/granular_features/init.py 51.29% 🙂 50.77% 🙂 -0.52% 👎

Here are some functions in these files that still need a tune-up:

File Function Complexity Length Working Memory Quality Recommendation
nlp_profiler/granular_features/init.py apply_granular_features 0 ⭐ 114 🙂 53 ⛔ 50.77% 🙂 Extract out complex expressions

Legend and Explanation

The emojis denote the absolute quality of the code:

The 👍 and 👎 indicate whether the quality has improved or gotten worse with this pull request.


Please see our documentation here for details on how these metrics are calculated.

We are actively working on this report - lots more documentation and extra metrics to come!

Let us know what you think of it by mentioning @sourcery-ai in a comment.