sfu-db / dataprep

Open-source low code data preparation library in python. Collect, clean and visualization your data in python with a few lines of code.
http://dataprep.ai
MIT License
1.99k stars 203 forks source link

fix(clean): remove usaddress library #815

Closed qidanrui closed 2 years ago

qidanrui commented 2 years ago

Description

Remove the usaddress library and include their code in our library. Related to the issue #795

How Has This Been Tested?

See the snapshot

Snapshots:

Screen Shot 2022-02-15 at 12 13 50 AM Screen Shot 2022-02-15 at 12 14 11 AM

Checklist:

codecov[bot] commented 2 years ago

Codecov Report

Merging #815 (39580df) into develop (1be5fce) will increase coverage by 0.20%. The diff coverage is 82.85%.

Impacted file tree graph

@@             Coverage Diff             @@
##           develop     #815      +/-   ##
===========================================
+ Coverage    55.11%   55.31%   +0.20%     
===========================================
  Files          293      293              
  Lines        18855    18989     +134     
===========================================
+ Hits         10391    10503     +112     
- Misses        8464     8486      +22     
Impacted Files Coverage Δ
dataprep/clean/clean_lat_long.py 86.22% <ø> (ø)
dataprep/clean/clean_address.py 95.45% <50.00%> (-0.04%) :arrow_down:
dataprep/clean/clean_date_utils.py 76.84% <81.25%> (+0.25%) :arrow_up:
dataprep/clean/address_utils.py 84.25% <83.33%> (-15.75%) :arrow_down:
dataprep/clean/clean_country.py 93.75% <100.00%> (ø)
dataprep/eda/distribution/compute/overview.py 94.05% <0.00%> (+0.54%) :arrow_up:

Continue to review full report at Codecov.

Legend - Click here to learn more Δ = absolute <relative> (impact), ø = not affected, ? = missing data Powered by Codecov. Last update 5b3a9f4...39580df. Read the comment docs.