Closed renovate[bot] closed 1 year ago
Merging #30 (f4b608f) into main (58b032a) will not change coverage. The diff coverage is
n/a
.
@@ Coverage Diff @@
## main #30 +/- ##
=======================================
Coverage 98.71% 98.71%
=======================================
Files 3 3
Lines 312 312
=======================================
Hits 308 308
Misses 4 4
Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here.
This PR contains the following updates:
==2.0.12
->==3.0.1
Release Notes
Ousret/charset_normalizer
### [`v3.0.1`](https://togithub.com/Ousret/charset_normalizer/blob/HEAD/CHANGELOG.md#301-httpsgithubcomOusretcharsetnormalizercompare300301-2022-11-18) [Compare Source](https://togithub.com/Ousret/charset_normalizer/compare/3.0.0...3.0.1) ##### Fixed - Multi-bytes cutter/chunk generator did not always cut correctly (PR [#233](https://togithub.com/Ousret/charset_normalizer/issues/233)) ##### Changed - Speedup provided by mypy/c 0.990 on Python >= 3.7 ### [`v3.0.0`](https://togithub.com/Ousret/charset_normalizer/blob/HEAD/CHANGELOG.md#300-httpsgithubcomOusretcharsetnormalizercompare211300-2022-10-20) [Compare Source](https://togithub.com/Ousret/charset_normalizer/compare/2.1.1...3.0.0) ##### Added - Extend the capability of explain=True when cp_isolation contains at most two entries (min one), will log in details of the Mess-detector results - Support for alternative language frequency set in charset_normalizer.assets.FREQUENCIES - Add parameter `language_threshold` in `from_bytes`, `from_path` and `from_fp` to adjust the minimum expected coherence ratio - `normalizer --version` now specify if current version provide extra speedup (meaning mypyc compilation whl) ##### Changed - Build with static metadata using 'build' frontend - Make the language detection stricter - Optional: Module `md.py` can be compiled using Mypyc to provide an extra speedup up to 4x faster than v2.1 ##### Fixed - CLI with opt --normalize fail when using full path for files - TooManyAccentuatedPlugin induce false positive on the mess detection when too few alpha character have been fed to it - Sphinx warnings when generating the documentation ##### Removed - Coherence detector no longer return 'Simple English' instead return 'English' - Coherence detector no longer return 'Classical Chinese' instead return 'Chinese' - Breaking: Method `first()` and `best()` from CharsetMatch - UTF-7 will no longer appear as "detected" without a recognized SIG/mark (is unreliable/conflict with ASCII) - Breaking: Class aliases CharsetDetector, CharsetDoctor, CharsetNormalizerMatch and CharsetNormalizerMatches - Breaking: Top-level function `normalize` - Breaking: Properties `chaos_secondary_pass`, `coherence_non_latin` and `w_counter` from CharsetMatch - Support for the backport `unicodedata2` ### [`v2.1.1`](https://togithub.com/Ousret/charset_normalizer/blob/HEAD/CHANGELOG.md#211-httpsgithubcomOusretcharsetnormalizercompare210211-2022-08-19) [Compare Source](https://togithub.com/Ousret/charset_normalizer/compare/2.1.0...2.1.1) ##### Deprecated - Function `normalize` scheduled for removal in 3.0 ##### Changed - Removed useless call to decode in fn is_unprintable ([#206](https://togithub.com/Ousret/charset_normalizer/issues/206)) ##### Fixed - Third-party library (i18n xgettext) crashing not recognizing utf\_8 (PEP 263) with underscore from [@aleksandernovikov](https://togithub.com/aleksandernovikov) ([#204](https://togithub.com/Ousret/charset_normalizer/issues/204)) ### [`v2.1.0`](https://togithub.com/Ousret/charset_normalizer/blob/HEAD/CHANGELOG.md#210-httpsgithubcomOusretcharsetnormalizercompare2012210-2022-06-19) [Compare Source](https://togithub.com/Ousret/charset_normalizer/compare/2.0.12...2.1.0) ##### Added - Output the Unicode table version when running the CLI with `--version` (PR [#194](https://togithub.com/Ousret/charset_normalizer/issues/194)) ##### Changed - Re-use decoded buffer for single byte character sets from [@nijel](https://togithub.com/nijel) (PR [#175](https://togithub.com/Ousret/charset_normalizer/issues/175)) - Fixing some performance bottlenecks from [@deedy5](https://togithub.com/deedy5) (PR [#183](https://togithub.com/Ousret/charset_normalizer/issues/183)) ##### Fixed - Workaround potential bug in cpython with Zero Width No-Break Space located in Arabic Presentation Forms-B, Unicode 1.1 not acknowledged as space (PR [#175](https://togithub.com/Ousret/charset_normalizer/issues/175)) - CLI default threshold aligned with the API threshold from [@oleksandr-kuzmenko](https://togithub.com/oleksandr-kuzmenko) (PR [#181](https://togithub.com/Ousret/charset_normalizer/issues/181)) ##### Removed - Support for Python 3.5 (PR [#192](https://togithub.com/Ousret/charset_normalizer/issues/192)) ##### Deprecated - Use of backport unicodedata from `unicodedata2` as Python is quickly catching up, scheduled for removal in 3.0 (PR [#194](https://togithub.com/Ousret/charset_normalizer/issues/194))Configuration
📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
â™» Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
This PR has been generated by Mend Renovate. View repository job log here.