Ousret/charset_normalizer (charset-normalizer)
### [`v3.4.0`](https://redirect.github.com/Ousret/charset_normalizer/blob/HEAD/CHANGELOG.md#340-2024-10-08)
[Compare Source](https://redirect.github.com/Ousret/charset_normalizer/compare/3.3.2...3.4.0)
##### Added
- Argument `--no-preemptive` in the CLI to prevent the detector to search for hints.
- Support for Python 3.13 ([#512](https://redirect.github.com/Ousret/charset_normalizer/issues/512))
##### Fixed
- Relax the TypeError exception thrown when trying to compare a CharsetMatch with anything else than a CharsetMatch.
- Improved the general reliability of the detector based on user feedbacks. ([#520](https://redirect.github.com/Ousret/charset_normalizer/issues/520)) ([#509](https://redirect.github.com/Ousret/charset_normalizer/issues/509)) ([#498](https://redirect.github.com/Ousret/charset_normalizer/issues/498)) ([#407](https://redirect.github.com/Ousret/charset_normalizer/issues/407)) ([#537](https://redirect.github.com/Ousret/charset_normalizer/issues/537))
- Declared charset in content (preemptive detection) not changed when converting to utf-8 bytes. ([#381](https://redirect.github.com/Ousret/charset_normalizer/issues/381))
### [`v3.3.2`](https://redirect.github.com/Ousret/charset_normalizer/blob/HEAD/CHANGELOG.md#332-2023-10-31)
[Compare Source](https://redirect.github.com/Ousret/charset_normalizer/compare/3.3.1...3.3.2)
##### Fixed
- Unintentional memory usage regression when using large payload that match several encoding ([#376](https://redirect.github.com/Ousret/charset_normalizer/issues/376))
- Regression on some detection case showcased in the documentation ([#371](https://redirect.github.com/Ousret/charset_normalizer/issues/371))
##### Added
- Noise (md) probe that identify malformed arabic representation due to the presence of letters in isolated form (credit to my wife)
### [`v3.3.1`](https://redirect.github.com/Ousret/charset_normalizer/blob/HEAD/CHANGELOG.md#331-2023-10-22)
[Compare Source](https://redirect.github.com/Ousret/charset_normalizer/compare/3.3.0...3.3.1)
##### Changed
- Optional mypyc compilation upgraded to version 1.6.1 for Python >= 3.8
- Improved the general detection reliability based on reports from the community
### [`v3.3.0`](https://redirect.github.com/Ousret/charset_normalizer/blob/HEAD/CHANGELOG.md#330-2023-09-30)
[Compare Source](https://redirect.github.com/Ousret/charset_normalizer/compare/3.2.0...3.3.0)
##### Added
- Allow to execute the CLI (e.g. normalizer) through `python -m charset_normalizer.cli` or `python -m charset_normalizer`
- Support for 9 forgotten encoding that are supported by Python but unlisted in `encoding.aliases` as they have no alias ([#323](https://redirect.github.com/Ousret/charset_normalizer/issues/323))
##### Removed
- (internal) Redundant utils.is_ascii function and unused function is_private_use_only
- (internal) charset_normalizer.assets is moved inside charset_normalizer.constant
##### Changed
- (internal) Unicode code blocks in constants are updated using the latest v15.0.0 definition to improve detection
- Optional mypyc compilation upgraded to version 1.5.1 for Python >= 3.7
##### Fixed
- Unable to properly sort CharsetMatch when both chaos/noise and coherence were close due to an unreachable condition in \__lt\_\_ ([#350](https://redirect.github.com/Ousret/charset_normalizer/issues/350))
### [`v3.2.0`](https://redirect.github.com/Ousret/charset_normalizer/blob/HEAD/CHANGELOG.md#320-2023-06-07)
[Compare Source](https://redirect.github.com/Ousret/charset_normalizer/compare/3.1.0...3.2.0)
##### Changed
- Typehint for function `from_path` no longer enforce `PathLike` as its first argument
- Minor improvement over the global detection reliability
##### Added
- Introduce function `is_binary` that relies on main capabilities, and optimized to detect binaries
- Propagate `enable_fallback` argument throughout `from_bytes`, `from_path`, and `from_fp` that allow a deeper control over the detection (default True)
- Explicit support for Python 3.12
##### Fixed
- Edge case detection failure where a file would contain 'very-long' camel cased word (Issue [#289](https://redirect.github.com/Ousret/charset_normalizer/issues/289))
Configuration
📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
[ ] If you want to rebase/retry this PR, check this box
This PR contains the following updates:
==3.1.0
->==3.4.0
Release Notes
Ousret/charset_normalizer (charset-normalizer)
### [`v3.4.0`](https://redirect.github.com/Ousret/charset_normalizer/blob/HEAD/CHANGELOG.md#340-2024-10-08) [Compare Source](https://redirect.github.com/Ousret/charset_normalizer/compare/3.3.2...3.4.0) ##### Added - Argument `--no-preemptive` in the CLI to prevent the detector to search for hints. - Support for Python 3.13 ([#512](https://redirect.github.com/Ousret/charset_normalizer/issues/512)) ##### Fixed - Relax the TypeError exception thrown when trying to compare a CharsetMatch with anything else than a CharsetMatch. - Improved the general reliability of the detector based on user feedbacks. ([#520](https://redirect.github.com/Ousret/charset_normalizer/issues/520)) ([#509](https://redirect.github.com/Ousret/charset_normalizer/issues/509)) ([#498](https://redirect.github.com/Ousret/charset_normalizer/issues/498)) ([#407](https://redirect.github.com/Ousret/charset_normalizer/issues/407)) ([#537](https://redirect.github.com/Ousret/charset_normalizer/issues/537)) - Declared charset in content (preemptive detection) not changed when converting to utf-8 bytes. ([#381](https://redirect.github.com/Ousret/charset_normalizer/issues/381)) ### [`v3.3.2`](https://redirect.github.com/Ousret/charset_normalizer/blob/HEAD/CHANGELOG.md#332-2023-10-31) [Compare Source](https://redirect.github.com/Ousret/charset_normalizer/compare/3.3.1...3.3.2) ##### Fixed - Unintentional memory usage regression when using large payload that match several encoding ([#376](https://redirect.github.com/Ousret/charset_normalizer/issues/376)) - Regression on some detection case showcased in the documentation ([#371](https://redirect.github.com/Ousret/charset_normalizer/issues/371)) ##### Added - Noise (md) probe that identify malformed arabic representation due to the presence of letters in isolated form (credit to my wife) ### [`v3.3.1`](https://redirect.github.com/Ousret/charset_normalizer/blob/HEAD/CHANGELOG.md#331-2023-10-22) [Compare Source](https://redirect.github.com/Ousret/charset_normalizer/compare/3.3.0...3.3.1) ##### Changed - Optional mypyc compilation upgraded to version 1.6.1 for Python >= 3.8 - Improved the general detection reliability based on reports from the community ### [`v3.3.0`](https://redirect.github.com/Ousret/charset_normalizer/blob/HEAD/CHANGELOG.md#330-2023-09-30) [Compare Source](https://redirect.github.com/Ousret/charset_normalizer/compare/3.2.0...3.3.0) ##### Added - Allow to execute the CLI (e.g. normalizer) through `python -m charset_normalizer.cli` or `python -m charset_normalizer` - Support for 9 forgotten encoding that are supported by Python but unlisted in `encoding.aliases` as they have no alias ([#323](https://redirect.github.com/Ousret/charset_normalizer/issues/323)) ##### Removed - (internal) Redundant utils.is_ascii function and unused function is_private_use_only - (internal) charset_normalizer.assets is moved inside charset_normalizer.constant ##### Changed - (internal) Unicode code blocks in constants are updated using the latest v15.0.0 definition to improve detection - Optional mypyc compilation upgraded to version 1.5.1 for Python >= 3.7 ##### Fixed - Unable to properly sort CharsetMatch when both chaos/noise and coherence were close due to an unreachable condition in \__lt\_\_ ([#350](https://redirect.github.com/Ousret/charset_normalizer/issues/350)) ### [`v3.2.0`](https://redirect.github.com/Ousret/charset_normalizer/blob/HEAD/CHANGELOG.md#320-2023-06-07) [Compare Source](https://redirect.github.com/Ousret/charset_normalizer/compare/3.1.0...3.2.0) ##### Changed - Typehint for function `from_path` no longer enforce `PathLike` as its first argument - Minor improvement over the global detection reliability ##### Added - Introduce function `is_binary` that relies on main capabilities, and optimized to detect binaries - Propagate `enable_fallback` argument throughout `from_bytes`, `from_path`, and `from_fp` that allow a deeper control over the detection (default True) - Explicit support for Python 3.12 ##### Fixed - Edge case detection failure where a file would contain 'very-long' camel cased word (Issue [#289](https://redirect.github.com/Ousret/charset_normalizer/issues/289))Configuration
📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
This PR was generated by Mend Renovate. View the repository job log.