The Cleaner() now scans for hidden JavaScript code embedded
within CSS comments. In certain contexts, such as within <svg> or <math> tags,
<style> tags may lose their intended function, allowing comments
like /* foo */ to potentially be executed by the browser.
If a suspicious content is detected, only the comment is removed.
0.3.1 (2024-10-09)
Features added
Do not parse URL addresses when it is not necessary.
0.3.0 (2024-10-09)
Features added
Parsing of URL addresses has been enhanced and Cleaner removes ambiguous URLs.
0.2.2 (2024-08-30)
Bugs fixed
sdist now includes all test files and changelog.
0.2.1 (2024-08-29)
Bugs fixed
Memory efficiency is now much better for HTML pages where cleaner removes
a lot of elements. (#14)
0.2.0 (2024-07-29)
Features added
... (truncated)
Commits
a074425 Remove only the CSS comment if a suspicious content is detected
Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.
Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:
- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it
- `@dependabot merge` will merge this PR after your CI passes on it
- `@dependabot squash and merge` will squash and merge this PR after your CI passes on it
- `@dependabot cancel merge` will cancel a previously requested merge and block automerging
- `@dependabot reopen` will reopen this PR if it is closed
- `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
- `@dependabot show ignore conditions` will show all of the ignore conditions of the specified dependency
- `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
You can disable automated security fix PRs for this repo from the [Security Alerts page](https://github.com/dmm-com/pagoda/network/alerts).
Bumps lxml-html-clean from 0.1.1 to 0.4.0.
Changelog
Sourced from lxml-html-clean's changelog.
... (truncated)
Commits
a074425
Remove only the CSS comment if a suspicious content is detected90bcfa8
Release 0.4.03b644e9
Scan for JS code also in CSS commentsdcbc163
Release 0.3.195455db
When host_whitelist is empty, don't bother parsing the URLs in allow_embedded...88973ec
Release 0.3.08b3c612
Raise warning for unstable URL parsing8ce436d
Improve documentation about parsing URLs in lxml_html_clean.0d1a6e1
sdist now includes all test files and changelogcbb88d9
Release 0.2.1Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting
@dependabot rebase
.Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show