jsoup 1.14.1 is out now, with simple request session management, increased parse robustness, and a ton of other improvements, speed-ups, and bug fixes.
See the full announcement for all the details on what's changed.
Bugfix: corrected a potential case of the parser input stream not being closed immediately on a read exception.
Bugfix: when making a HTTP POST, if the request write fails, make sure the connection is immediately cleaned up.
Bugfix: updated the HtmlTreeParser resetInsertionMode to the current spec for supported elements
jhy/jsoup#1491
Bugfix [Fuzz]: fixed a slow parse when a tag has thousands of null characters in it.
jhy/jsoup#1580
Bugfix [Fuzz]: the adoption agency algorithm can have an incorrect bookmark position
jhy/jsoup#1576
Bugfiz [Fuzz]: malformed HTML could result in null elements on stack
jhy/jsoup#1579
Bugfix [Fuzz]: malformed deeply nested table elements could create a stack overflow.
jhy/jsoup#1577
*** Release 1.14.1 [2021-Jul-10]
Change: updated the minimum supported Java version from Java 7 to Java 8.
Change: updated the minimum Android API level from 8 to 10.
Change: although Node#childNodes() returns an UnmodifiableList as a view into its children, it was still
directly backed by the internal child list. That made some uses, such as looping and moving those children to
another element, throw a ConcurrentModificationException. Now this method returns its own list so that they are
separated and changes to the parent's contents will not impact the children view. This aligns with similar methods
such as Element#children(). If you have code that iterates this list and makes parenting changes to its contents,
you may need to make a code update.
jhy/jsoup#1431
Change: the org.jsoup.Connection interface has been modified to introduce new methods for sessions and the cookie
store. If you have a custom implementation of this interface, you will need to add implementations of these methods.
Improvement: added HTTP request session management support with Jsoup.newSession(). This extends the Connection
implementation to support (optional) sessions, which allow request defaults (timeout, proxy, etc) to be set once and
then applied to all requests within that session.
Cookies are re-implemented to correctly support path and domain filtering when used within a session. A default
in-memory cookie store is used for the session, or a custom implementation (perhaps disk-persistent, or pre-set)
can be used instead.
Forms submitted using the FormElement#submit() use the same session that was used to fetch the document and so pass
cookies and other defaults appropriately.
The session is multi-thread safe and can execute multiple requests concurrently. If the user accidentally tries to
Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.
Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:
- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it
- `@dependabot merge` will merge this PR after your CI passes on it
- `@dependabot squash and merge` will squash and merge this PR after your CI passes on it
- `@dependabot cancel merge` will cancel a previously requested merge and block automerging
- `@dependabot reopen` will reopen this PR if it is closed
- `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
- `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
- `@dependabot use these labels` will set the current labels as the default for future PRs for this repo and language
- `@dependabot use these reviewers` will set the current reviewers as the default for future PRs for this repo and language
- `@dependabot use these assignees` will set the current assignees as the default for future PRs for this repo and language
- `@dependabot use this milestone` will set the current milestone as the default for future PRs for this repo and language
- `@dependabot badge me` will comment on this PR with code to add a "Dependabot enabled" badge to your readme
Additionally, you can set the following in your Dependabot [dashboard](https://app.dependabot.com):
- Update frequency (including time of day and day of week)
- Pull request limits (per update run and/or open at any time)
- Out-of-range updates (receive only lockfile updates, if desired)
- Security updates (receive only security updates, if desired)
Bumps jsoup from 1.8.3 to 1.14.1.
Release notes
Sourced from jsoup's releases.
Changelog
Sourced from jsoup's changelog.
... (truncated)
Commits
b24f2e4
[maven-release-plugin] prepare release jsoup-1.14.10bd5885
Release prep changelog updatef034561
Code cleanup0a5a7ef
Minor cleanupcedf83c
Cleanup UTF BOM recognition87085a8
Changelog for #14828db724e
resolve abnormal urls in compliance with rfc3986 (#1482)661523f
Changelog for adoption agency fixe45e53c
Complete adoption agency algorithm (#1517)f49f92c
More robust prevention of XML Declaration recursionDependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting
@dependabot rebase
.Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) - `@dependabot use these labels` will set the current labels as the default for future PRs for this repo and language - `@dependabot use these reviewers` will set the current reviewers as the default for future PRs for this repo and language - `@dependabot use these assignees` will set the current assignees as the default for future PRs for this repo and language - `@dependabot use this milestone` will set the current milestone as the default for future PRs for this repo and language - `@dependabot badge me` will comment on this PR with code to add a "Dependabot enabled" badge to your readme Additionally, you can set the following in your Dependabot [dashboard](https://app.dependabot.com): - Update frequency (including time of day and day of week) - Pull request limits (per update run and/or open at any time) - Out-of-range updates (receive only lockfile updates, if desired) - Security updates (receive only security updates, if desired)