mediacloud / metadata-lib

How Media Cloud approaches extracting metadata from online news stories
Apache License 2.0

Update htmldate requirement from ==1.6.* to >=1.6,<1.8 #75

Closed dependabot[bot] closed 10 months ago

dependabot[bot] commented 10 months ago

Updates the requirements on htmldate to permit the latest version.
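To make the effect of the change concrete, here is a minimal sketch comparing the old pin (`==1.6.*`) with the new range (`>=1.6,<1.8`). The helper names `parse`, `matches_old_pin`, and `matches_new_range` are hypothetical and used only for illustration. The sketch assumes plain dotted numeric versions such as `1.7.0`; real installers apply the full PEP 440 rules, which also cover pre-releases and other forms.

```python
from __future__ import annotations


def parse(version: str) -> tuple[int, ...]:
    """Turn a plain dotted version such as '1.7.0' into (1, 7, 0)."""
    return tuple(int(part) for part in version.split("."))


def matches_old_pin(version: str) -> bool:
    """Old requirement ==1.6.*: the first two components must be 1.6."""
    return parse(version)[:2] == (1, 6)


def matches_new_range(version: str) -> bool:
    """New requirement >=1.6,<1.8: at least 1.6 and strictly below 1.8."""
    return (1, 6) <= parse(version) < (1, 8)


if __name__ == "__main__":
    # Versions taken from the changelog in this PR, plus a hypothetical 1.8.0
    # to show the upper bound.
    for candidate in ["1.5.2", "1.6.0", "1.6.1", "1.7.0", "1.8.0"]:
        print(candidate, matches_old_pin(candidate), matches_new_range(candidate))
```

Under these assumptions, 1.7.0 is rejected by the old pin but accepted by the new range, which is the point of this update.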

Release notes

Sourced from htmldate's releases.

htmldate-1.7.0

  • fix meta property updated vs. original behavior (#121)
  • support for LXML version 5.0+ (#127)
  • fix image links in Readme

Changelog

Sourced from htmldate's changelog.

1.7.0

  • fix meta property updated vs. original behavior (#121)
  • support for LXML version 5.0+ (#127)
  • fix image links in Readme

1.6.1

1.6.0

  • focus on precision, stricter extraction patterns (#103, #105, #106, #112)
  • simplified code base (#108, #109)
  • replaced lxml.html.Cleaner (#104)
  • extended evaluation

1.5.2

  • fix for missing months keys in custom extractor (#100)
  • fix for None in try_date_expr() (#101)

1.5.1

  • fix regression for fast extraction introduced in e8b3538 (#96)
  • fix setup by making backports-datetime-fromisoformat optional (#95)

1.5.0

  • slightly higher accuracy with revised heuristics
  • simplified code structure for better performance
  • setup: support for 3.12, fromisoformat backport if applicable
  • HTML parsing fixes: more lenient parsing, pinned LXML version for MacOS

1.4.3

  • maintenance release: upgrade urllib3 dependency

1.4.2

  • support min_date/max_date as datetimes or datetime strings with @kernc (#73)
  • add date attributes to HTML extraction with @kernc (#74)
  • fix for extraction of updated and original dates in time elements
  • code refactoring and maintenance

1.4.1

  • better coverage of relevant HTML attributes
  • automatically define upper time bound at each function call (#70)
  • reviewed and simplified extraction code
  • cache validation for format diverging from %Y-%m-%d
  • updated dependencies and removed real-world tests from package

1.4.0

  • additional search of free text in whole document (#67)
  • optional parameter for subdaily precision with @getorca (#66)
  • fix for HTML doctype parsing (#44)
  • cleaner code for multilingual month expressions
  • extended expressions for extraction in HTML meta fields

... (truncated)


Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

  • `@dependabot rebase` will rebase this PR
  • `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it
  • `@dependabot merge` will merge this PR after your CI passes on it
  • `@dependabot squash and merge` will squash and merge this PR after your CI passes on it
  • `@dependabot cancel merge` will cancel a previously requested merge and block automerging
  • `@dependabot reopen` will reopen this PR if it is closed
  • `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
  • `@dependabot show ignore conditions` will show all of the ignore conditions of the specified dependency
  • `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
  • `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
  • `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

rahulbot commented 10 months ago

Superseded by #76

dependabot[bot] commented 10 months ago

OK, I won't notify you again about this release, but will get in touch when a new version is available. If you'd rather skip all updates until the next major or minor version, let me know by commenting @dependabot ignore this major version or @dependabot ignore this minor version. You can also ignore all major, minor, or patch releases for a dependency by adding an ignore condition with the desired update_types to your config file.

If you change your mind, just re-open this PR and I'll resolve any conflicts on it.