archivesunleashed / aut

The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
https://aut.docs.archivesunleashed.org/
Apache License 2.0
137 stars, 33 forks

Hadoop 3.2 support #491

Closed: ruebot closed 2 years ago

ruebot commented 4 years ago

GitHub issue(s):

What does this Pull Request do?

This PR is #375 plus Hadoop 3.2 support.

How should this be tested?

Same as #375. In addition, it should be tested with spark-3.0.0-preview-bin-hadoop3.2.

Additional Notes:

ruebot commented 4 years ago

Looks like Hadoop 3.2 is the target for Spark 3.1.x.

(@helgeho fyi)

ruebot commented 2 years ago

See: https://github.com/archivesunleashed/aut/issues/329#issuecomment-1129269951