commoncrawl / ia-web-commons

Web archiving utility library
Apache License 2.0
9 stars 6 forks source link

Upgrade to a recent Hadoop version #34

Closed sebastian-nagel closed 8 months ago

sebastian-nagel commented 9 months ago

ia-web-commons is based on a quite old Hadoop version (0.20.2) which should be upgraded.

In production we already use a separate branch hadoop-3.2.2" (I know, branch names should not include a version number) which ia-hadoop-tools depend on.

Maybe it's time to upgrade master as well?

Detailed upgrades:

jnioche commented 8 months ago

Should the file pom-cdh4.xml be removed?