saagie / technologies

Technologies listing used by Saagie
Apache License 2.0
5 stars 14 forks source link

[SDKTECHNO-106] Check and repare all DL from Cloudera #260

Closed finalspy closed 1 year ago

finalspy commented 3 years ago

Since Cloudera changed its policies : https://www.cloudera.com/downloads/paywall-expansion.html Downloads from archive.cloudera.com doesnt work anymore.

So this implies to either use internal archive or use apache clients

Needs first to summarize images and binaries impacted. Then find a workaround.

github-actions[bot] commented 3 years ago

:+1: JIRA Issue created : SDKTECHNO-106

finalspy commented 3 years ago

Impacted images :

finalspy commented 3 years ago

Cloudera archives concerned :

I also noticed references to the following repos :

finalspy commented 3 years ago

3 URLs starting with downloads.cloudera.com are still downloadable at the time I wrote this comment. But all parent directories are unreachable and trying to get them results in a 404 http response or an empty directory listing depending on the final /

finalspy commented 3 years ago

Jupyter-minimal image uses cloudera archives to install hdfs-sentry-plugin. This results in the following packages installed from Cloudera :

finalspy commented 3 years ago

Here's a summary of jar libs differences I found between cloudera and Apache Hive ans Sqoop tat.gz distributions https://docs.google.com/spreadsheets/d/1oZToezEXyACnVOWB0F7qr2bG5h6YqcaecF9kqkwZPJU/edit?usp=sharing