GELOG / adamcloud

Portable cloud infrastructure for a genomic transformation pipeline using Adam
2 stars 0 forks source link

Choose the base Linux/Java distribution for Dockerfile #5

Closed davidonlaptop closed 9 years ago

davidonlaptop commented 9 years ago

Linux Distribution

Ideally, it would be best to have a single stack of software distribution to reduce the download time and facilitate the maintenability.

We chose Ubuntu 14.04 LTS because it is the most widely used linux distro. If there is strong arguments in favor of another distribution, now would be the good time to discuss it. Or it is required to support multiple Guest OS?

Note: we didn't use the "ubuntu:latest" image, because it adds extra package scripts which did not seem worth the extra size.

Java Provider and Java Version

Same question for Java. Currently we use Oracle Java 1.7 (latest version).

OpenJDK vs Oracle JDK

Historically there used to be some issues with OpenJDK and Hadoop - as far as I can remember that was with OpenJDK 1.6. Now perhaps those issues are resolved with OpenJDK 1.7?

Also, I'm not sure if we are violating Oracle's license by distributing Oracle JDK within a docker container.

It seems acceptable to switch to OpenJDK 1.7.

Java 1.7 vs 1.8

It seems best to stick with java 1.7.

flangelier commented 9 years ago

We'll use OpenJDK because we didn't find a way to distribute Oracle JDK in a Docker image by letting the user accept the license.