Predict Docker Image Size given lib embedding

I used a simple keras MLP with two hidden layers of 100 neurons and a mean squared error loss.

_________________________________________________________________
Layer (type)                 Output Shape              Param #   
=================================================================
dense_1 (Dense)              (None, 100)               30100     
_________________________________________________________________
activation_1 (Activation)    (None, 100)               0         
_________________________________________________________________
dense_2 (Dense)              (None, 100)               10100     
_________________________________________________________________
activation_2 (Activation)    (None, 100)               0         
_________________________________________________________________
dense_3 (Dense)              (None, 1)                 101       
=================================================================
Total params: 40,301
Trainable params: 40,301
Non-trainable params: 0
_________________________________________________________________

The neural network quickly converges and gives an estimator that is ~3 times better than taking the mean size of docker images. This is using a subset of 5% of the final dataset.

src-d / ml-backlog

Predict Docker Image Size given lib embedding #85