Closed 3ygun closed 7 years ago
I have not seen this issue. I've had as high as 87% accuracy.
One thing I see in this code base that needs changed is the init op for tensorflow. At TensorFlowTrainer.java check line 99:
trainingInterface.run(new String[]{}, new String[]{initName});
In the current code base, this line will run after every communication round with the server, which is wrong. I have since changed this error and it will appear in future commits.
@tylermzeller I guess don't update to TensorFlow 1.12 yet then.
@3ygun oh jeez
Could you post the server config and tensorflow python model file you are using? I think problems may be stemming from that by chance.
Resolved the issue... TLDR make sure you are iterating through the correct files...
Basically, I was using the MNISTTestImages.50.l2.dat
with MNISTTestLabels.dat
so only a few of the labels matched 🤕 now I'm getting up to 83% after looking at ~20000 of the samples 🍷
It's not training.
In the
TensorFlowTrainer.java
replace:with:
And look at the log debug with weights aren't changing between runs only the data we run the initial random weights against which explains why we're not getting >20% accuracy.