Closed tomsherborne closed 6 years ago
You're right, I will fix that with the next commit (just iterate over batch
instead of dataset.values
). Regarding the other problem: You basically try to just retrieve the TensorFlow records, right? I suspect the TF queue initialization might be missing:
session = tf.Session()
coordinator = tf.train.Coordinator()
queue_threads = tf.train.start_queue_runners(sess=session, coord=coordinator)
[...]
coordinator.request_stop()
coordinator.join(threads=queue_threads)
session.close()
I realise that this is somewhere hidden in my models code, and I should add a note to the readme and the Python file, if that indeed solves the problem. So please let me know. :-)
Hi Alex,
I'm playing around with your tf_util interface for loading batches of data and I find that if I generate a small set as:
Then when running the example data loading:
I get the error
I find this is because of the loop on line
75
that iterates through the key, value pairs from thedataset.values
dict but thebatch
dict no longer containsalternatives
due to the call ofrecords.pop('alternatives')
. I added a breaking condition to fix this as:I have this as a PR from my fork that I can submit but I'm finding a larger problem with loading data in this way as the call to evaluate a batch:
hangs for an unreasonably long time. I've not measured exactly because it might never recover but it appears to demand at least 10 minutes of setup time whereas the data loading modules I've written take almost no time to evaluate a batch. I'm not sure where the issue is but I'm happy to look somewhere if you can point me in the right direction if you find the same issue trying to evaluate a batch.
[Using Mac OSX 10.13.2, Python 3.5.4, Tensorflow 1.5.0]