Generate.py doesn't work for test set

elliottd / GroundedTranslation

Multilingual image description

https://staff.fnwi.uva.nl/d.elliott/GroundedTranslation/

BSD 3-Clause "New" or "Revised" License

46 stars 25 forks source link

Generate.py doesn't work for test set #20

Closed evanmiltenburg closed 8 years ago

evanmiltenburg commented 8 years ago

This kept me up yesterday: I was getting 507 sentences for the test set, rather than 1000. Looking at generate.py I found the problem was this part:

                seen += text.shape[0]
                if seen == self.data_gen.split_sizes['val']:
                    # Hacky way to break out of the generator
                    break

I fixed it by changing line 107 from for data in generator: to this:

# Use enumerate() to count the number of data points, starting at 1.
for seen, data in enumerate(generator, start=1):

and lines 242-245 (the 'hacky way') to:

                if seen == len(self.dataset[prefix]):
                    # Hacky way to break out of the generator
                    break

This solution should now work for any part of the dataset you want to test on :)

evanmiltenburg commented 8 years ago

Some other additions on my branch that I've found really useful are:

making an additional variable for the completed sentence, rather than use this multiple times (for logging, writing to the valGenerated/testGenerated file, and for JSON output --see below):

' '.join([x for x
                                          in itertools.takewhile(
                                              lambda n: n != "<E>",
                                              complete_sentences[i])])

Writing out JSON files with the image ID and the generated sentence. This required changing data_generator so that it also yields the ID:

               yield {'text':arrays[0], 'img': arrays[1],
                      'output': targets, 'ident': ident}

elliottd commented 8 years ago

The first problem has already been fixed in the experimental_datagen branch although perhaps I had not pushed the code to the repository. Sorry for that!

I like the idea of yielding extra key:value pairs in the data generator. But how does the model behave when fit_generator() and predict() receive an input that is never used in the model?

evanmiltenburg commented 8 years ago

Not sure, and I don't have time at the moment to test this. I'd hope that it just ignores unused keys (that would be the best way to deal with cases like this anyway, just use the keys that it's specified to use and don't make any additional assumptions). It works very well for generating descriptions, though! Using the IDs makes comparing the generated sentences easier and more reliable (it's more difficult to mix up the images).

elliottd commented 8 years ago

@evanmiltenburg logically, it sounds like model.predict() can safely ignore the {... , 'ident': ident} pair in your dictionary because it doesn't seem to be complaining about it.

Perhaps we could look at including this type of additional information in a batch dictionary but mark it as experimental to signify that it might break at any time.

elliottd commented 8 years ago

Fixed in 38be3ed05cd30497db5b53f20e38079f848ec3ef