Closed Somedaywilldo closed 4 years ago
Hi Yihang,
This is what Valentin had to say about this:
We use only the plot text, as they did for GMU, so their interpretation is ok. Concerning data preparation, I am not able to find the data preparation scripts again. But there should be almost nothing to do, basically downloading lisi1.unal.edu.co/mmimdb/mmimdb.tar.gz and use images and plot as modalities.
Cheers
Juan
On Tue, 21 Jul 2020, 18:23 Yihang Yin, notifications@github.com wrote:
Dear Authors,
Would you mind sharing the scripts to prepare the mm_imdb raw dataset? Or could you tell me if my interpretation is right or not.
This part in your datasets/mm_imdb.py:
image = np.load(imagepath) label = np.load(labelpath) text = np.load(textpath)
The "image" is the poster image, the "label" is "genres", and the text is "plot", here a sample from mm_imdb raw dataset:
"plot": [ "A stationary camera looks at a large anvil with a blacksmith behind it and one on either side. The smith in the middle draws a heated metal rod from the fire, places it on the anvil, and all three begin a rhythmic hammering. After several blows, the metal goes back in the fire. One smith pulls out a bottle of beer, and they each take a swig. Then, out comes the glowing metal and the hammering resumes.", "Three men hammer on an anvil and pass a bottle of beer around." ], "votes": 1335, "title": "Blacksmith Scene", "smart canonical title": "Blacksmith Scene", "long imdb canonical title": "Blacksmith Scene (1893)", "certificates": [ "USA:Unrated" ], "long imdb title": "Blacksmith Scene (1893)", "country codes": [ "us" ], "smart long imdb canonical title": "Blacksmith Scene (1893)", "cover url": "http://ia.media-imdb.com/images/M/MV5BNDg0ZDg0YWYtYzMwYi00ZjVlLWI5YzUtNzBkNjlhZWM5ODk5XkEyXkFqcGdeQXVyNDk0MDg4NDk@._V1. _SX100SY75.jpg", "sound mix": [ "lent" ], "genres": [ "Short" ],
— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/juanmanpr/mfas/issues/8, or unsubscribe https://github.com/notifications/unsubscribe-auth/AA7Y7EG5ALBZQIZP3A2ZWBDR4XFKRANCNFSM4PD2HKAQ .
Thank you so much!
Dear Authors,
Would you mind sharing the scripts to prepare the mm_imdb raw dataset? Or could you tell me if my interpretation is right or not.
This part in your datasets/mm_imdb.py:
The "image" is the poster image, the "label" is "genres", and the text is "plot", here a sample from mm_imdb raw dataset: