jperezrua / mfas

Implementation of CVPR 2019 paper "Mfas: Multimodal fusion architecture search"
77 stars 20 forks source link

Preparing MM_IMDB Dataset #8

Closed Somedaywilldo closed 4 years ago

Somedaywilldo commented 4 years ago

Dear Authors,

Would you mind sharing the scripts to prepare the mm_imdb raw dataset? Or could you tell me if my interpretation is right or not.

This part in your datasets/mm_imdb.py:

image = np.load(imagepath)
label = np.load(labelpath)
text = np.load(textpath)

The "image" is the poster image, the "label" is "genres", and the text is "plot", here a sample from mm_imdb raw dataset:

     "plot": [
         "A stationary camera looks at a large anvil with a blacksmith behind it and one on either side. The smith in the middle draws a        heated metal rod from the fire, places it on the anvil, and all three begin a rhythmic hammering. After several blows, the metal goes          back in the fire. One smith pulls out a bottle of beer, and they each take a swig. Then, out comes the glowing metal and the hammering         resumes.",
         "Three men hammer on an anvil and pass a bottle of beer around."
     ],
     "votes": 1335,
     "title": "Blacksmith Scene",
     "smart canonical title": "Blacksmith Scene",
     "long imdb canonical title": "Blacksmith Scene (1893)",
     "certificates": [
         "USA:Unrated"
     ],
     "long imdb title": "Blacksmith Scene (1893)",
     "country codes": [
         "us"
     ],
     "smart long imdb canonical title": "Blacksmith Scene (1893)",
     "cover url": "http://ia.media-imdb.com/images/M/MV5BNDg0ZDg0YWYtYzMwYi00ZjVlLWI5YzUtNzBkNjlhZWM5ODk5XkEyXkFqcGdeQXVyNDk0MDg4NDk@._V1.      _SX100_SY75_.jpg",
     "sound mix": [
         "lent"
     ],
     "genres": [
         "Short"
     ],
jperezrua commented 4 years ago

Hi Yihang,

This is what Valentin had to say about this:

We use only the plot text, as they did for GMU, so their interpretation is ok. Concerning data preparation, I am not able to find the data preparation scripts again. But there should be almost nothing to do, basically downloading lisi1.unal.edu.co/mmimdb/mmimdb.tar.gz and use images and plot as modalities.

Cheers

Juan

On Tue, 21 Jul 2020, 18:23 Yihang Yin, notifications@github.com wrote:

Dear Authors,

Would you mind sharing the scripts to prepare the mm_imdb raw dataset? Or could you tell me if my interpretation is right or not.

This part in your datasets/mm_imdb.py:

image = np.load(imagepath) label = np.load(labelpath) text = np.load(textpath)

The "image" is the poster image, the "label" is "genres", and the text is "plot", here a sample from mm_imdb raw dataset:

"plot": [ "A stationary camera looks at a large anvil with a blacksmith behind it and one on either side. The smith in the middle draws a heated metal rod from the fire, places it on the anvil, and all three begin a rhythmic hammering. After several blows, the metal goes back in the fire. One smith pulls out a bottle of beer, and they each take a swig. Then, out comes the glowing metal and the hammering resumes.", "Three men hammer on an anvil and pass a bottle of beer around." ], "votes": 1335, "title": "Blacksmith Scene", "smart canonical title": "Blacksmith Scene", "long imdb canonical title": "Blacksmith Scene (1893)", "certificates": [ "USA:Unrated" ], "long imdb title": "Blacksmith Scene (1893)", "country codes": [ "us" ], "smart long imdb canonical title": "Blacksmith Scene (1893)", "cover url": "http://ia.media-imdb.com/images/M/MV5BNDg0ZDg0YWYtYzMwYi00ZjVlLWI5YzUtNzBkNjlhZWM5ODk5XkEyXkFqcGdeQXVyNDk0MDg4NDk@._V1. _SX100SY75.jpg", "sound mix": [ "lent" ], "genres": [ "Short" ],

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/juanmanpr/mfas/issues/8, or unsubscribe https://github.com/notifications/unsubscribe-auth/AA7Y7EG5ALBZQIZP3A2ZWBDR4XFKRANCNFSM4PD2HKAQ .

Somedaywilldo commented 4 years ago

Thank you so much!