lujiaying / MUG-Bench

Data and code of the Findings of EMNLP'23 paper MuG: A Multimodal Classification Benchmark on Game Data with Tabular, Textual, and Visual Fields
https://aclanthology.org/2023.findings-emnlp.354/
Other
8 stars 0 forks source link

Proposal #1

Closed lujiaying closed 1 year ago

lujiaying commented 2 years ago

During my internship, I have the chance to be involved in the automl multimodal classification setting, where the modality includes tabular, text, and image. There already exist many text--image datasets (images with captions, visual question answering), and text--tabular datasets (Multimodal AutoML on Structured Tables with Text Fields). However, there is no comprehensive publically available table, text, and image datasets, which is quite unique and challenging itself.

After some discussion with Wenjing, we find out games like HearthStone, Magic, DOTA, LoL are ideal resources for this multimodal classification testbeds.

For instance, below are several cards from HeathStone. They contain tabular features (cost, attack, HP, card type, minion type, etc.); image features; textual features (descriptions of its effect, or background story). It is relatively easy to convert the data into a classification task (e.g. we can predict the cost of a card by all other features).

image

I was wondering if you are interested in a resource paper, which I think might be a low-hanging fruit to achieve. If multimodal classification is not very interested in our lab direction. there might be a chance that we can convert it to a graph problem or even multimodal KG/KB (I would recommend after releasing the resource paper and using some existing multimodal automl models to set up baselines).

The risk I can name now is mainly the copyright risk. But I do find some Common-Creative copyright resources we can use. I would like to invite my wife as one of the co-authors for the resource paper because she contributes a lot to shape this idea. Moreover, the beginning stage looks relatively easy, so if it is possible I may want to hire some undergraduates to help.

lujiaying commented 2 years ago

Aug 18

Project Deliverables

Project Status

Aug - Sep Plan

lujiaying commented 2 years ago

Aug 20- 27 Plan

I'd suggest the following things for next week

Meeting Notes

lujiaying commented 2 years ago

Aug 28 - Sep 2

Observations

To discuss

Next week plan

lujiaying commented 2 years ago

Sep 3 - Sep 9

To discuss

                  importance    stddev       p_value  n  p99_high   p99_low
Personality         0.409677  0.008834  2.593102e-08  5  0.427867  0.391488
Hobby               0.035484  0.021030  9.777106e-03  5  0.078784 -0.007817
Species             0.000000  0.000000  5.000000e-01  5  0.000000  0.000000
Subtype             0.000000  0.000000  5.000000e-01  5  0.000000  0.000000
Birthday            0.000000  0.000000  5.000000e-01  5  0.000000  0.000000
Catchphrase         0.000000  0.000000  5.000000e-01  5  0.000000  0.000000
Favorite Song       0.000000  0.000000  5.000000e-01  5  0.000000  0.000000
Favorite Saying     0.000000  0.000000  5.000000e-01  5  0.000000  0.000000
Style 1             0.000000  0.000000  5.000000e-01  5  0.000000  0.000000
Style 2             0.000000  0.000000  5.000000e-01  5  0.000000  0.000000
Color 1             0.000000  0.000000  5.000000e-01  5  0.000000  0.000000
Color 2             0.000000  0.000000  5.000000e-01  5  0.000000  0.000000
Default Umbrella    0.000000  0.000000  5.000000e-01  5  0.000000  0.000000
Wallpaper           0.000000  0.000000  5.000000e-01  5  0.000000  0.000000
Flooring            0.000000  0.000000  5.000000e-01  5  0.000000  0.000000
lujiaying commented 2 years ago

Sep 10 - 16

To-Dos

lujiaying commented 1 year ago

Sep 24 - 30

[210 rows x 16 columns]



- [x] Currently, we may not need to upload trained artifacts into the cloud folder. Or If we upload, it would be great to use `exec.py` to save exp_arguments and exp_results.
- [x] Discuss whether we set it as a multi-column prediction or just multiple tasks (no dependency among different tasks)
- [x] Create a Turing server account for Yongchen, because we want to have a fair comparison between different baselines (same CPU cores, same GPU core). @lujiaying need to discuss this with Dr. Yang.

## own algorithm idea
auto-gnn: automatically construct a graph by categorical feature. Then it would be a heterogeneous graph with different types of nodes and types of edges.
lujiaying commented 1 year ago

Sep 31 - Oct 6

TODOs: