scofield7419 / UMMT-VSH

Code for the ACL 2023 paper Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination

UMMT-VSH


Step 0. install prerequisites

conda env create -f environments/full.yml
conda activate UMMT-VSH
pip install -e fairseq/
pip install -e taming-transformers/ 
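
After the editable installs, a quick import check can confirm that both packages are visible in the active environment. The snippet below is a minimal sketch, not part of the repository: the helper `check_modules` is hypothetical, and the module names `fairseq` and `taming` are assumed to be the top-level packages exposed by the `fairseq/` and `taming-transformers/` directories.

```python
from importlib.util import find_spec


def check_modules(names):
    """Map each top-level module name to whether Python can locate it."""
    return {name: find_spec(name) is not None for name in names}


if __name__ == "__main__":
    # Assumption: the two editable installs expose the modules
    # `fairseq` and `taming`; adjust the names if your checkout differs.
    for name, found in check_modules(["fairseq", "taming"]).items():
        print(f"{name}: {'ok' if found else 'MISSING'}")
```

If either module is reported as missing, re-running the corresponding `pip install -e` command inside the activated `UMMT-VSH` environment is a reasonable first thing to try.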

Step 1. prepare data

Step 2. preprocess data

Step 3. SG parsing for data

Parse the scene graph (SG) structures for all images and texts using the tools in SG-parsing/VSG (visual scene graphs, for images) and SG-parsing/LSG (language scene graphs, for texts).

Step 4. train system

Step 5. test with system