Closed yukw777 closed 1 year ago
Thanks! We are working on final check on our dataset, and will get back to you soon.
Thank you for your interest in Otter!
We acknowledge the issue with the wrong "instruction" file upload and apologize for this. And, we've checked and ensured that the "instruction" file stored in the local is correct.
The correct LLaVA "instruction" file will be uploaded this week. Stay tuned!
Here's our updated instructions at OneDrive.
Inside the folder named LA
, you will find multiple instruction files with the suffix _instructions.json
. These files contain the instructions we have provided. Additionally, there are other folders and files already available, except for the TVC
folder that we need to do a final check. We will soon update our readme to announce the release of instructions and provide guidance on how to convert-it
from public datasets to our required image file format, such as LA.json
.
Please note that the annotations I previously provided may be incorrect because I intended to merge them into a single file. However, our training process did not actually combine the LA_CONV
, LA_DD
, LACR_I2I
, and LACR_T2T
annotations together.
Our training pipeline loads them with comma separator, like the following, note that --images_path
should also be repeated 4 times to align with --mimicit_path
and --train_config_path
.
export PYTHONPATH=.
accelerate launch --config_file=./pipeline/accelerate_configs/accelerate_config_fsdp.yaml \
pipeline/train/instruction_following.py \
--pretrained_model_name_or_path=/home/luodian/azure_storagev2/otter/checkpoints/otter9B_DC_fullset_no_desc_june12 \
--dataset_resampled \
--mimicit_path="/path/to/LACR_I2I_instructions.json,/path/to/LACR_T2T_instructions.json,/path/to/LACONV_instructions.json,/path/to/LADD_instructions.json" \
--images_path="/path/to/LA.json,/path/to/LA.json,/path/to/LA.json,/path/to/LA.json" \
--train_config_path="/path/to/LACR_I2I_train.json,/path/to/LACR_T2T_train.json,/path/to/LACONV_train.json,/path/to/LADD_train.json" \
--batch_size=16 \
--num_epochs=6 \
--report_to_wandb \
--wandb_entity=ntu-slab \
--external_save_dir=./checkpoints \
--save_hf_model \
--run_name=otter9B_LA \
--wandb_project=otter9B \
--workers=8 \
--cross_attn_every_n_layers=4 \
--lr_scheduler=cosine \
--delete_previous_checkpoint \
--learning_rate=1e-5 \
--warmup_steps_ratio=0.01
The --mimicit_path
loads our provided _instructions.json
and --train_config_path
loads _train.json
.
Inside _train.json
, each instruction is associated with its related instructions. We provide it for more flexibly define each instruction's related instructions. It serves for different in-context objectives.
I was looking around the annotations for LA in-context, I noticed that the instructions specified as related instructions do not exist. Dense Caption doesn't seem to have this problem.
Maybe I've misunderstood what related instructions are? Either way, please let me know!