-
Hi,
Thank you for your nice work.
You used both video data and image data during training, what was your ratio of video to images at each stage? I am adding a lot of image data, specifically usi…
-
Hi,
Thank you for your nice work.
You use video and image data in the training process. How much does adding image data help the final generated result? I found that adding image data did not im…
-
I read your paper for this project. Cool work!
Is the raw training data available?
-
I am facing the error in the Training data arguments in the dataset in this line {
training_args = TrainingArguments(
output_dir="models",
learning_rate=5e-6,
weight_decay=0.01,
o…
-
# llm run for step evaluation
prompts, prompts_span = self.value_preprocess(valid_solvers)
After executing this line, prompt always got [], and prompts_span got all-zeros list, which makes the tr…
-
Hi there,
First of all, thanks so much for providing the high-quality results and detailed codebase!
There's one question that seems not mention in the paper. What is the training data that is …
-
How can I train with a different language that gollie-transfusion did not train on yet?
-
hi
I found that in the mimic dataset, one report often corresponds to multiple chest X-rays. I would like to ask whether your research uses single image input or multiple image input?
-
hello, very nice work! Would you like to release your training data?
-
Identify and list all data items with queries that are not suitable for use as input sounds in training data: jan-hq/instruction-speech-v1.5 and jan-hq/instruction-speech-v1.