Closed TorresYangX closed 2 months ago
Hi @TorresYangX, thanks for raising the question! When training the retriever, we combined both the training and test datasets to train the projector but we do not use the test data for RAG and training the VLM. Thank you very much!
Hello, authors, thanks for your work.
I noticed something unusual. Could you explain why both test data and train data are used simultaneously when training the retriever? Specifically, in https://github.com/YuanJianhao508/RAG-Driver/blob/24a0bd01f56c6fa4f2563a4c94925856e2ba707e/retrieval/train.py#L33
conv
includes bothtrain_conv
andtest_conv
. Could you explain the reason behind doing this?Thanks for your assistance!