Closed jvel07 closed 1 month ago
We have not yet tested MemoRAG on Spanish, and the built-in prompts are currently available only in English and Chinese. This may be a contributing factor to the observed performance degradation. In future development versions, I plan to test additional languages; however, I cannot guarantee optimal performance across multiple languages at this stage.
@qhjqhj00 again, nice repo!
I am using a legal Spanish language (bge-m3 covers Spanish as well) PDF of around 150 pages; and, even using the small_part of it as below,
qa
performs better (although still not decent) thanmemorag
.Using latest memorag version and Llama3.1 inst:
For a simple question in Spanish like: "what is the article 8 about?"
qa
--> outputs a not so detailed but accurate answer (in Spanish), butmemorag
--> outputs (in Spanish) "no information on article 8 is given in the text provided"May I ask, am I missing something or is this the expected performance reported on the paper?