Open manisnesan opened 6 months ago
JaColBERT vastly outperforms all previous monolingual retrieval approaches and competes with the best multilingual methods, despite unfavourable evaluation settings (out-of-domain vs. in-domain for the multilingual models). JaColBERT reaches an average Recall@10 of 0.813, noticeably ahead of the previous best-performing monolingual model (0.716) and only slightly behind multilingual-e5-base (0.820).
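For anyone unsure what the Recall@10 figures mean, here is a minimal sketch of one common definition: the fraction of a query's relevant documents that show up in the top 10 retrieved results, averaged over queries. The function names and the toy data are made up for illustration, and the JaColBERT evaluation may define or aggregate the metric differently (the numbers quoted above are averages across datasets).

```python
from typing import Iterable, Sequence, Set, Tuple


def recall_at_k(ranked_ids: Sequence[str], relevant_ids: Set[str], k: int = 10) -> float:
    """Fraction of the relevant documents that appear in the top-k results."""
    if not relevant_ids:
        raise ValueError("relevant_ids must not be empty")
    top_k = set(ranked_ids[:k])
    return len(top_k & relevant_ids) / len(relevant_ids)


def mean_recall_at_k(runs: Iterable[Tuple[Sequence[str], Set[str]]], k: int = 10) -> float:
    """Average recall@k over (ranked results, relevant set) pairs, one pair per query."""
    scores = [recall_at_k(ranked, relevant, k) for ranked, relevant in runs]
    return sum(scores) / len(scores)


# Hypothetical toy data: three queries with their ranked results and relevant sets.
runs = [
    (["d3", "d7", "d1"], {"d1"}),        # relevant doc retrieved -> 1.0
    (["d2", "d9", "d4"], {"d5"}),        # relevant doc missed    -> 0.0
    (["d8", "d6", "d0"], {"d6", "d5"}),  # one of two retrieved   -> 0.5
]

print(mean_recall_at_k(runs, k=10))  # 0.5
```

Under this definition, a score of 0.813 would mean that on average about 81% of the relevant passages are found in the top 10 results.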
https://www.sarvam.ai/blog/announcing-openhathi-series - Bilingual LLMs frugally
Here are the notes from Sarvam's OpenHathi Series launch. For people unfamiliar: Sarvam is an Indian startup focused on training foundational LLMs for Indian languages. They launched the OpenHathi series of models yesterday. OpenHathi is an attempt to add support for a new language to an existing open model such as Llama 2 or Mistral.
Amazing work, @Prof. Pratyush Kumar Sarvam. This is the ChatGPT moment for Bharath!