Open dionis opened 5 months ago
Study LLMs trained on Spanish-language corpora, giving priority to those that have been built on the basis of:
Expected result: A reasoned selection of the models to use for fine-tuning (and possibly pretraining) when training a model for a medical context.
Was implemented on branch
Steps to test QLoRA with the EPFL Hugging Face library
Select a Spanish LLM and check whether it can be trained with the QLoRA process.
Study the QLoRA concept and how to use the EPFL library.
Check whether the Meditron fine-tuning pipeline can be adapted to use QLoRA and a single-GPU training process.
Run a small test with all configurations.
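As a starting point for the small test, here is a minimal sketch of a single-GPU QLoRA setup. It assumes the Hugging Face transformers, peft and bitsandbytes packages rather than the Meditron training pipeline itself, so it only shows what a candidate model needs in order to load in 4-bit with LoRA adapters attached. The rank, alpha, dropout and target module names are illustrative assumptions, not values taken from this issue.

```python
# Hyperparameters below are illustrative assumptions, not values from this issue.
MODEL_ID = "clibrain/Llama-2-7b-ft-instruct-es"  # one of the candidates listed in this issue

LORA_SETTINGS = {
    "r": 16,
    "lora_alpha": 32,
    "lora_dropout": 0.05,
    # Attention projection names used by Llama-style models;
    # other architectures (e.g. Falcon) name these layers differently.
    "target_modules": ["q_proj", "k_proj", "v_proj", "o_proj"],
}


def build_qlora_model(model_id: str = MODEL_ID):
    """Load a causal LM in 4-bit NF4 and attach LoRA adapters (needs a CUDA GPU)."""
    # Imported lazily so this file can be imported without the GPU stack installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
    from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

    # 4-bit NF4 quantization with double quantization, as used in QLoRA.
    bnb_config = BitsAndBytesConfig(
        load_in_4bit=True,
        bnb_4bit_quant_type="nf4",
        bnb_4bit_use_double_quant=True,
        bnb_4bit_compute_dtype=torch.bfloat16,  # assumes a GPU with bfloat16 support
    )
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id, quantization_config=bnb_config, device_map="auto"
    )
    # Prepares the quantized model for training (e.g. casts norm layers, enables input grads).
    model = prepare_model_for_kbit_training(model)
    lora_config = LoraConfig(bias="none", task_type="CAUSAL_LM", **LORA_SETTINGS)
    model = get_peft_model(model, lora_config)
    model.print_trainable_parameters()
    return model, tokenizer
```

Calling build_qlora_model() downloads the checkpoint and requires a GPU. If it loads and reports a small trainable-parameter fraction, the model is at least a plausible QLoRA candidate.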
Evaluate whether the LLM was pretrained only on Spanish text, or on more Spanish text than text in other languages.
The training library used in Meditron can be used with the selected LLM.
The LLM can be fine-tuned without being expensive in time and memory.
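The memory criterion can be checked roughly before any download. The sketch below is back-of-the-envelope arithmetic only: it assumes a nominal 7B parameter count and Llama-2-7B-like shapes (hidden size 4096, 32 layers, LoRA on four square attention projections per layer), and it ignores activations, optimizer state and quantization constants, so real usage will be higher.

```python
def lora_trainable_params(d_in: int, d_out: int, rank: int) -> int:
    """LoRA on a d_out x d_in weight adds two matrices: rank x d_in and d_out x rank."""
    return rank * (d_in + d_out)


def weight_memory_gib(n_params: int, bits_per_param: int) -> float:
    """Memory needed just to store the weights, in GiB."""
    return n_params * bits_per_param / 8 / 1024**3


# Assumed shapes, for illustration only.
hidden, layers, rank = 4096, 32, 16
adapter_params = layers * 4 * lora_trainable_params(hidden, hidden, rank)

nominal_params = 7_000_000_000
fp16_gib = weight_memory_gib(nominal_params, 16)
nf4_gib = weight_memory_gib(nominal_params, 4)

print(f"LoRA trainable parameters: {adapter_params:,}")  # 16,777,216
print(f"fp16 weights: {fp16_gib:.2f} GiB")               # 13.04 GiB
print(f"4-bit weights: {nf4_gib:.2f} GiB")               # 3.26 GiB
```

Under these assumptions the adapters are well under 1% of the base model's parameters, and 4-bit storage cuts weight memory to about a quarter of fp16.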
Hugging Face models found:
projecte-aina/aguila-7b (Falcon base)
clibrain/Llama-2-7b-ft-instruct-es (Llama2 base)
TheBloke/Barcenas-Mistral-7B-GGUF (Mistral base)
clibrain/lince-zero (Falcon base)
clibrain/Llama-2-13b-ft-instruct-es (Llama2 base)
google/gemma-7b-it (Gemma base)
allenai/OLMo-7B (OLMo base)
clibrain/Llama-2-13b-ft-instruct-es-gptq-4bit (Llama2 base)
clibrain/lince-mistral-7b-it-es (Mistral base)
Kukedlc/Llama-7b-spanish (Llama2 base)
google/gemma-7b (Gemma base)
allenai/OLMo-1B (OLMo base)
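To shortlist candidates from a list in this format, the entries can be grouped by base family. The sketch below parses lines shaped like "repo (Family base)" and flags repositories whose names suggest already-quantized weights (GGUF, GPTQ), on the assumption that a bitsandbytes-style QLoRA run would start from the unquantized checkpoint instead. The helper name and the name-based heuristic are hypothetical; only a few entries are used as sample input.

```python
import re
from collections import defaultdict

# Matches lines such as "org/model-name (Llama2 base)".
ENTRY = re.compile(r"^(?P<repo>\S+/\S+) \((?P<family>.+) base\)$")
# Name fragments taken as a hint that the weights are already quantized (heuristic).
PREQUANTIZED_MARKERS = ("gguf", "gptq")


def group_by_family(lines):
    """Return {family: [(repo, looks_prequantized), ...]} for well-formed lines."""
    groups = defaultdict(list)
    for line in lines:
        match = ENTRY.match(line.strip())
        if match is None:
            continue  # skip lines that do not follow the expected shape
        repo = match.group("repo")
        prequantized = any(marker in repo.lower() for marker in PREQUANTIZED_MARKERS)
        groups[match.group("family")].append((repo, prequantized))
    return dict(groups)


sample = [
    "projecte-aina/aguila-7b (Falcon base)",
    "clibrain/Llama-2-7b-ft-instruct-es (Llama2 base)",
    "TheBloke/Barcenas-Mistral-7B-GGUF (Mistral base)",
    "clibrain/Llama-2-13b-ft-instruct-es-gptq-4bit (Llama2 base)",
]
groups = group_by_family(sample)
for family, repos in groups.items():
    print(family, repos)
```

Grouping by family matters because the LoRA target module names typically differ between architectures, so one configuration may not carry over from one family to another.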
Conclusions:
Use an LLM to pretrain the model if we have enough data.
Research Objective
Research Criteria:
References to resources:
Spanish language models: https://github.com/PlanTL-GOB-ES/lm-spanish
Biomedical and clinical language model for Spanish: https://github.com/PlanTL-GOB-ES/lm-biomedical-clinical-es
Biomedical language model for Spanish: https://huggingface.co/PlanTL-GOB-ES/bsc-bio-es