mosaicml / llm-foundry

LLM training code for Databricks foundation models
https://www.databricks.com/blog/introducing-dbrx-new-state-art-open-llm
Apache License 2.0

Unable to use self-developed pre-trained model for fine-tuning in MosaicML #1291

Closed: sauravgrd closed this issue 5 days ago

sauravgrd commented 2 weeks ago

❓ Question

I have pretrained a model on my own dataset using a custom attention implementation (without using MosaicML's LLM Foundry). When I try to fine-tune this pre-trained model via MosaicML by pointing the fine-tuning config YAML at my model's local or Google Cloud Storage path, MosaicML downloads a model artifact it considers appropriate from Hugging Face instead of using my model's location. How can I use my own model artifact for fine-tuning?

Additional context

Note: I saved my pretrained model using torch.save()
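
This detail may be where the mismatch comes from: torch.save() writes a raw pickle/state dict, while loading by path through the Hugging Face machinery expects a save_pretrained()-style checkpoint directory. A minimal sketch of the difference, using gpt2 as a stand-in for the user's model and a hypothetical output directory (a genuinely custom architecture would additionally need its own PreTrainedModel subclass, typically loaded with trust_remote_code):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# torch.save() produces a raw pickle / state dict. Loaders that expect a
# Hugging Face checkpoint (config.json + weight files) cannot read it:
#   torch.save(model.state_dict(), "my_custom_model.pt")

# A Hugging Face-style checkpoint is written with save_pretrained(),
# which emits config.json plus the weight files that
# AutoModelForCausalLM.from_pretrained() knows how to load:
model = AutoModelForCausalLM.from_pretrained("gpt2")  # stand-in model
tokenizer = AutoTokenizer.from_pretrained("gpt2")
model.save_pretrained("./my_hf_checkpoint")      # hypothetical directory
tokenizer.save_pretrained("./my_hf_checkpoint")

# The resulting directory can be referenced by a local path anywhere a
# hub model name is accepted:
reloaded = AutoModelForCausalLM.from_pretrained("./my_hf_checkpoint")
```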

dakinggg commented 1 week ago

LLM Foundry supports either MPT models or models that are on Hugging Face. If you come with an unknown architecture, LLM Foundry will not be able to train it. Please provide more information if I have misunderstood your question.
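
For context, a sketch of how the model section of a fine-tuning config typically points at a checkpoint, based on the example YAMLs under scripts/train/yamls/finetune/ in this repo; exact keys may vary across foundry versions, and the local path shown is hypothetical:

```yaml
model:
  name: hf_causal_lm
  pretrained: true
  # Must point at a Hugging Face-format checkpoint directory
  # (config.json + weight files), not a torch.save() pickle:
  pretrained_model_name_or_path: /path/to/my_hf_checkpoint
```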