alan-turing-institute / reginald

Reginald repository for REG Hack Week 23

Make LlamaIndexHF default model microsoft/phi-1_5 #80

Closed by rchan26 11 months ago

rchan26 commented 1 year ago

The current default HF model is StabilityAI/stablelm-tuned-alpha-3b. We could switch to microsoft/phi-1_5, which is about half the size and reportedly performs comparably to larger 7B models. A smaller default model is probably preferable.

This is blocked until the phi-1_5 maintainers add attention_mask support to the model's generation function. At the moment, passing attention_mask raises an error because it is not an argument of model.forward.
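In the meantime, one possible stopgap is to drop any keyword arguments that a model's forward method does not declare before calling it. The snippet below is only a minimal sketch of that idea: `filter_forward_kwargs` and `ToyModel` are hypothetical names made up for illustration and are not part of reginald, transformers, or phi-1_5.

```python
import inspect
from typing import Any, Callable, Dict


def filter_forward_kwargs(
    forward: Callable[..., Any], kwargs: Dict[str, Any]
) -> Dict[str, Any]:
    """Return only the keyword arguments that `forward` declares.

    Hypothetical helper for illustration only.
    """
    params = inspect.signature(forward).parameters
    # A forward that takes **kwargs accepts any keyword, so keep everything.
    if any(p.kind is inspect.Parameter.VAR_KEYWORD for p in params.values()):
        return dict(kwargs)
    return {name: value for name, value in kwargs.items() if name in params}


class ToyModel:
    """Hypothetical stand-in for a model whose forward lacks attention_mask."""

    def forward(self, input_ids):
        # Placeholder computation so the example runs without any ML library.
        return [2 * token for token in input_ids]


model = ToyModel()
inputs = {"input_ids": [1, 2, 3], "attention_mask": [1, 1, 1]}

# attention_mask is removed because ToyModel.forward does not declare it.
safe_inputs = filter_forward_kwargs(model.forward, inputs)
output = model.forward(**safe_inputs)
```

Dropping attention_mask means padding tokens are no longer masked, so this would only be reasonable for single, unpadded sequences. Waiting for upstream support remains the cleaner fix.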

rchan26 commented 11 months ago

See https://huggingface.co/microsoft/phi-1_5/discussions/7