Closed kimborgen closed 1 year ago
As determined in #2, split the model into two different models. One that uses Rotary embeddings and one using Alibi functionality. This will increase readability and make it easier to understand and extend the models.
Extracted rotary code, alibi is defered and put into not_working folder...
As determined in #2, split the model into two different models. One that uses Rotary embeddings and one using Alibi functionality. This will increase readability and make it easier to understand and extend the models.