The model is loaded the moment you send the first message in a chat, not when you open the chat. Usually more than one model cannot fit in device memory, so loading a model as soon as a chat is opened would unload the one already in memory and lose its context whenever you switch chats. Deferring the load until the first message avoids that. I will add a loading animation later.
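The deferred loading described above can be pictured with a minimal Python sketch. It is not the app's actual code: `ModelSlot`, `Chat`, `fake_loader`, and the model names are hypothetical, and the sketch assumes a single memory slot that holds at most one model at a time.

```python
from typing import Callable, List, Optional


class ModelSlot:
    """Hypothetical holder for the single model that fits in device memory."""

    def __init__(self, loader: Callable[[str], object]) -> None:
        self._loader = loader
        self.name: Optional[str] = None
        self.model: Optional[object] = None
        self.load_count = 0  # how many times a model was actually loaded

    def acquire(self, name: str) -> object:
        # Reuse the resident model if it is already the requested one.
        if self.name != name:
            self.model = None  # release the previous model before loading
            self.model = self._loader(name)
            self.name = name
            self.load_count += 1
        return self.model


class Chat:
    """Hypothetical chat that loads its model lazily, on the first send."""

    def __init__(self, model_name: str, slot: ModelSlot) -> None:
        # Creating or opening a chat does not touch the slot.
        self.model_name = model_name
        self.slot = slot
        self.history: List[str] = []

    def send(self, text: str) -> None:
        # The load happens here, not in __init__.
        self.slot.acquire(self.model_name)
        self.history.append(text)


def fake_loader(name: str) -> object:
    # Stand-in for a real (slow) model load; assumption for illustration.
    return {"name": name}


slot = ModelSlot(fake_loader)
chat_a = Chat("model-a", slot)
chat_b = Chat("model-b", slot)  # opening a second chat loads nothing
chat_a.send("hello")            # first message: model-a is loaded now
```

In this sketch, opening `chat_b` leaves `model-a` resident; it is replaced only once `chat_b.send(...)` is called.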