I have a question related how to run the code, i follow up all the instructions that are mentioned in the repo but my confusion is will the model be downloaded itself, for example i want to test the code for lama2 7b chat model, how to use this streaming llama code for that?
I have a question related how to run the code, i follow up all the instructions that are mentioned in the repo but my confusion is will the model be downloaded itself, for example i want to test the code for lama2 7b chat model, how to use this streaming llama code for that?