nomic-ai / gpt4all

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
https://nomic.ai/gpt4all
MIT License

Issue: Llama 2 loaded, but seems dumber? #1251

Open enzyme69 opened 1 year ago

enzyme69 commented 1 year ago

Issue you'd like to raise.

[Screenshots attached: 2023-07-24, 12:27:29 am and 12:27:41 am]

I am using Llama 2 model that I got from here: https://gist.github.com/adrienbrault/b76631c56c736def9bc1bc2167b5d129

When run with the command below, the Llama 2 model seems smarter than the same model running through the gpt4all interface. I wonder why? Is there a setting or temperature I need to adjust manually?

# Read a prompt from stdin, then run llama.cpp's ./main with the
# Llama 2 chat template. Note: ${SYSTEM} must already be set in the
# environment before this runs.
echo "Prompt: " \
    && read PROMPT \
    && ./main \
        -t 8 \
        -ngl 1 \
        -m llama-2-13b-chat.ggmlv3.q4_0.bin \
        --color \
        -c 2048 \
        --temp 0.7 \
        --repeat_penalty 1.1 \
        -n -1 \
        -p "[INST] <<SYS>> ${SYSTEM} <</SYS>> ${PROMPT} [/INST]"
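For reference, the `-p` argument above assembles the Llama 2 chat template by hand. A minimal sketch of that assembly (the function name is illustrative, not a gpt4all or llama.cpp API):

```python
def llama2_prompt(system: str, user: str) -> str:
    """Wrap a system message and a user message in Llama 2 chat markers,
    matching the template passed to ./main via -p above."""
    return f"[INST] <<SYS>> {system} <</SYS>> {user} [/INST]"

prompt = llama2_prompt("You are a helpful assistant.", "Why is the sky blue?")
print(prompt)
```

If a front end omits these markers (or uses a different template), the same weights can produce noticeably worse answers, which may explain the quality gap between the CLI and the GUI.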

Suggestion:

No response

qnixsynapse commented 1 year ago

I use this prompt template: [screenshot of prompt template]

And with your prompt it responds like this:

[screenshot of the model's response]

niansa commented 1 year ago

Additionally, gpt4all does not currently set the rhl_eps param of llama.cpp to the correct value for llama2.

qnixsynapse commented 1 year ago

@niansa rhl_eps? You mean the eps value in RMSNorm layer?