Open tsailiming opened 4 months ago
I moved this issue over to the sdg repo as that’s where the relevant code is.
Note to self / other devs: This is with the “simple” pipeline and the default merlinite model.
I believe this is generally resolved by using a larger teacher model, such as mixtral-8x7b instead of merlinite. Or, at least for me, I also saw this when using merlinite but it went away when swapping over to mixtral. I know that's not a great answer, as mixtral takes a lot more resources to run for inference than merlinite. Perhaps there is something we could do to optimize the simple pipeline used by merlinite to reduce the frequency of this happening, and that may be worth investigating.
This issue has been automatically marked as stale because it has not had activity within 90 days. It will be automatically closed if no further activity occurs within 30 days.
Describe the bug Looking at the file
messages_merlinite-7b-lab-Q4_K_M_2024-07-21T05_02_22.jsonl
, there are numerous of such contentThe alignment process tends to cause the model to return a response that replied with a ' or "Answer:"
I am not sure whether this is a bug?
To Reproduce Steps to reproduce the behavior:
2 A sample knowledge in taxonomy/knowledge/parasol/overview/qna.yaml
https://raw.githubusercontent.com/gshipley/backToTheFuture/main/qna.yaml
Expected behavior
Screenshots
Device Info (please complete the following information):
Additional context