YeonwooSung / ai_book

AI book for everyone
24 stars 5 forks

Mistral-7b, Zephyr-7b-alpha #52

Open YeonwooSung opened 11 months ago

YeonwooSung commented 11 months ago

Mistral-7b-v0.1, Zephyr-7b-alpha

DPO vs PPO: is DPO better for fine-tuning?
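For context on the comparison, here is a minimal sketch of the per-example DPO objective, assuming the summed log-probabilities of the chosen and rejected responses under the policy and a frozen reference model are already available. The function name `dpo_loss`, its argument names, and the default `beta=0.1` are illustrative assumptions, not taken from any Mistral or Zephyr code. Unlike PPO, there is no separate reward model or sampling loop here; the preference pair feeds directly into a classification-style loss.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-example DPO loss: -log(sigmoid(beta * margin)).

    All inputs are summed token log-probabilities of a full response.
    Names and the default beta are hypothetical, chosen for illustration.
    """
    # Implicit reward of each response: log-ratio of policy to reference.
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    x = beta * (chosen_logratio - rejected_logratio)

    # -log(sigmoid(x)) equals softplus(-x); branch for numerical stability.
    if x >= 0:
        return math.log1p(math.exp(-x))
    return -x + math.log1p(math.exp(x))


# When the policy matches the reference, the margin is zero
# and the loss is log(2).
baseline = dpo_loss(-1.0, -2.0, -1.0, -2.0)

# When the policy favors the chosen response more than the reference does,
# the loss drops below that baseline.
improved = dpo_loss(-0.5, -3.0, -1.0, -2.0)
```

In practice the log-probabilities would come from batched model forward passes and the loss would be averaged over the batch; this sketch only shows the arithmetic for a single preference pair.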

YeonwooSung commented 11 months ago

Source code for the Mistral LLM