mrseanryan / gpt-dm

Data modelling via natural language using an LLM. Outputs JSON or SQL. Also generates Test data in SQL or CSV format.
MIT License
2 stars 0 forks source link

Try generating higher volume of test data (via Mistral 7B ?) #10

Open mrseanryan opened 4 months ago

mrseanryan commented 4 months ago

Approach:


LLM - Mistral 7B

Context size: a sliding 4K window - see https://huggingface.co/mistralai/Mistral-7B-v0.1/discussions/4

There are many flavours of Mistral-7B:

  1. regular mistral-7B, quantized

  2. cognitivecomputations/dolphin-2.6-mistral-7b-dpo

4 - instruct prompting, and newer:

  1. a base version, suitable for FT

  2. hermes-2.5 - mistral-7B

  3. phi from Microsoft