Add dev accuracy to the plot.

microsoft / deep-language-networks

We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts at each layer. We stack two such layers, feeding the output of one layer to the next. We call the stacked architecture a Deep Language Network - DLN

MIT License

91 stars 13 forks source link

Add dev accuracy to the plot. #38

Closed MarcCote closed 9 months ago

MarcCote commented 9 months ago

This PR add the dev accuracy to the plot. It also deals with loading the same dataset multiple times (i.e., different seed).

Usage

streamlit run ../demo/demo.py log/text-davinci-003_text-davinci-003/navigate/DLN-1/*/*.json