microsoft / deep-language-networks

We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts at each layer. We stack two such layers, feeding the output of one layer to the next. We call the stacked architecture a Deep Language Network - DLN
MIT License
87 stars 12 forks source link

Add wandb support #13

Closed sordonia closed 10 months ago

matheper commented 11 months ago

Looks good to me. We can merge this PR as is and work on a callback system for the different loggers.