deep-diver / llamaduo

This project showcases an LLMOps pipeline that fine-tunes a small LLM so it can take over if the service LLM has an outage.
https://huggingface.co/papers/2408.13467
Apache License 2.0
286 stars 29 forks

add notebook on data analysis #3

Closed. deep-diver closed this 7 months ago

deep-diver commented 7 months ago

initial look into the dataset

Looks like the category for most entries is None, which makes that field useless as it stands. We may need to label the entries with another model (a specialized classifier or an LLM). What do you think?
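
For reference, a minimal sketch of the kind of check behind that observation. It assumes the dataset can be read as a list of dict rows with a `category` key; the helper name `category_distribution` and the toy rows are made up for illustration and are not taken from the notebook or the real data.

```python
from collections import Counter


def category_distribution(rows, key="category"):
    """Return (counts, missing_share) for `key` across a list of dict rows.

    Rows that lack the key are counted under None, the same as rows whose
    value is explicitly None.
    """
    counts = Counter(row.get(key) for row in rows)
    total = sum(counts.values())
    missing_share = counts[None] / total if total else 0.0
    return counts, missing_share


# Toy rows for illustration only; not taken from the real dataset.
toy_rows = [
    {"prompt": "a", "category": None},
    {"prompt": "b", "category": "Coding"},
    {"prompt": "c", "category": None},
    {"prompt": "d", "category": None},
]

counts, missing_share = category_distribution(toy_rows)
print(counts.most_common())
print(f"missing share: {missing_share:.0%}")
```

If the missing share is high, the rows with a None category are the ones that would need labels from another model.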

sayakpaul commented 7 months ago

Looks good but including some plain English observations would be nice too!