Open finnless opened 2 weeks ago
Two things are not clear to me.
What does "Design and conduct experiments to evaluate the model's performance" mean? Can you be more specific? Relatedly, CI and evaluation feel like very different concepts and probably shouldn't be lumped together in the same category.
It's not clear to me what the entire "Dataset Management" section is for. This seems like something that should be out of scope for this library and another library is used to load data, but I could be misunderstanding.
For publicizing: Does it make sense for this to be on pypi?
Finally, we should chat sometime about how to improve your writing. You have a lot of spurious adjectives in your rubric and this makes your writing less credible.
Title: Build open-text social survey response coder library Team: Nolan Project Description: This project is a open source library that uses language models to code open-ended response questions often used in public opinion polling. The target audience for this project social science researches so the library will be easy to use. There will be features for setting a codebook, inputing survey responses, and validating the classifiers accuracy on manually label ground truth test data.
Rubric:
- Develop a comprehensive
README.md
that includes clear instructions for setting up the environment, installing dependencies, and running the model. Ensure the setup process is smooth and reproducible.- Implement the core model class to handle text classification tasks. Ensure the model can process datasets and return accurate predictions with or without confidence scores.
- Develop a robust dataset manager to handle data loading, sampling, and hashing. Ensure the manager can efficiently process both valid and invalid data inputs.
- Design and conduct experiments to evaluate the model's performance. This includes setting up test cases, running the model, and collecting results. Integrate these tests into a CI/CD pipeline to ensure all functionality is tested automatically on new commits.
- Implement comprehensive error handling and validation mechanisms for both the model and dataset components. Ensure that appropriate exceptions are raised and handled gracefully.
- Maintain high-quality documentation throughout the codebase. This includes clear docstrings and inline comments.
- Share the project by writing a blog post, posting to Hacker News, and sharing on r/LocalLlama. Include a detailed project description and usage instructions in each post.