oss-slu / Enhancing-Bioinformatics-Research-through-LLM

Apache License 2.0

Develop a simple data preprocessing pipeline for a specific dataset. #3

Open AjithAkuthota23 opened 2 months ago

AjithAkuthota23 commented 2 months ago

Create a basic data preprocessing pipeline for a specific bioinformatics dataset to prepare it for LLM training. The pipeline should include steps for data cleaning, tokenization, and formatting.
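
As a starting point, the three steps could be sketched roughly as below. The issue does not name a dataset, so this assumes DNA sequences in FASTA format, overlapping k-mer tokenization, and one JSON object per line as the output format. All function names (`parse_fasta`, `clean_sequence`, `kmer_tokenize`, `preprocess`) and the choice of `k = 3` are hypothetical and only for illustration.

```python
import json
from typing import Iterator, List, Tuple

# Assumption: the dataset is DNA, so only these four bases are kept.
VALID_BASES = set("ACGT")


def parse_fasta(text: str) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from FASTA-formatted text."""
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def clean_sequence(seq: str) -> str:
    """Cleaning: uppercase and drop every character that is not A, C, G, or T."""
    return "".join(ch for ch in seq.upper() if ch in VALID_BASES)


def kmer_tokenize(seq: str, k: int = 3) -> List[str]:
    """Tokenization: overlapping k-mers with a stride of 1."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def preprocess(fasta_text: str, k: int = 3) -> List[str]:
    """Formatting: return one JSON string per record (JSON Lines style)."""
    records = []
    for header, raw in parse_fasta(fasta_text):
        seq = clean_sequence(raw)
        if len(seq) < k:
            continue  # too short to produce a single k-mer
        parts = header.split()
        record_id = parts[0] if parts else "unknown"
        records.append(json.dumps({"id": record_id, "tokens": kmer_tokenize(seq, k)}))
    return records


if __name__ == "__main__":
    sample = ">seq1 demo\nacgtnn\nAC\n>seq2\nNN\n"
    for line in preprocess(sample):
        print(line)
```

With the sample input, `seq1` is cleaned to `ACGTAC` and split into four 3-mers, while `seq2` is dropped because nothing is left after cleaning. A real pipeline would likely swap the k-mer step for whatever tokenizer the target LLM expects.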

kungfuchicken commented 2 months ago

this is a big rock