oss-slu / Enhancing-Bioinformatics-Research-through-LLM

0 stars 0 forks source link

Develop a simple data preprocessing pipeline for a specific dataset. #3

Open AjithAkuthota23 opened 3 weeks ago

AjithAkuthota23 commented 3 weeks ago

Create a basic data preprocessing pipeline for a specific bioinformatics dataset to prepare it for LLM training. The pipeline should include steps for data cleaning, tokenization, and formatting

kungfuchicken commented 3 weeks ago

this is a big rock