oss-slu / Enhancing-Bioinformatics-Research-through-LLM

Apache License 2.0

Collect and preprocess a small sample dataset of bioinformatics code snippets #2

Open AjithAkuthota23 opened 1 month ago

AjithAkuthota23 commented 1 month ago

Identify relevant sources for the dataset (e.g., open-source bioinformatics projects, research papers). Preprocess the data by tokenizing, removing unnecessary characters, and formatting for LLM input.
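
As a starting point for discussion, the preprocessing steps above could be sketched roughly as follows. This is only a minimal sketch under several assumptions that are not part of the issue: the snippets are Python source, "unnecessary characters" means control characters, trailing whitespace, and blank lines, and "formatting for LLM input" means one JSON object per line (JSONL). The function names (`clean_snippet`, `tokenize_snippet`, `to_record`) and the record fields (`source`, `text`, `tokens`) are hypothetical.

```python
import io
import json
import tokenize
from typing import List

# Token types dropped in this sketch (an assumption: comments and layout-only
# tokens are treated as noise; the project may decide to keep comments).
_SKIPPED_TYPES = {
    tokenize.COMMENT, tokenize.NL, tokenize.NEWLINE,
    tokenize.INDENT, tokenize.DEDENT, tokenize.ENDMARKER,
}


def clean_snippet(source: str) -> str:
    """Normalize line endings and drop control characters, trailing
    whitespace, and blank lines. Indentation is preserved."""
    source = source.replace("\r\n", "\n").replace("\r", "\n")
    lines = []
    for line in source.split("\n"):
        # Keep tabs (indentation) and printable characters only.
        line = "".join(ch for ch in line if ch == "\t" or ch.isprintable())
        line = line.rstrip()
        if line:
            lines.append(line)
    return "\n".join(lines) + "\n"


def tokenize_snippet(source: str) -> List[str]:
    """Split Python source into lexical tokens with the stdlib tokenizer.
    Raises tokenize.TokenError on incomplete snippets (e.g. unclosed brackets)."""
    reader = io.StringIO(source).readline
    return [
        tok.string
        for tok in tokenize.generate_tokens(reader)
        if tok.type not in _SKIPPED_TYPES
    ]


def to_record(source: str, origin: str) -> str:
    """Format one snippet as a JSONL line (hypothetical record layout)."""
    cleaned = clean_snippet(source)
    record = {
        "source": origin,
        "text": cleaned,
        "tokens": tokenize_snippet(cleaned),
    }
    return json.dumps(record)
```

Each call to `to_record(snippet, origin)` yields one line that can be appended to a `.jsonl` file. Snippets in other languages common in bioinformatics (R, Perl, shell) would need a different tokenizer, and a model-specific subword tokenizer would likely replace the lexical one before training.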

kungfuchicken commented 1 month ago

this is a big rock