zswitten / Antimicrobial-Peptides

Collecting AMP MIC data from different sources, then running a GAN to output promising sequences

Cleaning data for Hemolytik database #12

Open qm-intel opened 2 years ago

qm-intel commented 2 years ago

Hi, thanks for sharing the data and code. I have a few questions.

  1. May I ask which code you used for Hemolytik_data.csv?

This code creates a dictionary of the fields that we want to save in a *.data file.

  2. What about non-hemolytic sequences?
  3. Does a value of 0 for log10_HC50 in Cleaned_hemolytic_data.csv mean non-hemolysis?
  4. How did you decide which activity values count as hemolysis versus non-hemolysis?

Thanks