Superzchen / iLearnPlus

iLearnPlus is the first machine-learning platform with both graphical- and web-based user interface that enables the construction of automated machine-learning pipelines for computational analysis and predictions using nucleic acid and protein sequences.
91 stars 33 forks source link

Show a warning if special fasta headers format is violated #2

Open kim-fehl opened 3 years ago

kim-fehl commented 3 years ago

In a large dataset of automatically downloaded sequences there can be names including "|" symbol. I concatenate class and train/test labels also automatically. So, when I try to analyze this file, there are uninformative error messages like:

which are caused by incorrect fasta headers:

A simple check when importing the file could show a warning to the user.

Superzchen commented 3 years ago

Thank you for your suggestion. We will deal with this problem as soon as possible.

li6in9muyou commented 1 year ago

I added a helpful error message for ill-formed headers in PR #3.