homeveg / nuctools

software for analysis of chromatin feature occupancy profiles from high-throughput sequencing data
GNU General Public License v3.0

aggregate_profile.pl: duplicate GeneID issue #12

Open epigenereg opened 4 years ago

epigenereg commented 4 years ago

TLDR: Change the default settings to NOT remove regions with duplicate GeneIDs

GeneID is an optional parameter that is missing from most kinds of our input files. By default the script removes regions with duplicate GeneIDs, which is very problematic when the input file has no GeneID column and the user selects some other column as the GeneID, e.g. the coordinate column. In this case, regions with non-unique coordinates get filtered out.
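To illustrate the problem, here is a minimal sketch in Python. It is not the actual code from aggregate_profile.pl; the function `filter_regions`, its `remove_duplicates` flag, and the sample regions are all made up for illustration. It assumes the filter keeps only the first region per value of whichever column is selected as the ID.

```python
def filter_regions(rows, id_col, remove_duplicates=True):
    """Keep the first row per value of id_col when remove_duplicates is set.

    Hypothetical stand-in for the duplicate-ID filter described above,
    not the script's real implementation.
    """
    if not remove_duplicates:
        return list(rows)
    seen = set()
    kept = []
    for row in rows:
        key = row[id_col]
        if key in seen:
            continue  # a later row with an already-seen ID is dropped
        seen.add(key)
        kept.append(row)
    return kept


# Made-up input without a GeneID column: chromosome, start coordinate, strand.
regions = [
    ("chr1", 1000, "+"),
    ("chr1", 1000, "-"),
    ("chr2", 1000, "+"),
    ("chr2", 5000, "+"),
]

# Using the coordinate column (index 1) as the ID.
default_result = filter_regions(regions, id_col=1)
proposed_result = filter_regions(regions, id_col=1, remove_duplicates=False)

print(len(default_result), len(proposed_result))  # prints "2 4"
```

With duplicate removal on, only two of the four regions survive, even though all four are distinct regions. With it off, all four are kept, which is the behavior I am proposing as the default.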