GeneMANIA / pipeline

GeneMANIA data processing pipeline
1 stars 1 forks source link

missing attribute names and descriptions #28

Open kzuberi opened 9 years ago

kzuberi commented 9 years ago

Only attribute ids such as IPR016132 for InterPro appear to be making it as far as generic_db.

kzuberi commented 9 years ago

The attribute processing scripts are expecting two column (attribute_id, attribute_description) files but the input is actually a triplet of (attribute_id, attribute_name, attribute_description). This causes a join during processing to produce an empty set of descriptions.

I've committed a fix, but need to update the documentation. Needs testing.

kzuberi commented 9 years ago

@haroldr, let me know if this works out, thanks.