fijareplous / qbb2024-answers

0 stars 0 forks source link

Day 2 Afternoon Exercise #4

Open schidambaran opened 2 months ago

schidambaran commented 2 months ago
Exercise Description Completion
Q1 grep.py completed Yes
sys.argv indexed Yes
Text file processed Yes
Newline characters removed Yes
Q2 gtf2bed.py completed Yes
Chromosome, start coordinate, stop coordinate printed Yes
gene_name cleaned up and printed Yes
Q3 tally-fixed.py completed Yes
Bug 1 fixed Yes
Bug 2 fixed Yes
Bug 3 fixed Yes

Great work!

Comments: - In grep.py, it doesn't seem like you're checking if a particular value is in each line of the file. Consider adding another command line argument for the value of interest, and using in to see if it's in each line. - In gtf2bed.py, the "gene_name" text is not being removed.

fijareplous commented 2 months ago

When I run gtf2bed.py on the genes.gtf file we made during class, the "gene_name" string is not printed out, e.g.:

chrM    10760   12137   MT-ND4
chrM    12138   12206   MT-TH
chrM    12207   12265   MT-TS2
chrM    12266   12336   MT-TL2
chrM    12337   14148   MT-ND5
chrM    14149   14673   MT-ND6
chrM    14674   14742   MT-TE
chrM    14747   15887   MT-CYB
chrM    15888   15953   MT-TT
chrM    15956   16023   MT-TP

Please let me know if there's anything I need to change!

schidambaran commented 2 months ago

Thank you! Updated the rubric, you are at 100% completion now!