deepcelllineage / mitolin

Use mitochondrial sequence from single cells to determine cell lineage relationships
BSD 3-Clause "New" or "Revised" License
2 stars 4 forks source link

mis-matched r1 and r2 #1

Closed Deena-B closed 5 years ago

Deena-B commented 5 years ago

Problem

There are two txt files, named r1_list.txt and r2_list.txt here: https://github.com/deepcelllineage/mitolin/tree/master/generated/nguyen_nc_2018/ind2/

R stands for read. r1 is the first read (aka the sequence as read from the forward direction) and r2 is the second read (aka the sequence as read from the reverse direction).

Each file holds a list of filenames and the filenames should match, but a line count revealed that there is one extra filename in one of the lists.

We need to iterate through these lists, and generate new lists that have the filenames that are paired and don't have the filenames that do not have a matching pair.

Output

sergeman commented 5 years ago

@mkeisenbach, You are the first person to submit a pull request to this repo! Kudos and thank you for the solution! @Deena-B we need a better file structure for the code and data. Right now @mkeisenbach code and data sit in the root directory, we would need to refactor that into proper places. After that we should be able to close the issue, isn't it?

sergeman commented 5 years ago

moved to data folder