Thomas-Chuang / Naive-Sorted-TEV

Finding overlap/nonoverlap data of two large sets of data with python
1 stars 0 forks source link

find out how to compare two csv file, ad export a csv file with the non-overlap sequences #1

Open Thomas-Chuang opened 3 years ago

Thomas-Chuang commented 3 years ago

We have the Sequence, Frequency, and Enrichment.

In order to calculate the ratio, we need Enrichment from both the Sorted and Naive Library of the same Sequence.
Other than that, we need Enrichment from both libraries to create the Scatter plot.

Also, we need the Non-overlap sequences for further analysis

Thomas-Chuang commented 3 years ago

The issue is solved. I used the set to find the overlapping and non-overlapping sequence. Then the sequence is recorded with the comparison to dictionaries.