WeiWei060512 / NUMTs-detection

Detecting NUMTs from WGS
11 stars 2 forks source link

Question about searchBreakpoint_fromblatoutputs.py #2

Open zqlns opened 1 year ago

zqlns commented 1 year ago

Dear author, I have been reading your code carefully recently. I am very confused with the following questions about searchBreakpoint_fromblatoutputs.py. Could you tell me why? Thank you for your time.

1、Why subtract 10 instead of other numbers in "df1 = df_input[(df_input['matchLEN'] < (readLEN - 10)) & (df_input['misMatch'] <= mismatchLEN)]" 2、What is the purpose of filtering out df2? I output df1 and df2 respectively, and there is no difference. 3、what is the meaning of mtLeft in the column of "group" like "df_mtLeftTstartG['Group'] = "mtLeft""

notthathime commented 1 year ago

HI! I got different outputs of df1 and df2 ( excluding chrUN ). But I am also curious why setting df1 and df2.