akikuno / DAJIN2

🔬 Genotyping tool for genome-edited samples, utilizing nanopore sequencer target sequencing
MIT License

midsmask: masking with the most frequent base turns heterozygous sites into homozygous ones. #1

Closed akikuno closed 2 years ago

akikuno commented 2 years ago

Problem

Proposed solution

Expected result

Test data



cat << EOF | sort >tmp1.csv
fooa,!!0
foob,!!0
fooc,!00
food,!00
EOF

cat << EOF | sort >tmp2.csv
fooa,M,M,M
foob,M,M,M
fooc,M,S,M
food,M,M,M
EOF

join -t, tmp1.csv tmp2.csv |
  maskMS > tmp_maskMS.csv

cat tmp_maskMS.csv |
  Rscript ./library/03-preprocess/maskMIDS.R

rm tmp1.csv tmp2.csv
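
The effect named in the issue title can be reproduced with a toy example. This is only a sketch under stated assumptions, not the actual `maskMS` or `maskMIDS.R` logic: `mask_by_mode` is a hypothetical helper, the input uses the same `id,quality,calls...` layout as the joined test data above, and a quality character of `!` is assumed to mark a low-quality call. Each low-quality call is replaced by the most frequent high-quality call at the same position.

```shell
# Hypothetical helper (not part of DAJIN2): replace every call whose quality
# character is "!" with the most frequent high-quality call in that column.
# Input: CSV lines of the form id,quality_string,call_1,...,call_n
mask_by_mode() {
  awk -F, -v OFS=, '
    {
      id[NR] = $1; qual[NR] = $2; nf = NF
      for (j = 3; j <= NF; j++) {
        call[NR, j] = $j
        # Count only high-quality calls and track the running mode per column.
        if (substr($2, j - 2, 1) != "!" && ++count[j, $j] > bestn[j]) {
          bestn[j] = count[j, $j]; mode[j] = $j
        }
      }
    }
    END {
      for (i = 1; i <= NR; i++) {
        line = id[i] OFS qual[i]
        for (j = 3; j <= nf; j++) {
          c = call[i, j]
          # Low-quality call: overwrite with the column mode, if one exists.
          if (substr(qual[i], j - 2, 1) == "!" && (j in mode)) c = mode[j]
          line = line OFS c
        }
        print line
      }
    }'
}

# Toy heterozygous site: 3 reads call M and 3 reads call S at one position,
# but three of the calls (r4, r5, r6) are low quality.
printf '%s\n' 'r1,0,M' 'r2,0,M' 'r3,0,S' 'r4,!,S' 'r5,!,S' 'r6,!,M' |
  mask_by_mode
```

Before masking, the site is split 3 M to 3 S. The high-quality calls are M, M, S, so the mode is M and all three low-quality calls become M. The result is 5 M to 1 S, which a downstream genotyper could read as a homozygous site.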
