pachterlab / seqspec

machine-readable file format for genomic library sequence and structure
MIT License
114 stars 17 forks source link

expand sequence in schema.json for IUPAC nucleotides #19

Closed chooliu closed 1 year ago

chooliu commented 1 year ago

Expands seqspec.schema.json to incorporate IUPAC codes for nucleotide bases beyond ACGTNX for precision.

Example case: A sequencing library containing an artifactual region of either a cytosine or thymine. Can be represented more precisely by IUPAC code "Y" (pYrimidine) instead of "N" (aNy base).

Ref: genome.ucsc.edu/goldenPath/help/iupac.html

sbooeshaghi commented 1 year ago

This is a great addition! Thank you for the PR- I've just merged.