Bioconductor / BSgenome

Software infrastructure for efficient representation of full genomes and their SNPs
https://bioconductor.org/packages/BSgenome
7 stars 9 forks source link

Proposed task for Outreachy applicants: Forge BSgenome data package for UCSC genome xenTro10 #40

Closed hpages closed 1 year ago

hpages commented 1 year ago

This task depends on this issue being completed first (i.e. PR accepted and merged, and issue closed). Although it's not a requirement that the 2 tasks be completed by the same applicant, it will be a more interesting learning experience if they are.

BSgenome data packages are one of the many types of annotation packages available in Bioconductor. They contain the genomic sequences, which comprise chromosome sequences and other DNA sequences, of a particular genome assembly for a given organism. For example BSgenome.Hsapiens.UCSC.hg19 is a BSgenome data package that contains the genomic sequences of the hg19 genome from UCSC. Users can easily and efficiently access the sequences, or portions of the sequences, stored in these packages, via a common API implemented in the BSgenome software package.

This task's goal is to make a new BSgenome data package for UCSC genome xenTro10. The process of making such package is documented in the "How to forge a BSgenome data package" vignette from the BSgenome software package. The landing page for the BSgenome package contains a link to this vignette.

Other useful links:

IMPORTANT NOTES TO OUTREACHY APPLICANTS:

Simplecodez commented 1 year ago

Please assign me to this task.

Simplecodez commented 1 year ago

Thank you

Simplecodez commented 1 year ago

I have been able to run forgeBSgenomeDataPkg(), R CMD build, R CMD check with 3 notes and R CMD INSTALL successfully and thus have created a PR. I will be anticipating your feedback. Thank you.

hpages commented 1 year ago

Hi @Simplecodez ,

You're BSgenome.Xtropicalis.UCSC.xenTro10-seed file looks good. Nice work!

Here's some important reading about package templates and the PkgExamples field: https://github.com/Bioconductor/BSgenome/pull/46#issuecomment-1291424086 Don't hesitate to ask if you have any questions.

Next task in your group is #41. It's still about Frog! :frog: Whenever you are ready, go there and ask to be assigned.

Simplecodez commented 1 year ago

Okay, thank you sir