statonlab / hardwoods_site

Hardwoods Genomics bugs, data loading, and general issues
GNU General Public License v3.0
2 stars 1 forks source link

Hydrangea macrophylla #604

Closed RaymondS1 closed 3 years ago

RaymondS1 commented 3 years ago

Publication and Data Information

https://academic.oup.com/dnaresearch/advance-article/doi/10.1093/dnares/dsaa026/5974207

Additional Information

Link to genome (HMA_r1.2_1): https://plantgarden.jp/en/list/t23110/genome/t23110.G003

Checklist

See New Genome Documentation for detailed instructions.

RaymondS1 commented 3 years ago

Organism created: https://hardwoodgenomics.org/bio_data/4302783?tripal_pane=group_summary_tripalpane

Old Organism: https://hardwoodgenomics.org/organism/Hydrangea/macrophylla?tripal_pane=group_summary_tripalpane

RaymondS1 commented 3 years ago

Swissprot

#PBS -N swissprot_BLAST
#PBS -S /bin/bash
#PBS -j oe
#PBS -A ACF-UTK0011
#PBS -t 1-200
#PBS -l nodes=1:ppn=2
#PBS -l walltime=12:00:00

cd $PBS_O_WORKDIR

module load blast

blastx \
 -query /lustre/haven/proj/UTK0032/projects/undergrads/hydrangea_macrophylla/raw_data/BLAST_splits/HMA_r1.2_1.cds.fasta.$PBS_ARRAYID \
 -db /lustre/haven/proj/UTK0032/library/uniprot/uniprot_sprot.fasta \
 -out /lustre/haven/proj/UTK0032/projects/undergrads/hydrangea_macrophylla/BLAST/swissprot/macrophylla_swissprot.$PBS_ARRAYID.xml \
 -evalue 1e-5 \
 -outfmt 5
RaymondS1 commented 3 years ago

Trembl

#PBS -N trembl_BLAST
#PBS -S /bin/bash
#PBS -j oe
#PBS -A ACF-UTK0011
#PBS -t 1-200
#PBS -l nodes=1:ppn=2
#PBS -l walltime=12:00:00

cd $PBS_O_WORKDIR

module load blast

blastx \
 -query /lustre/haven/proj/UTK0032/projects/undergrads/hydrangea_macrophylla/raw_data/BLAST_splits/HMA_r1.2_1.cds.fasta.$PBS_ARRAYID \
 -db /lustre/haven/proj/UTK0032/library/uniprot/uniprot_trembl_plants_July_2018.fasta \
 -out /lustre/haven/proj/UTK0032/projects/undergrads/hydrangea_macrophylla/BLAST/trembl/macrophylla_trembl.$PBS_ARRAYID.xml \
 -evalue 1e-5 \
 -outfmt 5
RaymondS1 commented 3 years ago

IPS

#PBS -N hydrangea_ips
#PBS -A ACF-UTK0011
#PBS -S /bin/bash
#PBS -t 1-500
#PBS -j oe
#PBS -l nodes=1:ppn=4
#PBS -l walltime=18:00:00

cd $PBS_O_WORKDIR

module load python3

/lustre/haven/proj/UTK0032/software/interproscan-5.34-73.0/interproscan.sh \
 -i /lustre/haven/proj/UTK0032/projects/undergrads/hydrangea_macrophylla/raw_data/IPS_splits/HMA_r1.2_1.protein.fasta.$PBS_ARRAYID \
 -f XML \
 -d /lustre/haven/proj/UTK0032/projects/undergrads/hydrangea_macrophylla/IPS/xmls \
 --disable-precalc \
 --iprlookup \
 --goterms \
 --pathways \
 --tempdir /lustre/haven/proj/UTK0032/projects/undergrads/hydrangea_macrophylla/IPS/TMP \
 > /lustre/haven/proj/UTK0032/projects/undergrads/hydrangea_macrophylla/IPS/TMP/$PBS_ARRAYID.out
RaymondS1 commented 3 years ago

Publication Created https://hardwoodgenomics.org/Publication/4302784

RaymondS1 commented 3 years ago

Reference Genome https://hardwoodgenomics.org/Genome-assembly/4302785

RaymondS1 commented 3 years ago

Files to be loaded to the site

CDS: /lustre/haven/proj/UTK0032/projects/undergrads/hydrangea_macrophylla/raw_data

Protein: /lustre/haven/proj/UTK0032/projects/undergrads/hydrangea_macrophylla/raw_data/HMA_r1.2_1.protein.fasta

Swissprot /lustre/haven/proj/UTK0032/projects/undergrads/hydrangea_macrophylla/BLAST/swissprot/xml

Trembl /lustre/haven/proj/UTK0032/projects/undergrads/hydrangea_macrophylla/BLAST/trembl/xml

IPS /lustre/haven/proj/UTK0032/projects/undergrads/hydrangea_macrophylla/IPS/xmls

noah-77 commented 3 years ago

FTP downloads available at https://hardwoodgenomics.org/Genome-assembly/4302785?tripal_pane=group_downloads