iobis / sequence-search

0 stars 1 forks source link

OBIS sequence data #9

Closed humbleflowers closed 1 week ago

humbleflowers commented 1 week ago

Hello @pieterprovoost

Is there a way i can download underlying nucleotide fasta database from OBIS to do analysis?

Thank you.

pieterprovoost commented 1 week ago

@humbleflowers This can be done using the robis R package, but it can be fairly slow to get the entire database. Let me know if you would like to have a full export.

This is an example for just the most recent data (2022 onwards):

library(robis)

occ <- occurrence(hasextensions = "DNADerivedData", extensions = "DNADerivedData", startdate = "2022-01-01")
dna <- unnest_extension(occ, "DNADerivedData", fields = c("scientificName", "decimalLongitude", "decimalLatitude", "eventDate", "datasetName", "materialSampleID"))
humbleflowers commented 1 week ago

Hello @pieterprovoost , Thanks for the reply. I would like to have a full export. Do you have FTP? and Any idea how big will be the FASTA file?

Thank you.

humbleflowers commented 1 week ago

@pieterprovoost , i got the files i needed through R commands. Thank you.