Open docmanny opened 3 years ago
There are many studies that have generated PopGen data for Myotis lucifugus, including (in no particular order):
Of these papers, only Davy et al (2017) gives estimates of Ne. Using the Fst estimates in the other papers to calculate Theta and Ne and a variety of commonly-used mutation rate estimates, we get Ne's in the 10^6 - 10^9 range...
The individual used for the MyoLuc2 genome is from Framingham, MA according to the Biosample: https://www.ncbi.nlm.nih.gov/biosample/SAMN02953645/.
We are settling on the Davy et al (2017) population size of 1682.9 (794.0–261 726.6) based on their sequences from Eastern Canada, as it is the only reasonable Ne estimate close to Massachusetts.
For Generation time, these are some of the citations we found:
Everyone seems to cite Humphrey and Cope (1976) as saying 2 years for generational time; the closest we find in that publication is a table that says that females are reproductively mature at 1 y/o and live a max lifespan of 2 years. The Frick et al (2010) paper also mentions 2 years through their analysis.
(Note for posterity, all comments here are alongside @lauterbur )
Mutation Rate:
Ray et al (2008) calculated a neutral mutation rate of 2.366 × 10^−9 per site per year by aligning 4 introns from genes in a variety of species, then use the phylogeny and a site model of TIM+G (Γ = 1.2597) to arrive at that estimate.
Ensemble link to MyoLuc2: https://uswest.ensembl.org/Myotis_lucifugus/Info/Annotation
Citation for MyoLuc2:
Recombination rate:
There are no published Recomb Rates or maps for M. lucifugus as far and as wide as we have seen. A recombination rate could be calculated if we had locus SNP data for a contiguous scaffold of M. lucifugus, however, that data does not yet exist. However, this data does exist for M. myotis. We can settle on some default mammalian recombination rate once we find such a citation.
Bats' closest relatives are horses and cows. Recombination rates for these species work out to be ~10^-8 according to Dumont and Payseur (2008). As such, we will use a default of 10^-8 for the recombination rate as the default for now.
These people estimated a mutation-to-recombination rate ratio for a horseshoe bat: https://onlinelibrary.wiley.com/doi/full/10.1111/j.1365-294X.2006.02879.x
I think they just use mtDNA though.
Description: The Little Brown Bat, Myotis lucifugus, was one of the first non-human mammals sequenced by the NIH Mammalian Genome Project. The Ensembl accession ID is Myoluc2.0 (GCA_000147115.1).
Note: Maybe you don't know a lot of the things below - but, that's OK! The criteria are that (a) the choice is reasonable, and (b) the rationale is clear. What we want is a "reasonable estimate" (or, "best guess") - if you were to use a number in a paper, what would you use and how would you justify it to your collaborators? For instance, for "generation time", best would be to cite a paper that actually measures or estimates generation time. But most species don't have this; next best would be to give a number used in the literature, and provide a citation that used it (and hopefully the publication gives a justification also). Lacking this, you might provide a number from a related species (and a citation for this).
Here is the checklist of things that we need to add a species to the catalogue. Each thing should be provided, with a justification (maybe short) and a citation.
Demographic information:
Chromosome structure:
Note: The assembly should be chromosome-level, i.e., not composed of thousands of scaffolds.
Note: species in Ensembl can be found in one of these lists: vertebrates, "metazoa", plants, fungi, protists, or bacteria.
Recombination rates:
Mutation rate:
Demographic model:
Other information:
These are things we don't currently use, but will want to use in the future: