green-striped-gecko / dartR

Importing and Analysing DArT type snp and silicodart data
GNU General Public License v3.0
31 stars 21 forks source link

dartR

An accessible genetic analysis platform for conservation, ecology and agriculture

Main repository: CRAN
checks R-CMD-check R-CMD-check-beta

Publication:

Zenodo: DOI

Dev repositories: R-CMD-check-dev R-CMD-check-dev_Arthur R-CMD-check-dev_Bernd R-CMD-check-dev_Luis R-CMD-check-dev_Carlo

Overview

dartR is a user-friendly R package that delivers a variety of analyses and pipelines on the same platform, as well as excellent user support via high-quality tutorials and documentation.

dartR is a collaboration between the University of Canberra, CSIRO, Diversity Arrays Technology and Monash University , and is supported with funding from the ACT Priority Investment Program, CSIRO and the University of Canberra.

               

Installation

dartR is on CRAN, so to install it simply type:

install.packages("dartR")

Because of CRAN limitations, the default installation does not include all packages needed to run all the functions; hence, specialised analyses need the installation of additional packages. The following code guarantees that all necessary packages for all dartR analyses are set up and ready to go.

gl.install.vanilla.dartR()

You can install the development version of dartR from GitHub with:

gl.install.vanilla.dartR(flavour = "dev")

Please consult this installation tutorial if you run into any problems during setup.

Usage

dartR provides several functions for handling all the steps involved in genetic data analysis, from reading multiple data input formats to manipulating, filtering, exploring and analysing the data.

We use the prefix ‘gl’ in function names to acknowledge the use of the genlight object from package adegenet as our input format.

In most cases, the following term in the function name indicates a subset of functions.

Similarly, in most cases the first function parameter (input) is the genlight object.

For instance, you might use the code below to generate a report and then filter your data based on the percentage of missing data:

library(dartR)
test <- platypus.gl
rep <- gl.report.callrate(test)
test_1 <- gl.filter.callrate(test)

Getting started

  1. Are you a R rookie? If you want to learn R and RStudio without any fuss, have a look at our R-refresher tutorial.

  2. Let’s get started by reading your genetic data into dartR; if you have DArT data, follow this tutorial; if not, follow this one.

  3. Checking out our data manipulation tutorial is the easiest way to get your feet wet with dartR.

  4. This tutorial will provide some pointers on how to filter your data effectively, an important step that depends on making sound threshold assessments.

  5. Check out our simulations tutorial to learn more about our simulation model, a powerful tool for illuminating intricate evolutionary and genetic processes.

  6. In more advanced topics, check our technical notes on Genetic Distances and their Visualization in Population Genetics and Fixed Difference Analysis.

Getting help

  1. Google groups Q&A forum in support of users can be accessed here.

  2. The RStudio community provides a welcoming environment in which to ask any inquiry.

  3. Answers to frequently asked topics may usually be found on Stack Overflow.

Contribute

If you want to help shape the future of dartR, this tutorial is for you.

Citation

Please acknowledge dartR if you use it in your study. Copy and paste the following code to the R console to retrieve the citation information:

citation("dartR")

Check out our articles and our awards.

Have fun working with dartR!

Cheers,

Bernd, Arthur, Luis, Carlo, Olly & Diana