wind22zhu / BioMedR

generating various molecular representations for chemicals, proteins, DNAs/RNAs and their interactions.
GNU General Public License v3.0
16 stars 14 forks source link

GitHub forks GitHub watchers CRAN/METACRAN GitHub repo size CRAN/METACRAN

Introduction

The BioMedR package offers an R package for generating various molecular descriptors for chemicals, proteins, DNAs/RNAs and their interactions. Also, this package realized some similarity searching methods and powerful clustering methods as well as several useful auxiliary tools, which aims at building integrated analysis pipelines from data acquisition, data checking, and descriptor calculation to modeling. See vignette('BioMedR') for the comprehensive user guide.

Installation

To install the BioMedR package in R, simply type

   install.packages('BioMedR')

or

   install.packages('devtools')
   library(devtools)
   install_github('wind22zhu/BioMedR')

if the installation invokes i386 arch, try to use:

   install_github('wind22zhu/BioMedR', INSTALL_opts="--no-multiarch")

To make the BioMedR package fully functional (especially the Open Babel and rjava related functionalities), we recommend the users some tips:

>>Common installing tips >>How to install rjava

Several dependencies of the BioMedR package may require some system-level libraries, check the corresponding manuals of these packages for detailed installation guides.

Examples

Code snippet 1: example for getting molecules

> require('BioMedR')
> id <- c('DB00859', 'DB00860')
> SMILES <- BMgetDrug(id, 'drugbank', 'smile')
> print(SMILES[1])

Code snippet 2: example for calculating molecular descriptors

> require(BioMedR)
> smi_f <- system.file('vignettedata/test.smi', package = 'BioMedR')
> mol1 <- readMolFromSmi(smi_f, type = 'mol')
> des_drug <- extrDrugALOGP(mol1)
> print(des_drug[2,])

Code snippet 3: example for calculating sequence descriptors

> require(BioMedR)
> dna_seq <- readFASTA(system.file('dnaseq/hs.fasta', package = 'BioMedR'))
> des_seq <- extrDNAkmer(dna_seq[[1]])
> head(des_seq)

Code snippet 4: example for calculating interaction descriptors

> require(BioMedR)
> # results from above snippets
> drug1_mat <- as.matrix(des_drug[2,])
> drug2_mat <- as.matrix(des_drug[3,])
> seq1_mat <- t(as.matrix(des_seq))
> # Calculating
> des_CCI1 <- getCCI(drug1_mat, drug2_mat, type='combine')
> des_CDI1 <- getCDI(drug1_mat, seq1_mat, type='tensorprod')
> # Show result
> colnames(des_CCI1) = paste('CCI', 1:dim(des_CCI1)[2], sep = '_')
> rownames(des_CCI1) = paste('ID', 1:dim(des_CCI1)[1], sep = '_')
> print(des_CCI1)

Features

BioMedR implemented and integrated the state-of-the-art Chem/Bio molecular descriptors and various data modeling analysis functionalities with R.

1) For protein sequences, the BioMedR package could

2) For small molecules, the BioMedR package could:

3) For DNA molecules, the BioMedR package could:

4) 9 kinds of descriptor classes were provided for Proteochemometric (PCM) modeling derived by Principal Components Analysis, Factor Analysis and so on.

5) By combining various types of descriptors for drugs, proteins and DNA in different methods, interaction descriptors representing protein-protein, compound-compound, DNA-DNA, compound-DNA compound-protein and DNA-protein interactions could be conveniently generated with BioMedR, including:

6) 7 specific functionalities for similarity calculation or comparison, and 4 popular clustering analysis algorithms were implemented in BioMedR.

7) Several useful auxiliary utilities are also shipped with BioMedR:

The computed protein sequence descriptors, molecular descriptors/fingerprints, interaction descriptors and pairwise similarities are widely used in various research fields relevant to drug disvery, primarily bioinformatics, chemoinformatics, proteochemometrics and chemogenomics.

Publication

Jie Dong, Min-Feng Zhu, Dong-Sheng Cao, et al. BioMedR: An R/CRAN Package for Integrated Data Analysis Pipeline in Biomedical Study. Briefings in Bioinformatics, submitted.

Contact

The BioMedR package is developed by Computational Biology and Drug Design Group, Central South University, China.