DEIB-GECO / GMQL

GMQL - GenoMetric Query Language
http://www.bioinformatics.deib.polimi.it/geco/
Apache License 2.0

GenoMetric Query Language (GMQL) Engine

Join the chat at https://gitter.im/DEIB-GECO/GMQL

A GMQL script is expressed as a sequence of GMQL operations with the following structure:

<dataset> = operation(<parameters>) <datasets>

where each dataset is a Genomic Data Model (GDM) dataset. Operations are either unary (one input dataset) or binary (two input datasets), and each operation constructs exactly one result dataset.
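
To make the statement structure concrete, here is a minimal Python sketch that splits one statement of the form shown above into its parts and reports whether it is unary or binary. It is only an illustration and not part of the GMQL engine: the names `parse_statement` and `Statement`, the regular expression, the optional trailing semicolon, and the dataset names in the usage example are all assumptions made for this sketch.

```python
import re
from dataclasses import dataclass

# Hypothetical pattern for: <dataset> = operation(<parameters>) <datasets>
# An optional trailing semicolon is tolerated (an assumption of this sketch).
STATEMENT = re.compile(
    r"^\s*(\w+)\s*=\s*(\w+)\s*\((.*)\)\s*([\w\s]+?)\s*;?\s*$"
)


@dataclass
class Statement:
    output: str       # name of the result dataset
    operation: str    # operation name
    parameters: str   # raw text between the parentheses
    inputs: list      # names of the input datasets

    @property
    def arity(self):
        # Operations take one input (unary) or two inputs (binary).
        return {1: "unary", 2: "binary"}.get(len(self.inputs), "invalid")


def parse_statement(line):
    """Split one statement into output, operation, parameters and inputs."""
    match = STATEMENT.match(line)
    if match is None:
        raise ValueError(f"not a '<dataset> = operation(...) <datasets>' statement: {line!r}")
    output, operation, parameters, datasets = match.groups()
    return Statement(output, operation, parameters.strip(), datasets.split())


if __name__ == "__main__":
    # Dataset names below are placeholders, not real repository datasets.
    for text in ("PEAKS = SELECT(cell == 'HeLa') ENCODE_DATA;",
                 "RES = MAP() GENES PEAKS;"):
        stmt = parse_statement(text)
        print(stmt.output, stmt.operation, stmt.arity, stmt.inputs)
```

The sketch checks only the outer shape of a statement. It does not validate operation names or parameter syntax, which are covered by the language documentation referenced below.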

For a quick start, please refer to:

Installation Guide

For detailed GMQL language documentation:

GMQL Language Commands and documentation.

For an overview of the GDMS architecture:

Engine architecture and deployments.

For importing the GDMS kernel JARs into Scala applications and scripting GMQL programmatically in Scala:

Scripting GMQL programmatically.

For more information about the GDMS repository architecture and the repository manager:

Repository Manager

The GDMS repository is built around the notion of a dataset. For more information about the data module and the GDM dataset architecture:

GDM DataSet architecture.

A shell API is provided for the GDMS repository to list, add, delete, and alter datasets:

Repository Manager shell API

The first step in the installation is to understand the engine configurations. There are currently two sets of configurations: one for the repository and one for the executor.

Engine Configurations.