bio4j / bio4j-titan

Titan-specific bio4j implementation
https://github.com/bio4j/bio4j
6 stars 2 forks source link

Bio4j Titan

This is the Titan DB distribution of Bio4j.

Titan is a highly scalable graph database optimized for storing and querying large graphs with billions of vertices and edges distributed across a multi-machine cluster. Titan is a transactional database that can support thousands of concurrent users.

We use here our open source library angulillos-titan which in turn depends on angulillos, a Java 8 library for working with strongly typed graph data in a generic way. This allows us to keep in different layers on the one hand Titan technology-specific stuff and, on the other hand, the logic regarding how to deal with graphs in a generic way. Thanks to that we have an effective and organized way to import and store the data.

In the specific case of Titan, we use Titan types for representing the rich structure of the Bio4j domain model and vertex-centric indices for effective local indexing which allows to avoid the super-node problem.

SBT dependency

To use it in an sbt-project, add the following to libraryDependencies in your build.sbt file:

"bio4j" % "bio4j-titan" % "<version>"

Using a pre-built release in AWS

Please go to this section if you want to use an already pre-built Titan Bio4j database in Amazon Web Services.

Importing Titan Bio4j

Please refer to this section if you want to import your own Titan Bio4j database.