IBMStreams / administration

Umbrella project for the IBMStreams organization. This project will be used for the management of the individual projects within the IBMStreams organization.
Other
19 stars 10 forks source link

Proposal for new repository: streamsx.graphdb #60

Closed apyasic closed 8 years ago

apyasic commented 9 years ago

I am proposing to create a project for integration with graph databases.

Graph databases are widely used in a lot of areas: content management and access control, bioinformatics, cyber security, network asset management, public transportation, social applications and many others. Bringing graph databases to Streams will allow better implementation of applications dealing with inter-connected data sets in areas above.

Technically, such an integration will allow to run queries in a real-time for sub-graph pattern matches, manage/persist continuously evolving inter-connected data sets in real-time, apply graph mining algorithms in a real-time, etc.

As a first step I propose an integration with IBM System G Native Graph Store: http://systemg.research.ibm.com/db-nativestore.html

chanskw commented 9 years ago

+1... think it will be useful to add in this toolkit.

Would the initial contribution contains a source and sink operator for reading and writing to System G? What else will be included in the contribution?

apyasic commented 9 years ago

The initial version is about to include native functions for CRUD operations on graphs.

chanskw commented 9 years ago

Only received one vote on this. Any objection to creating this repository?
Propose to close voting by May 29.

mikespicer commented 9 years ago

How can a customer get System G and what support is available for them?

apyasic commented 9 years ago

The first toolkit version will contain System G Native Store capabilities. The Native Store itself is about to be integrated as a shared library into a toolkit, so it would be transparent to a customer. In fact, the toolkit design will allow to swich graph databases if required with no changes in SPL code. Couple of other native candidates for this toolkit are Neo4J and Titan.

hildrum commented 9 years ago

+1

mikespicer commented 9 years ago

+1

ddebrunner commented 9 years ago

+1

cancilla commented 9 years ago

+1

leongor commented 9 years ago

+1

chanskw commented 9 years ago

Who should we assign as initial committers to this project?

apyasic commented 9 years ago

Leonid Gorelik and myself at this point.

chanskw commented 9 years ago

Closing... created streamsx.graphdb repository.

ddebrunner commented 9 years ago

@apyasic I had someone interested in this toolkit yesterday but there's been no activity since its creation, is there likely to be a contribution or should the repository be deleted?

apyasic commented 9 years ago

@ddebrunner the toolkit is work in progress. The first release is about to be published till the end of October.

ddebrunner commented 9 years ago

Any reason the development can't be done in the open?

apyasic commented 9 years ago

The key reason for delay was in finding a way to publish SystemG libraries as a part of the toolkit which appears to be non-trivial from IP perspective. The best compromise we've found is to publish open source (tinkerpop/blueprint based toolkit version first and in parallel continue with a SystemG direction. As we initially started with SystemG some extra efforts are required to port a toolkit functionality to pure tinkerpop/blueprint implementation.

mikespicer commented 8 years ago

Integration with a graph DB has come up from a couple of Streams customers recently. I have also heard that IBM is hoping to standardize on tinkerpop as the query api for graph db implementations. Is there any development happening on this toolkit and could it be done in the open with tinkerpop?

apyasic commented 8 years ago

@mikespicer the development is in progress. The sources would be published soon.

chanskw commented 8 years ago

I am closing this. Please continue discussion about graph DB in its repository.

Thanks!