thegooglecodearchive / cumulusrdf

Automatically exported from code.google.com/p/cumulusrdf
0 stars 0 forks source link

Simple keyword search #35

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
Simple keyword search: just a conjunction of terms tokenised from literals. 

  * Could be done using CQL collections: http://www.datastax.com/documentation/cql/3.0/webhelp/index.html#cql/cql_using/use_collections_c.html#useCollections
  * Lucence/Solr integration
    * Stargate: http://tuplejump.github.io/stargate/index.html //looks cool
    * Lucandra/Solandra: https://github.com/tjake/Solandra //not maintained
    * Datastax Enterprise search(DSE) //not open-source

Original issue reported on code.google.com by andreas.josef.wagner on 12 Feb 2014 at 4:18

GoogleCodeExporter commented 9 years ago
Great topic! I usually work with SOLR so maybe I can give a more substantial 
contribute here

Original comment by a.gazzarini@gmail.com on 13 Feb 2014 at 3:53

GoogleCodeExporter commented 9 years ago
I agree - this is a quite interesting topic. There are not too many (scalable) 
RDF stores with fulltext search.

From my point of view, we could target too kinds of keyword-query support:
(1) Provide a more efficient implementation of FILTER contains() in SPARQL (see 
[1])
(2) Provide an actual keyword search over RDF data.

I don't think (2) is an "easy to add" feature right now. However, (1) would 
probably be done with reasonable effort.

From the first look, 'Stargate' seems quite nice. However, one would add an 
additional major library (SOLR) and an additional dependency (i.e., Stargate). 
CQL collections, on the other, seem to target "small texts" and are probably 
not intended for this use-case ...

Let me know what you think.

- Andreas

Original comment by andreas.josef.wagner on 13 Feb 2014 at 10:05

GoogleCodeExporter commented 9 years ago
[1] http://www.w3.org/TR/sparql11-query/#func-contains 

Original comment by andreas.josef.wagner on 13 Feb 2014 at 10:06