SmartDataAnalytics / jena-sparql-api

A collection of Jena-extensions for hiding SPARQL-complexity from the application layer

Binary Search fails on a databus resource #45

Open Aklakan opened 3 years ago

Aklakan commented 3 years ago

The request below results in a syntax error in the binary search engine (currently in jena-sparql-api-io-core). The block boundary in the bzip2 stream is not detected correctly, which causes data to be cut off. The buggy code still uses my regex matching and needs to be updated to use the Hadoop codec.

SELECT * {
  SERVICE <x-binsearch:vfs:https://downloads.dbpedia.org/repo/lts/transition/links/2017.11.01/links_domain=bricklink_lang=en.nt.bz2> {
    { SELECT * {
      ?s ?p ?o
    } LIMIT 10 }
  }
}
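
For illustration only, here is a minimal sketch of why block detection in bzip2 is delicate. It is not the code from jena-sparql-api-io-core and not Hadoop's implementation. Each compressed block in a bzip2 stream starts with the 48-bit magic 0x314159265359, and blocks are packed at bit granularity, so a boundary does not have to fall on a byte boundary. A matcher that only looks at whole bytes can therefore miss a block start. Whether that is the exact cause of the failure above is an assumption. The class and method names (`Bzip2BlockScan`, `findBlockStarts`) are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

final class Bzip2BlockScan {
    // 48-bit magic that starts every bzip2 compressed block.
    static final long BLOCK_MAGIC = 0x314159265359L;
    static final long MASK_48 = 0xFFFFFFFFFFFFL;

    /**
     * Returns the bit offsets (counted from the start of data) at which the
     * block magic begins. The magic may also occur by chance inside compressed
     * data, so a real reader has to validate each candidate, for example by
     * trying to decode the block.
     */
    static List<Long> findBlockStarts(byte[] data) {
        List<Long> starts = new ArrayList<>();
        long window = 0;   // the most recent 48 bits seen
        long bitsRead = 0;
        for (byte b : data) {
            for (int i = 7; i >= 0; i--) {   // bzip2 packs bits most significant bit first
                window = ((window << 1) | ((b >> i) & 1)) & MASK_48;
                bitsRead++;
                if (bitsRead >= 48 && window == BLOCK_MAGIC) {
                    starts.add(bitsRead - 48);
                }
            }
        }
        return starts;
    }

    public static void main(String[] args) {
        // Magic shifted by 3 bits: no byte in this array equals 0x31, 0x41, ... in sequence.
        long shifted = BLOCK_MAGIC << 13;
        byte[] data = new byte[8];
        for (int i = 0; i < 8; i++) {
            data[i] = (byte) (shifted >>> (56 - 8 * i));
        }
        System.out.println(findBlockStarts(data));
    }
}
```

The scan keeps a sliding 48-bit window instead of comparing byte sequences, which is the main difference from a byte-oriented pattern match.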

Aklakan commented 3 weeks ago

The URL is still online, and the query works with the revised code, which now relies only on Hadoop's BZip2Codec (and none of my old matching code).

Remaining work before finally closing this issue: