racket / scribble

Other
203 stars 93 forks source link

Semantic overhaul of Scribble with RDFa #228

Open gregid opened 4 years ago

gregid commented 4 years ago

This is a proposal for an overhaul of current Scribble implementation.

The goal of the project is for every document generated by Scribble to be semantically annotated with RDFa + JSON-LD and search to be (also) implemented with HDT/SPARQL as engine.

The benefits:

The means/Required work:

  1. implement RDF toolset in Racket:
    • RDF turtle/NQuads
    • JSON-LD (implemented by @cwebber)
    • RDFa parser
    • HDT ffi (for data storage and querying)
  2. Building OWL ontology for Scribble documentation/code
  3. Rework of Scribble output to replace current class based annotation with RDFa. If you are not familiar with Semantic annotation take a look at https://schema.org/Article especially scroll to the bottom to the RDFa tab.
  4. Data querying based on data stored in HDT files for local documentation and possibly triple store (e.g. RDF4J) for the server side documentation. http://www.rdfhdt.org/what-is-hdt/

If you have any questions regarding benefits/implementation details feel free to ask here.

jackfirth commented 4 years ago

I'm a little confused, are you proposing this as something you'd like to do? Or the current maintainers of Scribble? The latter seems unlikely to happen since this is a very large project.

gregid commented 4 years ago

I am writing it down so it won't be lost in a case I won't be able to come back to this myself and to open this for discussion and critical review.

Referenced issue and pull request were just the trigger as with semantified scribble these issues could be easily solved (and more).

I would love to do this myself as this is my area of expertise. But as mentioned on Racket Users at the moment I can't afford to spend time on free work, at least until I start having some income from my business in the making.

And yes I agree this is a large project, but it is also possible it could interest someone as a research project in the Uni Departments dealing with semantic data/ontologies/knowledge graphs.

In the meantime the least I can do is discuss the proposal if there is any interest at all.