zotero / translators

Zotero Translators
http://www.zotero.org/support/dev/translators

Implement HTML5 microdata #366

Open simonster opened 12 years ago

simonster commented 12 years ago

Mozilla bug 591467 will land microdata on mozilla-central. Once this happens, we should consider supporting the rather anemic schema.org types.

aurimasv commented 9 years ago

The JS API was implemented (though I haven't found much documentation), but there's some talk of removing it: https://bugzilla.mozilla.org/show_bug.cgi?id=909633

It would still be good to support this.

zuphilip commented 9 years ago

Project Gutenberg should be handled by a (not yet existing) microdata translator; see https://forums.zotero.org/discussion/47999/gutenberg-import-not-working/

dgerber commented 7 years ago

This branch has a generic microdata parser, plus some mapping from the schema.org vocabulary to Zotero, added to the RDF translator as suggested in #917.

zuphilip commented 7 years ago

@dgerber This looks interesting. Is your code ready, or is it still a work in progress? CC @adam3smith

dgerber commented 7 years ago

The mapping microdata->RDF works fine on the few pages I tried, and should support any microdata markup.
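For readers unfamiliar with the idea, here is a minimal sketch of a microdata-to-triples mapping. It is not the code from the branch: the class name `MicrodataToTriples`, the helper `microdata_to_triples`, the `_:bN` blank-node labels and the sample markup are all assumptions made for illustration, written in Python with only the standard library. Property names are kept as bare strings rather than expanded against the item's vocabulary, and `itemref` is not handled.

```python
from html.parser import HTMLParser
from itertools import count

# Elements that never have an end tag, so they are not pushed on the stack.
VOID = {"area", "base", "br", "col", "embed", "hr", "img", "input",
        "link", "meta", "source", "track", "wbr"}
# Elements whose property value is taken from a URL attribute.
URL_ATTR = {"a": "href", "area": "href", "link": "href", "img": "src",
            "audio": "src", "video": "src", "source": "src",
            "embed": "src", "iframe": "src", "track": "src"}


class MicrodataToTriples(HTMLParser):
    """Hypothetical, simplified microdata reader that emits (s, p, o) tuples."""

    def __init__(self):
        super().__init__()
        self.triples = []
        self._ids = count()
        self._stack = []  # one frame per open non-void element

    def _current_item(self):
        # The nearest enclosing element that carries itemscope, if any.
        for frame in reversed(self._stack):
            if frame["item"] is not None:
                return frame["item"]
        return None

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        parent = self._current_item()
        item = None
        if "itemscope" in attrs:
            # Use itemid when given, otherwise mint a blank node label.
            item = attrs.get("itemid") or f"_:b{next(self._ids)}"
            for item_type in (attrs.get("itemtype") or "").split():
                self.triples.append((item, "rdf:type", item_type))
        props = (attrs.get("itemprop") or "").split()
        text_props = []
        if props and parent is not None:
            if item is not None:
                value = item  # nested item becomes the object
            elif tag == "meta":
                value = attrs.get("content") or ""
            elif tag in URL_ATTR:
                value = attrs.get(URL_ATTR[tag]) or ""
            elif tag == "time" and attrs.get("datetime"):
                value = attrs["datetime"]
            else:
                value, text_props = None, props  # resolved at the end tag
            if value is not None:
                for prop in props:
                    self.triples.append((parent, prop, value))
        if tag not in VOID:
            self._stack.append({"tag": tag, "item": item, "subject": parent,
                                "text_props": text_props, "text": []})

    def handle_data(self, data):
        # Text content counts for every open element waiting on a text value.
        for frame in self._stack:
            if frame["text_props"]:
                frame["text"].append(data)

    def handle_endtag(self, tag):
        if not any(frame["tag"] == tag for frame in self._stack):
            return  # stray end tag, or a void element written as <meta/>
        while self._stack:
            frame = self._stack.pop()
            text = "".join(frame["text"]).strip()
            for prop in frame["text_props"]:
                self.triples.append((frame["subject"], prop, text))
            if frame["tag"] == tag:
                break


def microdata_to_triples(html):
    parser = MicrodataToTriples()
    parser.feed(html)
    parser.close()
    return parser.triples


EXAMPLE_HTML = """
<div itemscope itemtype="http://schema.org/Book">
  <h1 itemprop="name">Moby Dick</h1>
  <span itemprop="author" itemscope itemtype="http://schema.org/Person">
    <span itemprop="name">Herman Melville</span>
  </span>
  <meta itemprop="datePublished" content="1851">
  <a itemprop="url" href="https://example.org/moby-dick">link</a>
</div>
"""

for triple in microdata_to_triples(EXAMPLE_HTML):
    print(triple)
```

In this sketch the author becomes a blank node of its own, which is the situation that leads to duplicate author nodes when the same person appears more than once on a page.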

So, functionally, it would be ready after adding a proper detectWeb and more extensive support for schema.org terms (in the RDF translator).

One missing feature is node identification, e.g. merging sameAs-related nodes when an author ends up in more than one blank node, or reconciling http: and https: prefixes. Maybe the underlying RDF store can apply such inference rules (?), but that isn't exposed to the translator sandbox environment.
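To make the node-identification problem concrete, here is a rough sketch of one way such merging could work outside the RDF store. It is an assumption-laden illustration, not something the branch or the translator sandbox provides: the function names `normalise_iri` and `merge_same_as`, the bare `sameAs` predicate, the `_:` blank-node prefix and the sample triples are all hypothetical. It uses a small union-find to group nodes linked by `sameAs`, and treats the `http:` and `https:` forms of a `sameAs` target as the same identifier.

```python
SAME_AS = "sameAs"  # assumed bare predicate name, not a full schema.org IRI


def normalise_iri(term):
    """Treat the http: and https: forms of an IRI as one identifier."""
    if term.startswith("http://"):
        return "https://" + term[len("http://"):]
    return term


def merge_same_as(triples):
    """Collapse nodes linked by sameAs into a single representative node.

    Simplification: nodes and literals are both plain strings here, so a
    literal that happens to equal a merged node label would be rewritten too.
    """
    parent = {}

    def find(node):
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]  # path halving
            node = parent[node]
        return node

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a == root_b:
            return
        # Prefer a named node over a blank node as the representative.
        if root_a.startswith("_:"):
            root_a, root_b = root_b, root_a
        parent[root_b] = root_a

    for subject, predicate, obj in triples:
        if predicate == SAME_AS:
            union(subject, normalise_iri(obj))

    def canon(term):
        return find(term) if term in parent else term

    merged = []
    for subject, predicate, obj in triples:
        if predicate == SAME_AS:
            continue  # already used for merging, so dropped from the output
        triple = (canon(subject), predicate, canon(obj))
        if triple not in merged:
            merged.append(triple)
    return merged


EXAMPLE_TRIPLES = [
    ("_:b0", "author", "_:b1"),
    ("_:b1", "name", "Herman Melville"),
    ("_:b1", "sameAs", "http://example.org/melville"),
    ("_:b3", "author", "_:b2"),
    ("_:b2", "sameAs", "https://example.org/melville"),
    ("_:b2", "name", "Herman Melville"),
]

for triple in merge_same_as(EXAMPLE_TRIPLES):
    print(triple)
```

Doing this in translator code duplicates what an inference-capable store would do, which is why exposing such rules from the store, if it supports them, would be the cleaner option.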

Also, architecturally, the way it calls the RDF translator is somewhat of a hack, and generic translators (EM, microdata, JSON-LD?) could be better coordinated.