JATS4R / JATS4R-Participant-Hub

The hub for all JATS4R meeting notes, examples, draft recommendations, documents, and issues.
http://jats4r.org
17 stars 20 forks source link

Which version and flavor of JATS? #105

Closed Klortho closed 9 years ago

Klortho commented 9 years ago

This was discussed a little bit in the telecon this morning, mostly regarding the difference between green (archiving) and blue (publishing).

Some questions for us to consider:

Some people this morning said that blue is often too restrictive. I think the trouble with green is that it is way too permissive, and one of the focusses of this group is regularizing a lot of the markup. Brainstorming, I am wondering if we might be able to validate green documents against the blue JATS schema, and make validation errors into JATS4R warnings, for example.

Here are some notes from the minutes of the telecon this morning:

hubgit commented 9 years ago

Given that the recommendations are for publishing XML, it would make sense to base those recommendations on the Publishing DTD.

Is anyone publishing article XML using the Archiving DTD?

vincentml commented 9 years ago

Is anyone publishing article XML using the Archiving DTD?

Yes, most definitely.

The Publishing DTD makes a certain set of choices about what to restrict based on the Archiving DTD. Those choices don't meet with every publishers' needs. In our version of JATS we started with the Archiving DTD and chose restrictions that are in some areas similar to the Publishing DTD and in some areas different than Publishing DTD.

hubgit commented 9 years ago

Is anyone publishing article XML using the Archiving DTD?

Yes, most definitely.

Can you point to an example XML file online?

jeffbeckncbi commented 9 years ago

All of the PMC articles - including the ones from the OAI that Daniel used to first complain about them (and get this whole thing started) are tagged in version of the Archiving Tag Set.

I can get some examples if you need them for testing.

hubgit commented 9 years ago

All of the PMC articles […] are tagged in version of the Archiving Tag Set.

Were they originally published in that format, or converted to the Archiving DTD for archiving? (I should have clarified in my original question - by "anyone", I meant publishers).

jeffbeckncbi commented 9 years ago

Most, but not all, were created in Publishing. We have a good number of publishers sending us content in Archiving models.

hubgit commented 9 years ago

We have a good number of publishers sending us content in Archiving models.

It might be useful to know who those are and which elements they're using that aren't in the Publishing DTD. Maybe that's out of scope for this group, though.

Klortho commented 9 years ago

We discussed this at the last telecon. Minutes are here, under "August 6".

I think the consensus was that there are some good reasons to use Green, and so we really can't require that publishers use Blue, but we could recommend it. Among the reasons is that some attributes in Blue are defined as a fixed set of values (token type) whereas in Green they are open to any value. There was the suggestion that we submit comments to the JATS steering committee to request that they be de-restricted. Melissa added a comment here and I added a follow-up comment here.

One of the ideas that came up was that we could look at the differences between the two, and incorporate some of the Blue rules into our schematrons. There's even a tool, dtdschematron, that would convert the Blue DTD into schematron for us.

Nikos-Markantonatos commented 9 years ago

All our client publishers are using the Green DTD because that's the DTD that the Atypon platform supports. To be honest, I never understood why publishers should limit themselves to the Blue DTD when the Green DTD offers higher flexibility and more functionality. The argument that the Green DTD is "too permissive" is rather mute, because DTDs are by definition too permissive, that's why people resort to Schematron and other such techniques to harness XMLs in how they want them to be encoded. On the other hand, Blue DTD is becoming Greener by the day to the degree that the JATS committee is considering dropping the distinction between the two flavors in the future. In fact, this will actually be good news to the JATS4R initiative, because that's one less DTD flavor to write recommendations about.

But in any case, I see no concrete arguments whereby JATS4R would recommend Blue in favor of Green. In fact, the fixed set of values in Blue are an indication that Green maybe the preferred DTD flavor.

Klortho commented 9 years ago

I wrote this up, that we are flavor-agnostic.