Closed notenoughneon closed 9 years ago
Hi Emma
The parsing rules I am following here are:
If a property (p-name
) is empty do not add it to the output. In this case "empty" is classed as not containing any non-whitespace text. As far as I known there is no guidance on how to handle "empty" properties in microfomats paring rules, so I followed the conventions of JSON API's not to return "empty" properties.
The side effect of the above is that p-name
also has a number of "implied rules". The "implied rules" try to automatically fill properties like p-name
if there is no defined value. In your example it uses the full text content of the parent h-entry
.
I can see why the resulting different outputs would seem a little unexpected.
This is not really a bug, but a valid questions about how the parsing rules should work:
p-name
to an element with no text contentI am going to have to post this issue to microfomats IRC and see if we can define the rules a bit more clearly for your use case. Once we have an agreed approached I can update the parser.
Personally I would recommend to any author of microformats to always add a p-name
with some text to every h-*
. Also with my parser try setting the options to {'textFormat': 'normalised'}
you may find the resulting text more useful.
Thanks for clarifying. It sounds like the implied property rule breaks the "note type algorithm" and recommended practice of including a p-name in notes, if the note happens to be a photo with no text. Should this be documented on http://microformats.org/wiki/microformats2-parsing-issues?
I have add this problem to http://microformats.org/wiki/microformats2-parsing-issues. Sorry its taken a little while, but I wanted to go through my code carefully to make sure it was not an issue with my parser.
Please feel free to add your own view on how this should be dealt with to the wiki page.
Interesting question! mf2py and php-mf2 will both happily include empty string values in the output and not generate an implied name. Added comments/votes to the wiki.
Hi Emma, Its been a while but your view of what the output should be got agreed and I have now updated all my javascript parser code. You can try out the html in your example in http://glennjones.net/tools/microformats and the p-name should now return as an empty string.
The example below is not parsing correctly. I would expect the entry "name" to be the empty string. Adding any non-whitespace text to the e-content causes it to revert to expected behavior.