No author on Grapefruit

aaronpk / XRay

X-Ray returns structured data from any URL

MIT License

90 stars 15 forks source link

This may require a lot more refactoring than I initially thought. It looks like, whenever a fragment URL is provided, XRay is only going to parse that little piece of HTML:

https://github.com/aaronpk/XRay/blob/417cc1b3cc77ed86edccf72db174853ade1d9d2b/lib/XRay/Formats/HTML.php#L82-L92

From that point in the code forward, it doesn’t even know the h-entry was part of an h-feed.

I also noticed there that PHP’s default DOMDocument is used to parse and then save the HTML. This could potentially mess up some HTML. As php-mf2 supports taking a DOMDocument as input, it definitely shouldn’t get saved to HTML first. (And it should possibly use the userland HTML parser.)

Not sure if a simply solution is available here.

aaronpk / XRay

No author on Grapefruit #69