TYPO3-Documentation / t3docs-search-indexer

TYPO3 Docs Search

#49 Adapt documentation indexing for new content structure #51

Closed: soee closed this 11 months ago

soee commented 11 months ago

The new PHP-based documentation rendering works well with the current indexer code. However, minor changes in the structure of the rendered HTML files made some adjustments necessary.

When checking HTML files for content to index, the crawler located the main content of the old rendering with the selector div[itemprop="articleBody"]. The new rendering wraps that content in an article tag instead. Likewise, snippets were generated from div.section elements in the old rendering, whereas the new rendering matches them by section tags.
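The selector differences above can be summarised as a small lookup table. This is only an illustrative Python sketch, not the PHP code from ParseDocumentationHTMLService: the names SELECTORS and selectors_for are hypothetical, and only the four selector strings come from the description.

```python
# Illustrative sketch; SELECTORS and selectors_for are hypothetical names.
# Only the selector strings are taken from the PR description.
SELECTORS = {
    "old": {
        "content": 'div[itemprop="articleBody"]',  # main content container
        "snippets": "div.section",                 # one snippet per element
    },
    "new": {
        "content": "article",
        "snippets": "section",
    },
}


def selectors_for(is_new_rendering: bool) -> dict:
    """Return the CSS selectors that match the given rendering."""
    return SELECTORS["new" if is_new_rendering else "old"]
```

Keeping both selector sets side by side would let the indexer keep handling manuals that are still rendered the old way.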

These updates are made in the ParseDocumentationHTMLService class. The class also determines whether a file was generated with the old or the new rendering by checking the generator meta tag. The new rendering produces:

<meta content="phpdocumentor/guides" name="generator">
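A minimal sketch of such a generator check, written in Python with only the standard library. It illustrates the idea and is not the actual PHP implementation: GeneratorMetaParser and is_new_rendering are hypothetical names, and comparing the value exactly to "phpdocumentor/guides" is an assumption based on the tag shown above.

```python
from html.parser import HTMLParser


class GeneratorMetaParser(HTMLParser):
    """Records the content of the first <meta name="generator"> tag."""

    def __init__(self):
        super().__init__()
        self.generator = None

    def handle_starttag(self, tag, attrs):
        # Only the first generator meta tag is of interest.
        if tag != "meta" or self.generator is not None:
            return
        attributes = dict(attrs)
        if (attributes.get("name") or "").lower() == "generator":
            self.generator = attributes.get("content")


def is_new_rendering(html: str) -> bool:
    """Assumption: the new rendering is identified by this exact value."""
    parser = GeneratorMetaParser()
    parser.feed(html)
    return parser.generator == "phpdocumentor/guides"
```

A file without a generator meta tag, or with a different generator value, is treated as old rendering in this sketch.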

This commit also includes:

Resolves issue: #49

linawolf commented 11 months ago

@soee please rebase and resolve the conflicts