It's not clear that the _toc.xml files are useful (the few that I examined were pretty incomplete), but the current parsing code doesn't match the schema in the XML files at all. The TOC parsing code expects a simple lists of elements at the top level of _toc.xml which contain <title> and <pageno> elements, but the actual structure is:
It's not clear that the _toc.xml files are useful (the few that I examined were pretty incomplete), but the current parsing code doesn't match the schema in the XML files at all. The TOC parsing code expects a simple lists of elements at the top level of _toc.xml which contain
<title>
and<pageno>
elements, but the actual structure is: