Closed leobruneau closed 11 months ago
Hi @johentsch ! I will address all the points during this week as well as add unittests for the new code
Thank you. P.S.: I had forgotten to push my merging the main branch. Just pushed the commit, please pull...
Hey @johentsch ! All problems should've been addressed as well as unittests added for the methods. I also re-implemented random excerpt generation according to what we had talked about and what is expected within the unittests that you wrote. I think we should be ready to close this pull request and open another one containing the requested changes
With new updates it is now possible to:
An ulterior addition is the cleansing of the XML that, during excerpt creation, removes all tags that refer to repeat-like structures to avoid generating "broken" and/or nonsensical excerpts
All the methods have been clearly described within corresponding docstrings