Closed alnutile closed 1 year ago
https://github.com/alnutile/larachain/actions/runs/5248223724 this might help. I could not replicate the issue but setup it up and saw two other issues that could lead to something.
1) when scraping a root site page it named the file .html so just to make that cleaner it now is index.html 2) I remove unicode from the html string just to make sure it does not mess up any thing.
There are some fixes here but since I could not replicate the code I will just see if it happens again
Reported issue
The combination was something like