iuliaturc closed this issue 1 day ago
Interesting, ccing @tomkosm here.
Hey @iuliaturc thanks for bringing this up! The backslashes you’re seeing are actually due to the way our markdown parser handles text that’s part of a link or button. In this case, the text you’re referring to is likely inside an expandable block (with the "expand 14 parameters" button). The parser adds these backslashes to preserve the link functionality within markdown.
We’ll be closing this issue as "not planned," but feel free to reopen it or create a new issue if needed. Let me know if you have any further questions!
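If the escapes get in the way downstream, one option is to strip them after scraping. The following is a minimal sketch, not part of Firecrawl: `strip_markdown_escapes` is a hypothetical helper, the sample string and its example.com URL are made up for illustration, and it assumes the escapes follow the CommonMark convention that a backslash before an ASCII punctuation character is an escape.

```python
import re

# Assumption: escapes follow CommonMark, where a backslash may escape
# any ASCII punctuation character (the ranges !-/ :-@ [-` {-~).
_ESCAPED_PUNCT = re.compile(r"\\([!-/:-@\[-`{-~])")


def strip_markdown_escapes(markdown: str) -> str:
    """Hypothetical helper: drop the backslash in front of escaped punctuation."""
    return _ESCAPED_PUNCT.sub(r"\1", markdown)


if __name__ == "__main__":
    # Made-up sample resembling escaped link text; not real scraper output.
    sample = r"\[expand\] (\[PreTrainedModel\](https://example.com))"
    print(strip_markdown_escapes(sample))
```

Note that removing the escapes can change how the Markdown renders (an unescaped `*` or `[` may be read as emphasis or a link), and the function does not skip code spans or fenced blocks, so this fits best when the text is consumed as plain text rather than re-rendered.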
Thanks for the explanation!
When scraping https://huggingface.co/docs/transformers/main_classes/pipelines, I'm seeing a lot of backslashes:
Firecrawl Markdown:
Note that these backslashes don't always show up. For instance, when I scrape https://huggingface.co/transformers/main_classes/tokenizer.html#transformers, I get cleaner Markdown: