The current DocsContextProvider mostly uses the default from `node-html-markdown to convert HTML to markdown, but this sometimes includes junk, including ads, navigation text, etc... We just want the important content (headers, paragraphs, etc...).
hey, hi! how can i know more about this and do we want to shift to a new tool like puppeteer and cheerio or we just want to change configs of node-html-markdown according to our needs.
The current
DocsContextProvider
mostly uses the default from `node-html-markdown to convert HTML to markdown, but this sometimes includes junk, including ads, navigation text, etc... We just want the important content (headers, paragraphs, etc...).