jugglerchris / rust-html2text

Rust library to render HTML as text.
MIT License
152 stars 27 forks source link

Raw extract #125

Closed sftse closed 7 months ago

sftse commented 8 months ago

A raw mode was requested in #53

I'm also in need of this, as well as a mode that simply does not draw borders. The difference between the two is that raw mode can be used as input to a full text index, as it respects locality of words, words that are inside one cell should be adjacent in the input to a search index.

I don't quite get how this works with deeply nested tables (see deeply_nested test), but think this could benefit from your input at this stage.

sftse commented 8 months ago

Builds on #124

jugglerchris commented 7 months ago

Hi - I've rebased (hopefully I didn't break it) as #128 and will look over it there. Sorry for messing things around!

sftse commented 7 months ago

Closing in favor of #129