dglazkov / polymath

MIT License

Fix content bugs in the medium importer #66

Open jkomoros opened 1 year ago

jkomoros commented 1 year ago

The Medium importer has a few content bugs, such as escape codes appearing in place of quotes and ellipses, because Medium applies its own typographic formatting. Those should be normalized.

This normalization should probably happen in the strip_emojis step and run on all input.
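
A minimal sketch of what such a normalization pass might look like, assuming the "escape codes" are HTML entities or typographic Unicode characters (the issue does not say which). The function name normalize_typography and the character mapping are hypothetical and not taken from the polymath codebase:

```python
import html

# Hypothetical mapping: the issue does not list the affected characters,
# so this covers only curly quotes and the single-character ellipsis.
_TYPOGRAPHIC_MAP = str.maketrans({
    "\u2018": "'",    # left single quotation mark
    "\u2019": "'",    # right single quotation mark / apostrophe
    "\u201c": '"',    # left double quotation mark
    "\u201d": '"',    # right double quotation mark
    "\u2026": "...",  # horizontal ellipsis
})


def normalize_typography(text: str) -> str:
    """Decode HTML entities, then map typographic characters to ASCII."""
    # html.unescape turns entities such as &#8220; or &hellip; into characters.
    decoded = html.unescape(text)
    return decoded.translate(_TYPOGRAPHIC_MAP)
```

If this ran alongside strip_emojis, it could be applied to every imported chunk rather than only Medium content. Whether an ellipsis should become three periods or stay as one character is a judgment call left open here.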

jkomoros commented 1 year ago

See also #99