Open phonique opened 9 years ago
Commits b6cd08baa1de174e540767a19fc23c0542911ee0, 2ffbd66df90ae5af9166a8b787c65329369f0069, and 513338aa9e6b241dd9214d4d8d651c804a8f4395
supersede this by replacing docopt with argparse (more control?).
Unicode support should work properly now, including RSS links. Added a proper switch, and the CLI is more forgiving now (courtesy of argparse).
Would appreciate if someone would look at the commented lxml code and get it working, since currently the code runs BeautifulSoup twice.
Also, needs testing with git (I only need to generate a local copy without gh).
Should properly parse UTF-8 now. Added some options (see readme). Should produce (mostly) valid HTML5 when --replace-all=yes is set.
Should solve #51, #43, #39. Adds an alternative for #41.
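For anyone reviewing the switch, here is a minimal sketch of how a yes/no option like --replace-all can be declared with argparse. This is not the code from the commits: the program name, the help text, the default of "no", and the helper names `build_parser` and `parse_cli` are all assumptions made for illustration.

```python
import argparse


def build_parser():
    """Build an illustrative parser (hypothetical, not the project's real CLI)."""
    parser = argparse.ArgumentParser(
        prog="converter",  # hypothetical program name
        description="Illustrative CLI sketch only.",
    )
    parser.add_argument(
        "--replace-all",
        choices=["yes", "no"],  # any other value is rejected with a usage error
        default="no",           # assumed default; the real default may differ
        help="assumed meaning: apply all replacements to the output",
    )
    return parser


def parse_cli(argv):
    """Return True when --replace-all was given as 'yes'."""
    args = build_parser().parse_args(argv)
    # argparse exposes --replace-all as the attribute replace_all
    return args.replace_all == "yes"


if __name__ == "__main__":
    import sys
    print(parse_cli(sys.argv[1:]))
```

With this declaration argparse accepts both `--replace-all=yes` and `--replace-all yes`, which is one way a CLI ends up more forgiving than a strict usage-pattern match.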
Needed the changes for a project, and thought I'd share.
The code contains some more TODOs.