Hi, first time poster here. I apologize in advance for not following issue-submission protocol that may exist.
I am working on converting corporate annual reports (default format html, yet no standardized form of html) to text with markdown syntax. HTML2Text works perfectly for and tags, but not for type tags. In these instances, the text is displayed with no markdown tags. I am a novice Python programmer and I cannot overcome this issue on my own.
This research is very important as it will expose certain companies that were either negligent or incompetent in the years before and surrounding the recent financial meltdown. Any help will be greatly appreciated.
Here is some sample html that exhibits the problem I described above...
@ordinaryProfessor this is not a good method of sharing HTML (I am afraid Google Docs do some conversion about it). Do you want to say that your example is http://mcepl.fedorapeople.org/tmp/SampleHTML.html ?
Hi, first time poster here. I apologize in advance for not following issue-submission protocol that may exist.
I am working on converting corporate annual reports (default format html, yet no standardized form of html) to text with markdown syntax. HTML2Text works perfectly for and tags, but not for type tags. In these instances, the text is displayed with no markdown tags. I am a novice Python programmer and I cannot overcome this issue on my own.
This research is very important as it will expose certain companies that were either negligent or incompetent in the years before and surrounding the recent financial meltdown. Any help will be greatly appreciated.
Here is some sample html that exhibits the problem I described above...
https://docs.google.com/document/d/1PUSJWCfnddFCMzb_qiIg7dQYxwyBJpsh-T_cR55oa-A/edit?usp=sharing