mozillascience / software-discovery-dashboard

MIT License

Remove HTML from descriptions #80

Open lukecoy opened 8 years ago

lukecoy commented 8 years ago

A lot of DataCite descriptions contain stray HTML tags for some reason. The tags show up in the JSON response and on the DataCite article pages themselves, but they shouldn't be there. We should parse the HTML and keep only the inner text.
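A minimal sketch of the "parse the HTML and extract the inner text" idea, using only Python's standard-library `html.parser`. The helper name `strip_html` and the class `_TextExtractor` are hypothetical and are not taken from this repository; the sketch assumes each description arrives as a plain string (or `None`) from the JSON response.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects only the text nodes of an HTML fragment (hypothetical helper)."""

    def __init__(self):
        # convert_charrefs=True decodes entities such as &amp; into "&"
        super().__init__(convert_charrefs=True)
        self._chunks = []

    def handle_data(self, data):
        # Called for text between tags; the tags themselves are ignored
        self._chunks.append(data)

    def text(self):
        # Join the text nodes and collapse runs of whitespace into one space
        return " ".join("".join(self._chunks).split())


def strip_html(description):
    """Return the description with HTML tags removed (assumed input: str or None)."""
    if not description:
        return ""
    parser = _TextExtractor()
    parser.feed(description)
    parser.close()  # flush any text still buffered by the parser
    return parser.text()


if __name__ == "__main__":
    print(strip_html("<p>A <b>fast</b> solver for sparse systems</p>"))
```

Text nodes are joined without a separator so that inline markup such as `H<sub>2</sub>O` stays intact; the trade-off is that adjacent block elements like `<p>one</p><p>two</p>` run together, which a fuller version might handle by inserting a space at block-level tags.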

lukecoy commented 8 years ago

I'll parse and validate on my end.