Open symroe opened 3 years ago
A third use case is when a council publishes SOPN info as a webpage rather than as a PDF. Relying on users to convert the page to PDF can lead to parsing failures, as was the case here.
We could accept the SOPN URL, check that it doesn't point to a PDF, and convert HTML SOPNs to PDF before parsing, using https://pypi.org/project/pdfkit/
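A rough sketch of that idea, in case it helps the discussion. The helper names `looks_like_pdf` and `html_sopn_to_pdf` are made up for illustration and are not existing functions in the codebase; the only pdfkit call used is `pdfkit.from_url`, which needs the `wkhtmltopdf` binary installed. The PDF check here is just a heuristic (Content-Type header if we have one, otherwise the file extension).

```python
from urllib.parse import urlparse


def looks_like_pdf(url: str, content_type: str = "") -> bool:
    """Guess whether a SOPN URL points at a PDF (hypothetical helper)."""
    # Prefer the server's Content-Type header when the caller has one.
    media_type = content_type.split(";")[0].strip().lower()
    if media_type == "application/pdf":
        return True
    # Otherwise fall back to the file extension in the URL path.
    return urlparse(url).path.lower().endswith(".pdf")


def html_sopn_to_pdf(url: str, output_path: str, content_type: str = "") -> bool:
    """Render an HTML SOPN to a PDF file; return False if it is already a PDF."""
    if looks_like_pdf(url, content_type):
        return False  # nothing to convert; download the PDF directly instead
    # Imported lazily: pdfkit depends on the external wkhtmltopdf binary.
    import pdfkit

    pdfkit.from_url(url, output_path)
    return True
```

Something like `html_sopn_to_pdf(sopn_url, "/tmp/sopn.pdf")` would then produce a file the existing parser could be pointed at. Whether the rendered output parses well for pages like Bury's is a separate question.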
Bury posted an HTML SOPN in a non-standard format (https://www.bury.gov.uk/council-and-democracy/elections-and-voting/statement-of-persons-nominated). We should discuss whether this is something the parser should support or whether it would be easier to address this on a case-by-case basis.
There are two use cases for the feature:
To solve both of these, we should make sure that the SOPN upload functions aren't tied to the view code, and add a feature to fetch a PDF by URL.
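As a starting point, fetching by URL could live in a plain function with no dependency on the view layer, so views, management commands and scripts can all call it. This is only a sketch using the standard library; `fetch_pdf` and `NotAPDFError` are hypothetical names, and checking for the `%PDF-` header is a simple heuristic rather than full validation.

```python
import urllib.request


class NotAPDFError(ValueError):
    """Raised when the fetched document does not look like a PDF."""


def fetch_pdf(url: str, opener=urllib.request.urlopen, timeout: int = 30) -> bytes:
    """Download a document and return its bytes if it looks like a PDF.

    `opener` is injectable so the function can be tested without network
    access; by default it is urllib.request.urlopen.
    """
    with opener(url, timeout=timeout) as response:
        data = response.read()
    # PDF files begin with the "%PDF-" header; anything else is rejected.
    if not data.startswith(b"%PDF-"):
        raise NotAPDFError(f"{url} did not return a PDF")
    return data
```

The returned bytes could then be handed to whatever upload function the view currently uses, once that function is separated from the view code.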