mysociety / za-hansard

A parser for South African Hansards, as published at http://www.parliament.gov.za/live/content.php?Category_ID=119

Q&A: in answer scraping, cope with written and oral numbers in reverse order. #41

Open duncanparkes opened 10 years ago

duncanparkes commented 10 years ago

Currently, our regex for matching written and oral question numbers in the NA answers rejects any document name that has a written number followed by an oral number. This could be fixed relatively easily by changing the regex.

For example, we currently miss documents with names like

RNW855O134-091015

which is Written question 855 and Oral question 134.
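
A rough sketch of what an order-tolerant pattern might look like is below. This is not the project's actual regex: the pattern, the `parse_question_name` helper, and the group names are all hypothetical. It assumes names start with `R`, that `NW` prefixes the written number and `O` the oral number, and that a six-digit suffix follows the hyphen. The oral-first form (`RO134NW855-...`) is inferred from the description above rather than confirmed.

```python
import re

# Hypothetical pattern (not the project's real regex). Assumed name layout:
#   R, then NW<written> and/or O<oral> in either order, then -<six digits>.
QUESTION_NAME_RE = re.compile(
    r"""
    ^R
    (?:
        NW(?P<written_first>\d+)(?:O(?P<oral_second>\d+))?
      |
        O(?P<oral_first>\d+)(?:NW(?P<written_second>\d+))?
    )
    -(?P<suffix>\d{6})$
    """,
    re.VERBOSE,
)


def parse_question_name(name):
    """Return the written and oral numbers from a document name, or None."""
    match = QUESTION_NAME_RE.match(name)
    if match is None:
        return None
    # Each number can come from either branch of the alternation.
    written = match.group("written_first") or match.group("written_second")
    oral = match.group("oral_first") or match.group("oral_second")
    return {
        "written_number": int(written) if written else None,
        "oral_number": int(oral) if oral else None,
        "suffix": match.group("suffix"),
    }


if __name__ == "__main__":
    print(parse_question_name("RNW855O134-091015"))
```

Splitting the two orders into separate alternation branches keeps each branch simple and makes it impossible to match the same kind of number twice, at the cost of merging the groups afterwards.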