mysociety / za-hansard

A parser for South African Hansards, as published at http://www.parliament.gov.za/live/content.php?Category_ID=119
Other
2 stars 3 forks source link

Parsing roles with a specified name doesn't work #25

Closed osfameron closed 10 years ago

osfameron commented 10 years ago

Geoff writes:

I noticed at least once instance where speeches by the House Chairperson are not being identified - e.g. http://za.mzalendo.dev.mysociety.org/hansard/3#s6

The following line isn't recognized as starting a new speech:

The HOUSE CHAIRPERSON (Mr C T Frolick): Hon member, that includes you.

We currently only allow a party sigil in the name_regexp (e.g. \w+) as follows:

https://github.com/mysociety/za-hansard/blob/master/za_hansard/parse.py#L263

obviously we can relax that rule, but: