queenvictoria / represent

Other
1 stars 2 forks source link

Support for earlier XML formats #11

Open queenvictoria opened 12 years ago

queenvictoria commented 12 years ago

Also find out when the cut off was for changing the format. Sometime between 2006 and June 2011

queenvictoria commented 12 years ago

Is this the split? 10 May 2011 we see a filename change and a change from no bills to lots of bills. None of the 42nd parliament files get through the sanity check either.

Current file is: ../data/parlinfo.aph.gov.au/hansard/43/8043-5.xml ../data/parlinfo.aph.gov.au/hansard/43/8043-5.xml 0 debates found. Current file is: ../data/parlinfo.aph.gov.au/hansard/43/House of Representatives_2011_05_10_10_Official.xml ../data/parlinfo.aph.gov.au/hansard/43/House of Representatives_2011_05_10_10_Official.xml 4 debates found.

queenvictoria commented 12 years ago

Older format mentions a bill reference -- does the new format also ? This <inline ref="R4534">bill</inline>