I was looking through a book of sermons and noticed that all of the chapter numbers in the scripture references where Roman numerals. In the interest of being able to easily parse scanned text for scripture references, we probably ought to update our regular expressions to find references with chapter numbers in Roman numeral form and update the normalize function to convert it into the appropriate integer value.
These references also do not contain the colon (probably since it is unnecessary with the chapter numbers and verse numbers being in a different format) but rather a period.
For example:
Matthew xvii. 19-21
Isa. iii. 10, 11
Jeremiah xlviii. 11, 12
1 John v. 10
2 Kings vii. 2
I was looking through a book of sermons and noticed that all of the chapter numbers in the scripture references where Roman numerals. In the interest of being able to easily parse scanned text for scripture references, we probably ought to update our regular expressions to find references with chapter numbers in Roman numeral form and update the normalize function to convert it into the appropriate integer value.
These references also do not contain the colon (probably since it is unnecessary with the chapter numbers and verse numbers being in a different format) but rather a period.
For example:
Matthew xvii. 19-21 Isa. iii. 10, 11 Jeremiah xlviii. 11, 12 1 John v. 10 2 Kings vii. 2