I couldn't leave this alone. More FYI: running the extraction on RFC 3986 collects not only duplicate rules, but also a few lines that were never intended to be ABNF:
result = ""
$1 = http:
$2 = http
$3 = //www.ics.uci.edu
$4 = www.ics.uci.edu
$5 = /pub/ietf/uri/
$6 = <undefined>
$7 = <undefined>
$8 = #Related
$9 = Related
Fixing duplicates is trivial to script, but fixing these bogons looks like an AI problem. This really strengthens the argument that, going forward, actual code should be embedded in the XML source and the checker applied there.
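For what it's worth, here is a minimal Python sketch of the scriptable part, plus the one mechanical check I can think of for the bogons. The names `dedupe_rules` and `has_valid_rulename` are made up for illustration, and the sketch assumes one extracted rule per line, which real multi-line ABNF will not always satisfy. The rulename pattern follows RFC 5234 (an ALPHA followed by ALPHA / DIGIT / "-").

```python
import re

# ABNF rulename per RFC 5234: ALPHA *(ALPHA / DIGIT / "-")
RULENAME = re.compile(r"^[A-Za-z][A-Za-z0-9-]*$")


def dedupe_rules(lines):
    """Drop repeated lines, keeping the first occurrence and the original order.

    Assumption: each extracted rule sits on a single line. Lines are compared
    after collapsing runs of whitespace, so spacing differences do not matter.
    """
    seen = set()
    kept = []
    for line in lines:
        key = " ".join(line.split())
        if key and key not in seen:
            seen.add(key)
            kept.append(line)
    return kept


def has_valid_rulename(line):
    """Return True if the text left of the first '=' is a legal ABNF rulename."""
    name, sep, _ = line.partition("=")
    return bool(sep) and RULENAME.match(name.strip()) is not None


if __name__ == "__main__":
    extracted = [
        'scheme = ALPHA *( ALPHA / DIGIT / "+" / "-" / "." )',
        'scheme = ALPHA *( ALPHA / DIGIT / "+" / "-" / "." )',
        'result = ""',
        "$1 = http:",
        "$6 =",
    ]
    for rule in dedupe_rules(extracted):
        flag = "ok" if has_valid_rulename(rule) else "suspect"
        print(f"{flag:8}{rule}")
```

The rulename check flags the `$1` through `$9` lines, since `$` cannot start a rulename. But `result = ""` passes, because syntactically it is a perfectly good ABNF rule, which is exactly why I don't see a mechanical way to weed out all of the bogons.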