USPTO / PatentPublicData

Utility tools to help download and parse patent data made available to the public

transformer tool java.lang.IllegalArgumentException #103

Closed sotnikov-s closed 4 years ago

sotnikov-s commented 4 years ago

Hey, long time no see :) Another transformer issue here:

transformer params: -f=pftaps20010109_wk02.zip --type=json --outDir=. --outBulk=false --prettyPrint=true --skip=2291 --limit=1

output:

log4j:WARN No appenders could be found for logger (gov.uspto.patent.PatentDocFormatDetect).
log4j:WARN Please initialize the log4j system properly.
log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more info.
Exception in thread "main" java.lang.IllegalArgumentException: Illegal character in name: '--'.
    at org.dom4j.QName.validateName(QName.java:340)
    at org.dom4j.QName.<init>(QName.java:151)
    at org.dom4j.QName.<init>(QName.java:143)
    at org.dom4j.tree.QNameCache.createQName(QNameCache.java:230)
    at org.dom4j.tree.QNameCache.get(QNameCache.java:86)
    at org.dom4j.DocumentFactory.createQName(DocumentFactory.java:195)
    at org.dom4j.DocumentFactory.createElement(DocumentFactory.java:140)
    at org.dom4j.DocumentHelper.createElement(DocumentHelper.java:53)
    at gov.uspto.parser.keyvalue.KeyValue2Dom4j.genXml(KeyValue2Dom4j.java:237)
    at gov.uspto.parser.keyvalue.KvParser.parse(KvParser.java:67)
    at gov.uspto.patent.PatentReader.read(PatentReader.java:82)
    at gov.uspto.bulkdata.tools.transformer.TransformerRecordProcessor.process(TransformerRecordProcessor.java:72)
    at gov.uspto.bulkdata.RecordReader.read(RecordReader.java:195)
    at gov.uspto.bulkdata.RecordReader.read(RecordReader.java:122)
    at gov.uspto.bulkdata.RecordReader.read(RecordReader.java:85)
    at gov.uspto.bulkdata.RecordReader.read(RecordReader.java:43)
    at gov.uspto.bulkdata.cli.Transformer.exec(Transformer.java:77)
    at gov.uspto.bulkdata.cli.Transformer.main(Transformer.java:115)
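
For anyone reading the trace: the exception is raised when dom4j validates the element name `--` passed in from `KeyValue2Dom4j.genXml`. An XML name cannot start with a hyphen, so a record whose field key comes out as `--` cannot be turned into an element as-is. Below is a minimal sketch of one possible guard, checking and cleaning a key before it is used as an element name. This is not the project's actual code or fix; `XmlNameSanitizer`, `isSafeName`, `toSafeName`, and the fallback name are all hypothetical, and the check only covers a conservative ASCII subset of what XML allows.

```java
import java.util.regex.Pattern;

// Hypothetical helper, not part of PatentPublicData or dom4j.
final class XmlNameSanitizer {
    // Conservative ASCII subset of XML names: a letter or underscore first,
    // then letters, digits, '_', '.' or '-'. Real XML names permit more.
    private static final Pattern SAFE_NAME =
            Pattern.compile("[A-Za-z_][A-Za-z0-9_.-]*");

    private XmlNameSanitizer() {}

    // True if the key can be used as an element name under the subset above.
    static boolean isSafeName(String name) {
        return name != null && SAFE_NAME.matcher(name).matches();
    }

    // Returns the key unchanged when it is already safe; otherwise replaces
    // disallowed characters with '_', drops leading characters that cannot
    // start a name, and falls back to the given name if nothing is left.
    static String toSafeName(String key, String fallback) {
        if (isSafeName(key)) {
            return key;
        }
        if (key == null) {
            return fallback;
        }
        String cleaned = key.replaceAll("[^A-Za-z0-9_.-]", "_");
        cleaned = cleaned.replaceFirst("^[^A-Za-z_]+", "");
        return cleaned.isEmpty() ? fallback : cleaned;
    }
}
```

With this sketch, `XmlNameSanitizer.toSafeName("--", "UNKNOWN")` falls back to `UNKNOWN` while an ordinary key such as `PATN` passes through unchanged. Whether a bad key should be renamed, skipped, or logged is a design decision for the parser; the sketch only shows the validation step.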