USPTO / PatentPublicData

Utility tools to help download and parse patent data made available to the public

transformer tool java.lang.IllegalArgumentException #102

Closed sotnikov-s closed 4 years ago

sotnikov-s commented 4 years ago

Hello. Another transformer issue: running it with the parameters -f=pftaps19950704_wk27.zip --type=json --outDir=. --outBulk=false --prettyPrint=true --skip 1596 --limit 1 results in the following exception:

log4j:WARN No appenders could be found for logger (gov.uspto.patent.PatentDocFormatDetect).
log4j:WARN Please initialize the log4j system properly.
log4j:WARN See http://logging.apache.org/log4j/1.2/faq.html#noconfig for more info.
Exception in thread "main" java.lang.IllegalArgumentException: Comparison method violates its general contract!
    at java.base/java.util.TimSort.mergeLo(TimSort.java:781)
    at java.base/java.util.TimSort.mergeAt(TimSort.java:518)
    at java.base/java.util.TimSort.mergeCollapse(TimSort.java:448)
    at java.base/java.util.TimSort.sort(TimSort.java:245)
    at java.base/java.util.Arrays.sort(Arrays.java:1516)
    at java.base/java.util.ArrayList.sort(ArrayList.java:1717)
    at java.base/java.util.stream.SortedOps$RefSortingSink.end(SortedOps.java:392)
    at java.base/java.util.stream.Sink$ChainedReference.end(Sink.java:258)
    at java.base/java.util.stream.AbstractPipeline.copyIntoWithCancel(AbstractPipeline.java:503)
    at java.base/java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:488)
    at java.base/java.util.stream.AbstractPipeline.wrapAndCopyInto(AbstractPipeline.java:474)
    at java.base/java.util.stream.FindOps$FindOp.evaluateSequential(FindOps.java:150)
    at java.base/java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:234)
    at java.base/java.util.stream.ReferencePipeline.findFirst(ReferencePipeline.java:543)
    at gov.uspto.patent.model.entity.Name.getShortestSynonym(Name.java:50)
    at gov.uspto.patent.serialize.JsonMapperStream.writeName(JsonMapperStream.java:380)
    at gov.uspto.patent.serialize.JsonMapperStream.writeEntity(JsonMapperStream.java:327)
    at gov.uspto.patent.serialize.JsonMapperStream.output(JsonMapperStream.java:127)
    at gov.uspto.patent.serialize.JsonMapperStream.write(JsonMapperStream.java:97)
    at gov.uspto.bulkdata.tools.transformer.TransformerRecordProcessor.writeOutputType(TransformerRecordProcessor.java:142)
    at gov.uspto.bulkdata.tools.transformer.TransformerRecordProcessor.process(TransformerRecordProcessor.java:90)
    at gov.uspto.bulkdata.RecordReader.read(RecordReader.java:195)
    at gov.uspto.bulkdata.RecordReader.read(RecordReader.java:122)
    at gov.uspto.bulkdata.RecordReader.read(RecordReader.java:85)
    at gov.uspto.bulkdata.RecordReader.read(RecordReader.java:43)
    at gov.uspto.bulkdata.cli.Transformer.exec(Transformer.java:77)
    at gov.uspto.bulkdata.cli.Transformer.main(Transformer.java:115)

Thanks for your attention!
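
For context on the message itself: "Comparison method violates its general contract!" is what TimSort throws when it detects that a Comparator is inconsistent, for example when compare(a, b) and compare(b, a) both report "greater". The trace shows the sort being triggered by a sorted stream followed by findFirst() inside Name.getShortestSynonym. The comparator used there is not shown in this report, so the following is only a sketch under assumptions: the class ShortestSynonymSketch and the names BROKEN, BY_LENGTH, and shortest are hypothetical and are not taken from the project. It shows one common way such a comparator goes wrong and a contract-respecting way to pick the shortest string.

```java
import java.util.Comparator;
import java.util.List;
import java.util.Optional;

public class ShortestSynonymSketch {

    // Hypothetical broken comparator (an assumption, not the project's code):
    // it never returns 0, so for two strings of equal length both
    // compare(a, b) and compare(b, a) return 1, which violates the contract.
    static final Comparator<String> BROKEN =
            (a, b) -> a.length() < b.length() ? -1 : 1;

    // Contract-respecting alternative: order by length, then break ties
    // with the natural String order so the result is deterministic.
    static final Comparator<String> BY_LENGTH =
            Comparator.comparingInt(String::length)
                      .thenComparing(Comparator.naturalOrder());

    // min() picks the smallest element without sorting the whole list.
    static Optional<String> shortest(List<String> synonyms) {
        return synonyms.stream().min(BY_LENGTH);
    }

    public static void main(String[] args) {
        List<String> synonyms = List.of(
                "International Business Machines", "I.B.M.", "IBM", "Big Blue");

        // Both calls return 1 for equal-length inputs: the contract violation.
        System.out.println(BROKEN.compare("ab", "cd"));
        System.out.println(BROKEN.compare("cd", "ab"));

        // Prints the shortest synonym using the consistent comparator.
        System.out.println(shortest(synonyms).orElse(""));
    }
}
```

Using min() instead of sorted().findFirst() also avoids the full sort, although a comparator with the same flaw would still give unreliable results, so the comparator is the part that needs to be consistent.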