USPTO / PatentPublicData

Utility tools to help download and parse patent data made available to the public
Other
182 stars 80 forks source link

TransformerCLI doesn't bring over all CPC classifications #72

Closed legolego closed 5 years ago

legolego commented 5 years ago

In this example, doc-number 09603307 in the source XML has one Main CPC classification and four Further classifications. In the resulting JSON file, there are only two.

The source file with the single patent is attached.

CPCs09603307.zip

The resulting JSON is:

"cpc": [ { "type": "main", "raw": "A01B 71/08", "normalized": "A01B 71/08", "facets": [ "0/A", "1/A/A01", "2/A/A01/A01B", "3/A/A01/A01B/A01B71", "4/A/A01/A01B/A01B71/A01B7108" ] }, { }, { "type": "main", "raw": "A01F 12/444", "normalized": "A01F 12/444", "facets": [ "0/A", "1/A/A01", "2/A/A01/A01F", "3/A/A01/A01F/A01F12", "4/A/A01/A01F/A01F12/A01F12444" ] }, { } ]