USPTO / PatentPublicData

Utility tools to help download and parse patent data made available to the public
Other
182 stars 80 forks source link

TransformerCLI: CPC classes should have one Main and multiple Further types #73

Closed legolego closed 5 years ago

legolego commented 5 years ago

Hi, Looking at the CPC classes in the transformed JSON files, all of the CPC classes have a type main. I think there's one, but not sure if always true. The rest of the CPC classes should be of type further. These are for patent number US9428405B2 (source XML attached).

US9428405B2-github.zip

Source XML: <classifications-cpc> <main-cpc> <classification-cpc> ... C02F ... </classification-cpc> </main-cpc> <further-cpc> <classification-cpc> ... B01D ... </classification-cpc> ... </further-cpc> </classifications-cpc>

Result JSON:

   "cpc": [
        {
            "type": "main",
            "raw": "B01D 17/0202",
            "normalized": "B01D 17/0202",
            "facets": [
                "0/B",
                "1/B/B01",
                "2/B/B01/B01D",
                "3/B/B01/B01D/B01D17",
                "4/B/B01/B01D/B01D17/B01D170202"
            ]
        },
        {
        },
        {
            "type": "main",
            "raw": "B01D 36/008",
            "normalized": "B01D 36/008",
            "facets": [
                "0/B",
                "1/B/B01",
                "2/B/B01/B01D",
                "3/B/B01/B01D/B01D36",
                "4/B/B01/B01D/B01D36/B01D36008"
            ]
        },