USPTO / PatentPublicData

Utility tools to help download and parse patent data made available to the public
Other
182 stars 80 forks source link

SGML Fragments / InventorNode #57

Closed aosingh closed 6 years ago

aosingh commented 7 years ago

After serializing into JSON files, only one inventor is extracted.

TEST DATA Patent document ID : US6167583A ZIP file name : bulkdata.uspto.gov/data2/patent/grant/redbook/fulltext/2001/pg010102.zip

CURRENT RESULT

 "inventors":[
        {
            "sequence":"",
            "name":{
                "type":"person",
                "raw":"Miyashita, Naoto",
                "prefix":"",
                "firstName":"Naoto",
                "middleName":"",
                "lastName":"Miyashita",
                "suffix":"",
                "abbreviated":"Miyashita, N.",
                "synonyms":[
                ]
            },
            "address":{
                "street":"",
                "city":"Yokohama",
                "state":"",
                "zipCode":"",
                "country":"JP",
                "email":"",
                "fax":"",
                "phone":""
            },
            "residency":"",
            "nationality":""
        }
    ]

SHOULD BE

"inventors":[
        {
            "sequence":"",
            "name":{
                "type":"person",
                "raw":"Miyashita, Naoto",
                "prefix":"",
                "firstName":"Naoto",
                "middleName":"",
                "lastName":"Miyashita",
                "suffix":"",
                "abbreviated":"Miyashita, N.",
                "synonyms":[
                ]
            },
            "address":{
                "street":"",
                "city":"Yokohama",
                "state":"",
                "zipCode":"",
                "country":"JP",
                "email":"",
                "fax":"",
                "phone":""
            },
            "residency":"",
            "nationality":""
        },
        {
            "sequence":"",
            "name":{
                "type":"person",
                "raw":"Abe, Masahiro",
                "prefix":"",
                "firstName":"Masahiro",
                "middleName":"",
                "lastName":"Abe",
                "suffix":"",
                "abbreviated":"Abe, M.",
                "synonyms":[
                ]
            },
            "address":{
                "street":"",
                "city":"Yokohama",
                "state":"",
                "zipCode":"",
                "country":"JP",
                "email":"",
                "fax":"",
                "phone":""
            },
            "residency":"",
            "nationality":""
        }
    ]

I have fixed this locally and tested it. The change is in the file

PatentDocument/src/main/java/gov/uspto/patent/doc/sgml/fragments/InventorNode.java

and the following line.

    private static final String FRAGMENT_PATH = "/PATDOC/SDOBI/B700/B720/B721";