Open MatthewLoffredo opened 2 months ago
On the first point, it's worth noting that PEDS will be deprecated very soon by the USPTO so if you want to use patent_client for landscaping, your best bet is to use the open data portal (patent_client.odp) module.
Secondly, if you only want retrieve a list of applications and don't need all the patent_client bells and whistles then it's pretty trivial to just do a python requests call to the odp search API here: https://beta-data.uspto.gov/apis/getting-started. Construct a suitable query string (e.g. something like q = "applicationMetaData.applicantBag.applicantNameText: Google")
On normalising different company names, there is no universal solution to this problem so you'll almost always have to manually do this. Patent applicant name data is notoriously fractured due to attorneys typing these things in slightly differently so you'll need to build a custom query to capture as many variants as you can. Directly calling the uspto odp api also lets you do wildcards in your search query to try to capture as many variants as you can. It's pretty interesting to do a broad subject-matter search and then listing all the unique ways a company has been listed. I once did one where a well-known Korean research institute was named in 15 different ways...
Hi, I'm trying to follow the page here (https://patent-client.readthedocs.io/en/latest/examples/3%20-%20Company%20Ownership.html) to get a company's patent portfolio, but the first step doesn't seem to be working:
Calling the first step:
results in:
Additionally, I was wondering if there were any tricks you had for getting all the different potential assignee names for companies? Some companies might have multiple different ones, which using this method, would potentially exclude any name variations. Any advice on how to handle that? The end goal is to get a company's complete patent portfolio.