google / patents-public-data

Patent analysis using the Google Patents Public Datasets on BigQuery
https://bigquery.cloud.google.com/dataset/patents-public-data:patents
Apache License 2.0

Generating new Document Embeddings #62

Open LucasLaughlin opened 2 years ago

LucasLaughlin commented 2 years ago

From what I can glean, you are generating the patent embeddings directly. Is this correct? If so, do you have to rerun the model every time new patents are published?

Is it possible to generate new embeddings from unpublished patent data, such as an abstract?