google / patents-public-data

Patent analysis using the Google Patents Public Datasets on BigQuery
https://bigquery.cloud.google.com/dataset/patents-public-data:patents
Apache License 2.0

embedding model is not found// Automated Patent Landscaping #47

Open d-w216 opened 3 years ago

d-w216 commented 3 years ago

Hello, good day!

Thank you for your work.

Unfortunately, the embedding model cannot be found. Is there a new storage location?

error message:

NotFound: 404 GET https://storage.googleapis.com/download/storage/v1/b/patent_landscapes/o/models%2F5.9m%2Fcheckpoints%2Fcheckpoint?alt=media: The specified bucket does not exist.: ('Request failed with status code', 404, 'Expected one of', <HTTPStatus.OK: 200>, <HTTPStatus.PARTIAL_CONTENT: 206>)

feltenberger commented 3 years ago

Hi - yes, it appears the model has been deleted from cloud storage somehow. Sorry about that. If/when I find some time, I can try to recreate the model and put it somewhere that it won't be deleted.

ghost commented 3 years ago

Hi Dave, if you find time to do this, I would greatly appreciate it too. Thanks, Mike

d-w216 commented 3 years ago

Hi Dave, some key details about the embeddings (or about how they were created) would also help. Then we could reconstruct the embeddings ourselves.

feltenberger commented 3 years ago

Hi - makes sense. I do have some code in there to generate your own embeddings, but if you want to build your own, I suggest using gensim. Sometime this summer I'll publish embeddings generated via gensim and get it working end to end. For reference: https://radimrehurek.com/gensim/models/word2vec.html
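
For anyone who wants to try the gensim route in the meantime, a minimal sketch might look like the following. It is not the code from this repository and it will not reproduce the deleted 5.9m model. The tokenizer, the function names, and the hyperparameters are all assumptions chosen for illustration, and the parameter names assume gensim 4.x (older releases used `size` instead of `vector_size`).

```python
import re
from typing import Iterable, List


def tokenize(text: str) -> List[str]:
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def train_word2vec(texts: Iterable[str], vector_size: int = 100):
    """Train a Word2Vec model on an iterable of raw text strings.

    Hypothetical helper: the hyperparameters below are placeholders,
    not the settings used for the original embeddings.
    """
    # Imported here so that tokenize() works without gensim installed.
    from gensim.models import Word2Vec

    sentences = [tokenize(t) for t in texts]
    return Word2Vec(
        sentences=sentences,
        vector_size=vector_size,  # gensim 4.x name; 3.x used `size`
        window=5,
        min_count=1,  # keeps every token; raise this for a real corpus
        workers=2,
    )
```

With a list of abstracts in `texts`, `model = train_word2vec(texts)` returns a model whose vectors are available through `model.wv["battery"]` and whose nearest neighbours come from `model.wv.most_similar("battery")`. `model.save("patent_word2vec.model")` writes it to disk so it can be stored somewhere more durable.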

peiyu-wang commented 1 year ago

Hi @feltenberger, any update on this? I would appreciate it a lot if we could figure out a way to download the model. Thank you!

Forbidden: 403 GET https://storage.googleapis.com/download/storage/v1/b/patent_landscapes/o/models%2F5.9m%2Fcheckpoints%2Fcheckpoint?alt=media: wangpeiyu37@gmail.com does not have storage.objects.get access to the Google Cloud Storage object. Permission 'storage.objects.get' denied on resource (or it may not exist).: ('Request failed with status code', 403, 'Expected one of', <HTTPStatus.OK: 200>, <HTTPStatus.PARTIAL_CONTENT: 206>)
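
To check whether the checkpoint is reachable at all, independent of the notebook and of any account credentials, one option is to request the same URL anonymously and look only at the status code. The sketch below uses just the Python standard library. The helper names `media_url` and `probe` are made up for this example, and the bucket and object path are taken from the error messages in this thread.

```python
import urllib.error
import urllib.parse
import urllib.request


def media_url(bucket: str, object_name: str) -> str:
    """Build the download URL in the form shown in the error messages."""
    # Slashes in the object name must be percent-encoded as %2F.
    encoded = urllib.parse.quote(object_name, safe="")
    return (
        "https://storage.googleapis.com/download/storage/v1/"
        f"b/{bucket}/o/{encoded}?alt=media"
    )


def probe(url: str, timeout: float = 10.0) -> int:
    """Return the HTTP status code of an anonymous GET request."""
    request = urllib.request.Request(url, method="GET")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # Error responses such as 403 or 404 are raised as HTTPError.
        return err.code
```

Calling `probe(media_url("patent_landscapes", "models/5.9m/checkpoints/checkpoint"))` returns the status code. A 404 or 403 there would be consistent with both errors reported above, since the 403 message itself notes that the resource may not exist.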