-
Hey guys,
I'm trying to parallelize and queue machine learning jobs and have run into a problem. Occasionally, the BERT Question and Answer model will permanently hang when run in multiple workers …
-
### Description
Scala 2.12 seems to be de-facto across a lot of spark packages but I'm using packages that require spark w/scala-2.13 specifically and it's obviating working with Spark NLP (also, my …
-
Hi!
I am interested in using the [SimCSE](https://github.com/princeton-nlp/SimCSE#use-simcse-with-huggingface) model to get sentence embeddings, as its embeddings have been shown to significantly o…
-
I am using [this](https://colab.research.google.com/github/patil-suraj/exploring-T5/blob/master/T5_on_TPU.ipynb) notebook to train model
I have following dataset which is different from SQUAD used i…
-
I tried to use the local model from my machine to generate questions, but it seems the pipeline can't handle the model argument if the model is stored on the local machine. It always downloads the mod…
-
Hi,
I am trying to mine some parallel sentences from two large monolingual corpora (over 40M sentences each). In the first step I encoded the two sides and then called `mine_bitexts.py` to do the mag…
-
家人们,谁懂啊?
显卡4090,cuda12.3,死活跑不起来…
-
Hi,
I'm trying to deploy this model via torch serve. But when i try and save the tokenizer:
`tokenizer = AutoTokenizer.from_pretrained("valhalla/t5-base-e2e-qg")
tokenizer.save_pretrained('/du…
-
****
-
**Opensearch Version**: 2.15
**Environment**: AWS OpenSearch
### Issue Description
I am executing hybrid queries with three sub-queries on a large dataset containing tens to hundreds of thous…