-
### System Info
```
- `transformers` version: 4.30.2
- Platform: Linux-5.14.0-162.22.2.el9_1.x86_64-x86_64-with-glibc2.34
- Python version: 3.9.17
- Huggingface_hub version: 0.16.4
- Safetenso…
```
-
The models listed below are for the GSoC 2023 prerequisite task only.
We provide several potential candidates. Please select only one which hasn't already been selected (look at the checkboxes and…
-
### Describe the bug
I am trying to embed documents with the FinBert model via `TransformerDocumentEmbeddings()`, and I've read the docs about how to do this. However, I am running into an issue wi…
-
I am currently attempting to apply BERTopic to company annual reports. However, these documents contain more than 512 tokens, which is the input limit for most BERT-based models. Is splitting the document into…
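
One common way to handle inputs longer than a model's token limit is to split them into overlapping windows. The following is a minimal sketch under stated assumptions, not BERTopic's own method: `split_into_chunks`, `max_len`, and `overlap` are hypothetical names, and the tokens are plain whitespace-split words, whereas a real tokenizer produces subword tokens and reserves room for special tokens.

```python
def split_into_chunks(tokens, max_len=512, overlap=50):
    """Split a token list into windows of at most max_len tokens.

    Consecutive windows share `overlap` tokens so that text near a
    boundary appears in both neighbours. All names are illustrative.
    """
    if max_len <= 0:
        raise ValueError("max_len must be positive")
    if not 0 <= overlap < max_len:
        raise ValueError("overlap must satisfy 0 <= overlap < max_len")

    step = max_len - overlap
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + max_len])
        # Stop once the current window already reaches the end of the input.
        if start + max_len >= len(tokens):
            break
    return chunks


if __name__ == "__main__":
    # Assumption: whitespace splitting stands in for a real tokenizer.
    words = ("annual report text " * 400).split()
    pieces = split_into_chunks(words, max_len=512, overlap=50)
    print(len(pieces), [len(p) for p in pieces])
```

Each chunk can then be embedded as its own document; whether chunk-level results should be merged back to the report level depends on the analysis.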
-
There are many models on Hugging Face that would be good to test, to check whether they compile and produce accurate results.
The most downloaded ONNX models would be a good start: https://huggingface.co/models?library=onnx&so…
-
I am experimenting with `bertopic.representation.OpenAI()`, using a free license token.
I got the RateLimitError below because the per-minute limit was reached. There were 380K documents, each document is a…
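
Per-minute rate limits are often handled by retrying with exponential backoff. The following is a generic sketch, not part of BERTopic or the OpenAI client: `call_with_backoff` and all of its parameters are hypothetical, and the exception type to retry on must be supplied by the caller.

```python
import time


def call_with_backoff(func, max_retries=5, base_delay=1.0,
                      retry_on=(Exception,), sleep=time.sleep):
    """Call func(), retrying with exponentially growing delays.

    retry_on is the tuple of exception types that trigger a retry.
    The last failure is re-raised once max_retries attempts are used up.
    All names here are illustrative assumptions.
    """
    for attempt in range(max_retries):
        try:
            return func()
        except retry_on:
            if attempt == max_retries - 1:
                raise
            # Delays grow as base_delay * 1, 2, 4, 8, ...
            sleep(base_delay * 2 ** attempt)


if __name__ == "__main__":
    state = {"calls": 0}

    def flaky():
        # Stand-in for a rate-limited API call: fails twice, then succeeds.
        state["calls"] += 1
        if state["calls"] < 3:
            raise RuntimeError("rate limit reached")
        return "ok"

    print(call_with_backoff(flaky, base_delay=0.01, retry_on=(RuntimeError,)))
```

With hundreds of thousands of documents, backoff only spaces requests out; it does not reduce the total number of requests needed.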
-
I am trying to use the FinBert model with BERTopic, and I've read the docs about how to create document embeddings using flair. However, I am running into an issue that I can't figure out. Here is m…
-
### System Info
```shell
optimum-habana 1.5.0
docker version 1.9.0
pytorch version 1.13.1
```
### Information
- [ ] The official example scripts
- [X] My own modified scripts
### Tasks
- [ ] A…
-
### Search before asking
- [X] I searched the [issues](https://github.com/ray-project/ray/issues) and found no similar issues.
### Ray Component
Ray Tune
### Issue Severity
High: It b…
-
For a given document, the extracted keywords differ depending on whether the document is passed in alone or as part of a batch of multiple documents (batch size = 64).
More keywords are extracted when it is passed in so…