-
I am embedding large noisy text docs (websites) using UMAP for further downstream tasks, including classification (site category), regression (advertising performance) and clustering.
Sometimes the…
-
I have two questions about the fine-tuning implementation.
1. In VISTA_Evaluation_FineTuning/downstream_finetune_example/,
when I run run_ds_cirr.py and load the eva_clip weights from this part
`
s…
-
### Have you searched existing issues? 🔎
- [X] I have searched and found no existing issues
### Describe the bug
I am using BERTopic with llama3.1 for topic modelling. My text is long, so I use doc_…
-
### Describe the bug
![image](https://github.com/user-attachments/assets/3320fc08-3563-422a-8091-e846782a72aa)
### To reproduce
```python
langfuse_callback_handler = LlamaIndexCallbackHandler(…
-
-
Hi Umar,
I absolutely love your YT video explaining the PaliGemma model and thanks for all the good work! I found [this line](https://github.com/hkproj/pytorch-paligemma/blob/f7cdcc28240f2377419bb8…
-
[sft.zip](https://github.com/user-attachments/files/17394377/sft.zip)
[ruozhiba_cvt.zip](https://github.com/user-attachments/files/17394510/ruozhiba_cvt.zip)
I try to fine-tune the ruozhiba datase…
-
Hi,
I still get this error message even though I am using the latest commit and version of BERTopic. Could you please help?
Code:
import openai
from bertopic.backend import OpenAIBackend
from op…
-
### 🐛 Describe the bug
What happened:
In the Graph memory example notebook (https://colab.research.google.com/drive/1PfIGVHnliIlG2v8cx0g45TF0US-jRPZ1?usp=sharing) attempting to add a memory throws t…
-
The email is multipart/alternative plus an image attachment.
If the text part is removed, there is no issue with the embed.
(as reported by SysPete)
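
For anyone trying to reproduce this, a minimal sketch of a test message with that shape, built with Python's standard `email` package. The exact MIME layout of the failing email is not given in the report, so the nesting here (a multipart/alternative body inside multipart/mixed, with the image attached alongside) is an assumption, and the addresses, subject, file name and `build_message` / `structure` helpers are hypothetical.

```python
from email.message import EmailMessage

# Only the 8-byte PNG signature: a stand-in for real image data
# (hypothetical, not taken from the original report).
FAKE_PNG = b"\x89PNG\r\n\x1a\n"


def build_message(include_text: bool = True) -> EmailMessage:
    """Build a test email with an HTML body and an image attachment.

    With include_text=True the body is multipart/alternative
    (text/plain + text/html); with include_text=False the plain-text
    part is left out, which is the case reported as working.
    """
    msg = EmailMessage()
    msg["Subject"] = "embed test"          # hypothetical values
    msg["From"] = "sender@example.com"
    msg["To"] = "recipient@example.com"

    html = "<p>HTML body</p>"
    if include_text:
        msg.set_content("Plain-text body")
        msg.add_alternative(html, subtype="html")
    else:
        msg.set_content(html, subtype="html")

    # Adding an attachment wraps the existing body in multipart/mixed.
    msg.add_attachment(
        FAKE_PNG, maintype="image", subtype="png", filename="image.png"
    )
    return msg


def structure(msg: EmailMessage) -> list[str]:
    """Return the content types of all parts, in depth-first order."""
    return [part.get_content_type() for part in msg.walk()]
```

Comparing `structure(build_message(True))` with `structure(build_message(False))` shows the only difference between the two cases is the multipart/alternative wrapper and its text/plain part, which may help narrow down where the embed handling goes wrong.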