-
> It occurred to me that we could apply the same regular expressions, and even the callkewords and qcallsuggest methods from Manticore, and, for example, start a new parsing process that will, every …
-
**Describe the bug**
When using a SmoothQuantModifier together with CPU offloading, a device conflict occurs: tensors are not on the right device.
**Expected behavior**
CPU offloading should work with SmoothQu…
-
### Terraform Core Version
1.5.7
### AWS Provider Version
5.29.0
### Affected Resource(s)
resource: aws_sagemaker_model
### Expected Behavior
I'm attempting to create an AWS Sagem…
-
Hello everyone, I'm encountering a memory issue while fine-tuning a 7B model (such as Mistral) using a repository I found. Despite having 6 H100 GPUs at my disposal, I run into out-of-memory errors wh…
-
### Describe the issue
Many recent works extend the context length, but as the context grows, so does the KV cache, which leads to a sharp rise in HBM usage. Therefore, whet…
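
For a sense of scale, here is a minimal sketch of how the KV cache grows linearly with context length. The model dimensions (32 layers, 32 KV heads, head dimension 128, fp16) are illustrative assumptions and do not come from the issue; the function name `kv_cache_bytes` is hypothetical.

```python
def kv_cache_bytes(seq_len, n_layers, n_kv_heads, head_dim,
                   bytes_per_elem=2, batch_size=1):
    """Estimate KV cache size in bytes for a decoder-only transformer."""
    # Factor 2: each layer stores one key tensor and one value tensor.
    return (2 * batch_size * n_layers * n_kv_heads
            * head_dim * seq_len * bytes_per_elem)


if __name__ == "__main__":
    # Assumed dimensions, for illustration only.
    for seq_len in (4096, 32768, 131072):
        size = kv_cache_bytes(seq_len, n_layers=32, n_kv_heads=32, head_dim=128)
        print(f"{seq_len:>7} tokens: {size / 2**30:.1f} GiB")
```

Under these assumptions the cache costs 0.5 MiB per token, so every doubling of the context doubles the HBM needed for it.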
-
**Is your feature request related to a problem? Please describe.**
Is it possible to use an Ollama embedding model for plugin selection while using an OpenAI model for agents?
See my config file below:
{
…
-
### 🐛 Describe the bug
I tried to implement the `causal_lower_right` masking in flex attention. This requires the masking function to know the difference in lengths of keys and queries:
```python
…
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain.js documentation with the integrated search.
- [X] I used the GitHub search to find a …
-
### 🐛 Describe the bug
Hello,
I'm trying to quantize llama models using `OVQuantizer` but I'm facing an error:
```
IndexError: list index out of range
```
I tried both llama3 and llama2.
###…
-
### Bug Description
The MilvusVectorStore fails to connect when `enable_sparse` is True. When I set it to False, it connects.
### Version
0.10.38
### Steps to Reproduce
You just have to do:
```…