-
Has anyone tried downscaling the K and/or Q matrices for repeated layers in franken-merges? This should act like changing the temperature of the softmax and effectively smooth the distribution:
**H…
-
How can I load the pretrained Dinov2 model from a local source so that it loads the model even when there is no internet connection and does not attempt to download it again from the server?
The norm…
-
Embedded using the API
Significantly underperforms vs other models
In most of the cases, each embedding is a full text of the Supreme Court decision
Indexed with hnsw.
Should I use a different…
-
### Describe the problem you're trying to solve
Proof of Concept (PoC) a generic inference container that uses Triton as the inference engine and can download and utilize a ModelKit as efficiently as…
-
## Description
#2916 fixed an error check that was previously not properly implemented in `IndependentSource::sample,` where the variables `n_accept` and `n_reject` are now properly defined as st…
-
Hi everyone!My python is 3.8 and torch is 1.8.
When my "train_ddgan.py" run into “**from score_sde.models.discriminator import Discriminator_small, Discriminator_large**”,nothing happens and program …
-
This issue aims at keeping track of the models that would be interesting to get added to candle. Feel free to make a comment to mention a new model, or vote for a model already in the list.
- [musicg…
-
Will it be possible to add OpenVINO support for Intel-based processors? The repo by @zhuzilin [here](https://github.com/zhuzilin/whisper-openvino) shows a speed improvement of nearly 50%, so users wil…
-
If I wanted to use one of the larger wespeaker models - say 293 - would I just download the .pt file and point to it in the config.yaml?
-
### Search before asking
- [X] I had searched in the [issues](https://github.com/eosphoros-ai/DB-GPT/issues?q=is%3Aissue) and found no similar issues.
### Operating system information
Linux
### P…