-
I'm attempting to run the Starcoder model on a Mac M2 with 32GB of memory using the Transformers library in a CPU environment. Despite setting load_in_8bit=True, I'm encountering an error during execu…
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain documentation with the integrated search.
- [X] I used the GitHub search to find a…
-
### System Info
CPU: x86_64, memory: 1024GB, GPU: 8*A6000 48GB each, Tensorrt-LLM version 0.9.0.DEV20240226. NVIDIA-Driver Version: 535.171.04 CUDA Version: 12.2; OS - Ubuntu 22.04
### Who can hel…
-
### Description
Hi, I am using the latest version of LLamaSharp and my model is Llama-3 70b gguf version, when the number of GpuLayerCount is 0 to 5, although it is not very fast, I get the answer, b…
-
Notice: In order to resolve issues more efficiently, please raise issue following the template.
(注意:为了更加高效率解决您遇到的问题,请按照模板提问,补充细节)
## 🐛 Bug
使用1.x版本funasr,跑aishell训练例子时,在stage 1 compute_audio_cmv…
-
Hi Maarten,
I'm attempting to execute one of your examples in Google Colab for processing large-scale databases. Here are the specifications of my machine: 8 NVIDIA A100 cards and a 50TB SSD. Howev…
-
报错内容
[ERROR] Exception occurred while handling uri: 'http://10.230.107.105:8777/api/local_doc_qa/local_doc_chat'
Traceback (most recent call last):
File "handle_request", line 132, in handle_r…
-
Hi,
I've followed the this blog post https://huggingface.co/blog/fine-tune-whisper to finetune Whisper with my own dataset.
Everything seems to be working as expected. However, I've noticed a stra…
-
Thanks for providing the LongLoRA forward functions.
Your flash-attn/non-flash-attn implementations of SSN show divergent behavior in my case.
For a repro script, please have a look at the issue I…
-
Right now, file size stats are only available through the standalone trace viewer, not the embedded version within Chrome. Especially with how often Chrome has problems dealing with too-large traces, …