-
![image](https://github.com/SevaSk/ecoute/assets/101507368/d48dba96-7d32-476a-8ab2-695d09e9775a)
Speaker:[.]
-
报错信息:
Using downloaded and verified file: /data/MiniGPT4Qwen/lavis/../cache/dataset/llava_instruct/llava_instruction_156k.json
2024-07-02 14:04:21,365 [INFO] Building datasets...
Using downloaded a…
-
The actual embeddings are not available yet.
When you run the example in the repo directly:
```python
from transformers import AutoModel
from numpy.linalg import norm
cos_sim = lambda a,b: (a…
-
I have tried whisper.cpp on my iPhone and it runs very fast , so I wonder if it is possible that llama.cpp could support it.
thank you .
-
### OpenVINO Version
2024.0.0-14509-34caeefd078-releases/2024/0
### Operating System
Ubuntu 20.04 (LTS)
### Device used for inference
CPU
### Framework
PyTorch
### Model used
…
-
### Motivation.
There are more and more use cases, where we need to transfer KV caches between vLLM instances, or store KV caches for future use. Some concrete use cases:
- Disaggregated prefilling.…
-
### Describe the bug
重启电脑后出现以下问题,重启前多卡是正常运行的
问题:多卡运行模型启动报错,单卡运行正常
### To Reproduce
To help us to reproduce this bug, please provide information below:
1. Your Python version.
3.10
>>> impo…
-
**Is your feature request related to a problem? Please describe.**
Thank you for putting this together. It helped me a lot to learn the big picture of LLMs.
I tried to build and run it on an…
-
I'm running on a CPU-only host.
My pip freeze output:
aiohttp==3.9.5
aiosignal==1.3.1
annotated-types==0.6.0
anyio==4.3.0
async-timeout==4.0.3
attrs==23.2.0
beautifulsoup4==4.12.3
bitarray…
-
### Your current environment
```
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS…