-
### Describe the bug
Running this command causes tabby to segfault:
```
$ tabby serve --model TabbyML/StarCoder-1B
2024-08-09T21:36:29.342833Z INFO tabby::serve: crates/tabby/src/serve.rs:116: Startin…
-
First of all, thank you for the awesome contribution. I ran this project successfully and got the result below:
![image](https://user-images.githubusercontent.com/25050291/152751749-c9332ba1-9657-46…
-
Whenever I try to ingest my documents, it shows me this error. *WHY*?
It also suggests `pip install cpp-llama-python` as the solution.
Can someone tell me what this is used for in the current pr…
-
Hi, first of all, thanks for the code.
When I ran inference on a GTX 1080 Ti, the speed was 5 frames/sec at a resolution of 640 × 384.
In the paper, it has been mentioned that the speed on Titan…
-
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:3 and cuda:0! (when checking argument for argument mask in method wrapper_CUDA__masked_scatter_)
May I ask…
-
### Search before asking
- [X] I have searched the Ultralytics YOLO [issues](https://github.com/ultralytics/ultralytics/issues) and [discussions](https://github.com/ultralytics/ultralytics/discussion…
-
### The bug
It seems that ffmpeg is stuck in an infinite loop.
ffmpeg has been running at 100% CPU usage for hours, and the output file size is only 48 bytes.
The HW decoding and encoding switches are both ON.
Have accu…
-
Here is the paper:
4.2.1 CPU acceleration. To accelerate the CPU stage, three independent sequential MSA searches can be arranged in parallel (Fig 4). Due to the limited CPU cores accompanying GPUs…
-
Hi,
Some of the models on Hugging Face show support for `create_chat_completion`, but this plugin currently seems to support only `Simple inference`. Will `Chat Completion` be supported in a future ve…
-
Dear MobileSAM Developers,
I hope this message finds you well. I am reaching out to discuss potential enhancements to the MobileSAM framework, particularly concerning its lightweight encoder's perf…