-
## 🐛 Bug
There is no output token when I use mlc-chat-cli to run the compiled rwkv-raven-1b5 and 3b models.
## To Reproduce
I followed the instructions [here](https://llm.mlc.ai/docs/comp…
-
## 🐛 Bug
I got this error while compiling my retrained llama2-13b model.
My model was converted using the [convert_llama_weights_to_hf.py ](https://github.com/huggingface/transformers/blob/main/…
-
How to manually load an image into cuda using C++ and display with HolovizOp?
My main doubt is that I don't know what `Class` should use and how to store image data into it.
The python version…
-
### 问题描述 Issue Description
[yanjiaxing@in-dev-docker:/apollo/data/Paddle/build]$ make -j8
Scanning dependencies of target extern_eigen3
Scanning dependencies of target copy_kernels_command
Scannin…
-
## 🐛 Bug
### Python API
Python3 segmentation fault error when execute `generate` function with `StreamIterator` in a **separate thread**.
In my opinion, there is a problem for real apps like pyth…
-
Conversion from (at least) integer to string columns (done a lot for printing on the cudf-python side of things) appears to have uninitialized device memory accesses. Related #8873.
Consider the fo…
-
**Describe the bug**
I get an error trying to import cudf:
**Steps/Code to reproduce bug**
```
>>> import cudf
Traceback (most recent call last):
File "", line 1, in
File "/home/z0022z7b…
-
## 🐛 Bug
I used mlc llm to compile Llama-2-7b-chat-hf, but when using mlc-chat, I reported an error:
subprocess.CalledProcessError: Command '['D:\Program\Anaconda3\envs\mlc-chat\python.exe', '…
-
### Describe the bug
Reproducer:
```python
import dpctl.tensor as dpt
import intel_extension_for_pytorch
import torch
array_dpt_gpu = dpt.reshape(dpt.arange(1000, device="gpu", dtype=dpt.flo…
-
Reproducer:
```python
import dpctl.tensor as dpt
import intel_extension_for_pytorch
import torch
array_dpt_gpu = dpt.reshape(dpt.arange(1000, device="gpu", dtype=dpt.float32), (4, 250))
array_…