-
There has been a lot of advancements recently in achieving context for dialog models through a separate context layer. Eg. [HRAN](https://arxiv.org/pdf/1701.07149.pdf) or [VHRED](http://www.cs.toronto…
-
### Is there an existing issue for this problem?
- [X] I have searched the existing issues
### Operating system
Windows
### GPU vendor
Nvidia (CUDA)
### GPU model
RTX4090 i9
### GPU VRAM
64GB…
z0rge updated
2 weeks ago
-
I run triton with tensorrtllm. But when i give long text to llm, triton returns a long array of zeros named output_log_probs in every token. If my text be longer than some number, the request not work…
-
### Bug Description
DEBUG:filelock:Lock 13121339920 released on /Users/whs/Library/Caches/llama_index/.locks/models--openai--clip-vit-base-patch32/9bfb42aa97dcd61e89f279ccaee988bccb4fabae.lock
Lock …
-
### Describe the bug
After running
```interpreter --model i ```
without any previous configuration, the following error is raised (after some input is sent):
```shell
> wheres my chrime_an…
-
Hi I've been working further to ingest context.json and produce my own flavor of outputs. In that process I have noticed a couple of inconsistencies/issues/potential improvements to the data.
I'm …
-
Perhaps stating that PD is an active entity in the system was not the right phrasing, which threw off the readers. So, it might be more fitting to have a notion of an execution-context as the active e…
-
### Your current environment
```text
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
…
-
## Description
- RuntimeError: Unhandled attribute type
## Reproduce
```
git checkout kkannan/forge_model_issues
git submodule update --recursive
export BACKEND_ARCH_NAME=wormhole_b0 ARCH…
-
Hello, I am trying to infer the quantized model SD2_1 using QNN 2.28 on a Samsung S24 phone。
I encountered an issue where the inference process using SDV1-5 gets stuck when inferring the Unet model r…