-
### System Info
Built tensorrtllm_backend from source using dockerfile/Dockerfile.trt_llm_backend
tensorrt_llm 0.13.0.dev2024081300
tritonserver 2.48.0
triton image: 24.07
Cuda 12.5
### Wh…
-
### Description
### References
- https://distill.pub/2016/augmented-rnns/
- https://medium.com/syncedreview/memory-attention-sequences-8522f531dd43
- https://jalammar.github.io/illustrated-transforme…
-
Hi
I was trying to run ' python a3c_main.py --evaluate 2 --load saved/pretrained_model' to run inference using the pre-trained model. However, I faced the following dimension error without changing…
-
Hey there,
I'm not able to *build* the `dropout-layer-norm`.
I used this Docker image: `nvcr.io/nvidia/pytorch:23.09-py3` and then installed the flash-attention components via:
flash_attn…
-
**Please check the [Github](https://github.com/zezhishao/MTS_Daily_ArXiv) page for a better reading experience and more papers.**
## Time Series
| **Title** | **Date** | **Comment** |
| --- | --- | -…
-
Thank you for the code! I've been using it as a reference for my own implementation. Have you replicated the results in the original blogpost..? Based on your update in the readme, it seems like you h…
-
I sat down to get my thoughts in order about community management and how I wanted to respond to the three proposals. That turned into a longer document available at https://bossett.io/bluesky-on-comm…
-
## Exploration
The element of a group giving the best results is marked in bold. If multiple elements are used together, all of them are marked. \
If a minor positive changes were noted in the early…
-
### Checklist
- [ ] The issue exists after disabling all extensions
- [ ] The issue exists on a clean installation of webui
- [X] The issue is caused by an extension, but I believe it is caused by a …
-
Hello, in your repository, I can't find a way to build a graph, please help me point out the way to build a graph, I want to use this as the beginning of my learning, I would be grateful and look forw…