-
### System Info
- 4x A100 SXM 40GB
- CUDA 12.4
- Docker: nvidia/cuda:12.4.0-devel-ubuntu22.04
- TensorRT-LLM version: 0.10.0
### Who can help?
@kaiyux
### Information
- [x] The official exampl…
-
## Checklist
- [X] I'm reporting a broken site support
- [X] I've verified that I'm running youtube-dl version **2021.06.06**
- [X] I've checked that all provided URLs are alive and pla…
-
### 🚀 The feature, motivation and pitch
Include the Llama-405B model as part of the nightly performance benchmarks here: https://buildkite.com/vllm/performance-benchmark/builds/4068
Is the reaso…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
- [X] 3. Please note that if the bug-related iss…
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) didn't find any similar reports.
…
-
### Informations
- **Qiskit Aer version**:
Requirement already satisfied: qiskit-aer-gpu in /global/homes/g/gzquse/.conda/envs/qiskit-summer/lib/python3.11/site-packages (0.14.2)
Requiremen…
-
Hi, thank you for releasing this wonderful codebase.
I noticed that the MFU calculation in your code is using the **TF32 Tensorcore peak FLOPs** as denominator, as hard-coded [here](https://github.…
-
### 🐛 Describe the bug
when enabling `kineto__tensor_core_insts` or `dram__bytes_read.sum`, the pytorch profiler outputs this warning and the trace becomes unusable. I have even tried adding the foll…
-
The "countries" listed below (by their iso 3166-3 code) appear in at least one indicator, but do not exist as groups on ckan. The data remains in the datasets coming from CPS, but isn't tied to a cka…
-
Put it in quotes too but my be wrong how put the quotes in.