-
## Description
The output of the TensorRT 10 model converted from ONNX is incorrect, while the output of the TensorRT 8.6 model is correct. The issue seems to be located in some fully connected lay…
-
Formulations to improve solving speed & reduce model size
Related issues:
Unit commitment, ramps & reserves
- [x] #654
- [ ] Implement start-up & shut-down trajectories
- [ ] #501
Storage
- [ ]…
-
- Resource adequacy & resiliency (in connection with multi-instance modelling T4.6)
- Is restarting model from previous results already implemented?
- Check if can improve speed of warm start
- W…
-
as titled,what is the minimun hardware requirement for 8b and 70b
-
### Description
BF16 matmul appears to be slower than F32 matmul on T4. From my test, BF16 appears to be half the speed. I believe this is a bug and bf16 should be the same speed (or possibly bette…
-
I would like to know what the coverage/quorum parameter means.
`histgrowth -t4 -l 1,2,1,1,1 -q 0,0,1,0.5,0.1`
-
Hello,
First and foremost, thank you for the commendation on our work and paper. I've been attempting to run Evo locally on T4 GPUs, but I encountered an issue with FlashAttn 2.0 not being supporte…
-
I'm running a single T4 node on GKE. Nodes are properly labeled as shown below and `sky show-gpus --cloud kubernetes` is also correct but fails to launch.
```bash
(sky) gcpuser@gfd-ebd1-head-…
-
Options
- We can use a model using API key
- We can download the model.
Model is stored in in Memory Database. Is it??? What is the PPT file?
The compute power is from CPU or GPU.
Cola…
-