-
The README recommends these hyperparameters to train a 70b model:
```text
--num_key_value_heads=8
--llama_intermediate_size=28672
--hidden_width=8192
--num_layers=80
--num_heads=64
```
but…
-
Dear OpenAI team,
Thank you for sharing with us this great implementation.
When I try to load FFHQ 1204 using command:
```
wget https://openaipublic.blob.core.windows.net/very-deep-vaes-ass…
-
The (FINOS) Architecture as Code Working group have built a C4 model for TraderX , and it would be great to host some docs on this repository, possibly embedding some diagrams.
I'll defer to @rocke…
maoo updated
6 months ago
-
Hello there!! I am trying to run d2go_beginner.ipynb notebook file but I am facing some issue with that I am getting that error during getting the model from model zoo ,, "RuntimeError: faster_rcnn_fb…
-
Solving `Max x * y` returns "Optimal" instead of declaring the QP to be non-convex. See the JuMP example:
```julia
julia> using JuMP, HiGHS
julia> model = Model(HiGHS.Optimizer)
A JuMP Model
Fe…
-
# Phone Models Table(JsOutput) | CompSci Blogs
Data table of phone models(Js Output)
[https://imaad08.github.io/student2/5.a/c4.1/2023/08/30/JsOutput.html](https://imaad08.github.io/student2/5.a/c4.…
-
**error count 4 example**
Starting file br_AmonZ_NIWA-UKCA2_refD1_r2i1p1f1_gnz_196001-201812.nc
C4.001.001: [parse_filename]: OK
C4.001.002: [parse_filename_timerange]: OK
C4.001.004: [file_name_e…
-
Hello, I'm trying to train YOLOv8-large in int4 format. I took the training recipe available at [sparsezoo](https://sparsezoo.neuralmagic.com/models/yolov8-l-coco-pruned85_quantized?hardware=deepspars…
-
### Describe the bug
When using the same command from the docs, I'm running into an error
### Error log
```log
atvremote -n "[LG] webOS TV" play_url=http://commondatastorage.googleapis.com/gtv-vid…
-
### Question
I downloaded llava-llama-2-13b from:
https://huggingface.co/liuhaotian/llava-llama-2-13b-chat-lightning-preview
Then I've quantized the model to 4-bit using .
```
git clone htt…