-
Hello, you say evaluation code is there, but I found nothing:-(
https://huggingface.co/datasets/fxmeng/pissa-evaluation-code
-
In https://huggingface.co/docs/dataset-viewer/pyspark, the first link behaves strangely.
The link is https://spark.apache.org/docs/latest/api/python
See the video:
https://github.com/user-att…
-
For example, [gemma-2-27b-bnb-4bit](https://huggingface.co/unsloth/gemma-2-27b-bnb-4bit) has 14.6 B parameters, while the main model https://huggingface.co/google/gemma-2-27b, has 27.2 B parameters?
-
### 用LiBai的Bert加载huggingface的权重对齐输出发现的一些问题,经过修改后可以与hugigngface输出对齐
#### 参数结构对比,可以先看最下面两个库中`Bert`的参数结构:
- **LiBai**的`embedding`部分和**huggingface**的没问题。
- 然后,看`LayerNorm`层,我们**LiBai**的`LayerNorm`层放在…
-
Create a docker image with the following models:
1. https://huggingface.co/martin-ha/toxic-comment-model/tree/main
2. https://huggingface.co/leondz/refutation_detector_distilbert/tree/main
3. htt…
-
How can I fix this error?
```
INFO:root:Loading the model vicuna-13b ...
Traceback (most recent call last):
File "/usr/local/lib/python3.9/site-packages/huggingface_hub/utils/_errors.py", line…
-
**Describe the bug**
gemma-2-9b-it-gptq-4bit CUDA OOM on RTX 3090
**GPU Info**
```
Sun Aug 4 02:35:35 2024
+-----------------------------------------------------------------------…
-
Transfusion seems to be also about AR + Diffusion Multi Modality Model (https://huggingface.co/papers/2408.11039). Are you using similar techniques? Is there any major difference?
-
On the example image, More General OCR at the bottom,
music notes, chemical compound, some geometrical shapes are shown.
Whats the python command to extract such things?
I've tried all the exam…
-
Up to this point, I did not find minimal examples to resume training from universal checkpoint.
The only example for using universal checkpoint is [here](https://github.com/microsoft/Megatron-DeepS…