-
### Question
I downloaded llava-llama-2-13b from:
https://huggingface.co/liuhaotian/llava-llama-2-13b-chat-lightning-preview
Then I've quantized the model to 4-bit using .
```
git clone htt…
-
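TensorFlow 1.0 has now been released and the API has changed. I updated every place in the code affected by the API changes, but when I test training it still reports that the matmul matrix shapes in the softmax loss function are wrong. I don't know where the problem is, so I'm asking here first; when I have time I'll read through the source code myself to track it down.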
log:
```bash
dim: 6865
Preparing data
bucket 0 contains 164276 records
bucket 1 contains…
-
@dennybritz
Hi,
First of all, many thanks for sharing your code. I am trying to use pretrained word embeddings instead of randomly initialized word embeddings based on the vocabulary size.
My pre…
-
- [ ] [Example - Qwen](https://qwen.readthedocs.io/en/latest/training/SFT/example.html)
# Example - Qwen
**DESCRIPTION:**
Here we provide a very simple script for supervised finetuning, which is …
-
### Project Title
An LLM app with a deeper understanding of a [GitHub repo](https://github.com/staru09/Github_analyser)
### Motivation
It becomes challenging to review PRs and solve issues fo…
-
### Your current environment
### Docker Image and Execution Command Overview
**Docker Image Built From:**
```dockerfile
FROM vault.habana.ai/gaudi-docker/1.16.2/ubuntu22.04/habanalabs/pytorch-in…
-
### Your current environment
The output of `python collect_env.py`
```text
Collecting environment information...
PyTorch version: 2.4.0+cpu
Is debug build: False
CUDA used to build PyTor…
-
### Your current environment
```text
Pytorch: 2.4.0-cuda12.1-cudnn9-devel
Python: 3.11.9
vLLM: 0.6.0
```
### 🐛 Describe the bug
**I try to run the following script to load the quantized model…
-
Explore capabilities of NLTK
-
- [ ] [README.md · defog/sqlcoder-7b-2 at main](https://huggingface.co/defog/sqlcoder-7b-2/blob/main/README.md?code=true)
# README.md · defog/sqlcoder-7b-2 at main
**DESCRIPTION:**
```yaml
license:…