-
### System Info
> peft version: 0.9.0
> accelerate version: 0.27.2
> transformers version: 4.37.0
> trl version: 0.7.12.dev0
> base model: openai-community/gpt2
> hardware: 2xA100
I'm doing a…
-
This is the code, I achived 4bits with normal libs
import gc
import os
import re
import torch
import tensorflow as tf
import pandas as pd
import matplotlib.pyplot as plt
import nltk
import …
-
Thank you for creating this amazing package. It looks very promising. But i am facing some issue on installation on Linux Mint (Ubuntu) desktop. Where i have a NVIDIA RTX 3060 GPU with 12GB VRAM. I …
-
I am trying to load mistral-7b-instruct-v0.2-bnb-4bit model from unsloth using the following
model, tokenizer = FastLanguageModel.from_pretrained(
model_name = "models/unsloth/mistral-7b-i…
-
## Summary
Assess performance of Bloom Db as number of rows scales
## Approach
I'm running repeated invocations of `test_create_acceessioning_wf()` and monitoring the size of the `generic_instance`…
-
### 🚀 The feature, motivation and pitch
It was not clear for me if the fp8 support is available for rocm. But I got with 5.2 :
fp8 quantization is currently not supported in ROCm.
There are pla…
-
### 🚀 The feature, motivation and pitch
https://arxiv.org/abs/2401.14112
I think you guys are really going to like this.
The deepspeed developers introduce FP6 datatype on cards without fp8 suppo…
-
### Your current environment
```text
The output of `python collect_env.py`
```
Collecting environment information...
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTorc…
-
Hi, I noticed that the calib data is clipped from the origin input_ids whose length is >= args.seqlen.
Can the calib data be generated by packing origin input_ids first and then slicing?
-
이번 장에서는 극내 주식 데이터 중 주식티커와 섹터별 구성종목 및 퀀트 투자를 위한 핵심 데이터인 수정주가, 재무제표, 가치지표를 크롤링하는 방법을 알아보겠다.