build(deps): update bitsandbytes requirement from ~=0.39.0 to ~=0.39.1

Updates the requirements on bitsandbytes to permit the latest version.

Release notes

4-bit QLoRA, Paged Optimizers, and 8-bit Memory Leak Bugfix

This release brings 4-bit quantization support for QLoRA fine-tuning and a critical bugfix that doubled the memory cost of 8-bit models when they were serialized. Furthermore, paged optimizers are introduced, including 8-bit Lion.

0.39.1

Features:

4-bit matrix multiplication for Float4 and NormalFloat4 data types.

Added 4-bit quantization routines

Doubled quantization routines for 4-bit quantization

Paged optimizers for Adam and Lion.

bfloat16 gradient / weight support for Adam and Lion with 8 or 32-bit states.

Bug fixes:

Fixed a bug where 8-bit models consumed twice the memory as expected after serialization

Deprecated:

Kepler binaries (GTX 700s and Tesla K40/K80) are no longer provided via pip and need to be compiled from source. Kepler support might be fully removed in the future.

Changelog

Sourced from bitsandbytes's changelog.

0.0.21

Ampere, RTX 30 series GPUs now compatible with the library.

0.0.22:

Fixed an error where a reset_parameters() call on the StableEmbedding would lead to an error in older PyTorch versions (from 1.7.0).

0.0.23:

Bugs:

Unified quantization API: each quantization function now returns Q, S where Q is the quantized tensor and S the quantization state which may hold absolute max values, a quantization map or more. For dequantization all functions now accept the inputs Q, S so that Q is dequantized with the quantization state S.

Fixed an issue where the CUDA 11.1 binary was not compiled with the right headers

API changes:

Block-wise quantization for optimizers now enabled by default

Features:

Block-wise quantization routines now support CPU Tensors.

0.0.24:

Fixed a bug where a float/half conversion led to a compilation error for CUDA 11.1 on Turning GPUs.

removed Apex dependency for bnb LAMB

0.0.25:

Features:

Added skip_zeros for block-wise and 32-bit optimizers. This ensures correct updates for sparse gradients and sparse models.

Added support for Kepler GPUs. (#4)

Added Analysis Adam to track 8-bit vs 32-bit quantization errors over time.

Make compilation more user friendly.

Bug fixes:

fixed "undefined symbol: __fatbinwrap_38" error for P100 GPUs on CUDA 10.1 (#5)

Docs:

Added docs with instructions to compile from source.

0.26.0:

Features:

Added Adagrad (without grad clipping) as 32-bit and 8-bit block-wise optimizer.

Added AdamW (copy of Adam with weight decay init 1e-2). #10

Introduced ModuleConfig overrides which can be seamlessly be used at initialization time of a module.

Added bnb.nn.Embedding layer which runs at 32-bit but without the layernorm. This works well if you need to fine-tune pretrained models that do not have a embedding layer norm. #19

Bug fixes:

Fixed a bug where weight decay was incorrectly applied to 32-bit Adam. #13

... (truncated)

Commits

See full diff in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

hyperonym / basaran

build(deps): update bitsandbytes requirement from ~=0.39.0 to ~=0.39.1 #217

4-bit QLoRA, Paged Optimizers, and 8-bit Memory Leak Bugfix

0.39.1

0.0.21

0.0.22:

0.0.23:

0.0.24:

0.0.25:

0.26.0:

Codecov Report