hyperonym / basaran

Basaran is an open-source alternative to the OpenAI text completion API. It provides a compatible streaming API for your Hugging Face Transformers-based text generation models.
MIT License

build(deps): update transformers[sentencepiece] requirement from ~=4.27.4 to ~=4.28.0 #123

Closed · dependabot[bot] closed this pull request 1 year ago

dependabot[bot] commented 1 year ago

Updates the requirements on transformers[sentencepiece] to permit the latest version.
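Concretely, the bump amounts to loosening the pinned range in Basaran's dependency file. A minimal sketch of the change, assuming the requirement lives in a top-level requirements.txt:

```diff
-transformers[sentencepiece]~=4.27.4
+transformers[sentencepiece]~=4.28.0
```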

Release notes

Sourced from transformers[sentencepiece]'s releases.

v4.28.0: LLaMa, Pix2Struct, MatCha, DePlot, MEGA, NLLB-MoE, GPTBigCode

LLaMA

The LLaMA model was proposed in LLaMA: Open and Efficient Foundation Language Models. It is a collection of foundation language models ranging from 7B to 65B parameters. You can request access to the weights here, then use the conversion script to generate a checkpoint compatible with Hugging Face.
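As a quick illustration of the new API surface, here is a minimal sketch of loading a converted checkpoint with the LLaMA classes added in 4.28.0. The local path is a placeholder, and the weights must be requested and converted first.

```python
# Hedged sketch: loading a locally converted LLaMA checkpoint with transformers ~=4.28.0.
# The directory path is a placeholder for the output of the conversion script.
from transformers import LlamaForCausalLM, LlamaTokenizer

model_dir = "/path/to/converted/llama-7b"  # assumed location of converted weights
tokenizer = LlamaTokenizer.from_pretrained(model_dir)
model = LlamaForCausalLM.from_pretrained(model_dir)

inputs = tokenizer("Open and efficient foundation language models are", return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```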

Pix2Struct, MatCha, DePlot

Pix2Struct is a pretrained image-to-text model for purely visual language understanding, which can be fine-tuned on tasks containing visually-situated language. Pix2Struct has been fine-tuned on a variety of tasks and datasets, ranging from image captioning and visual question answering (VQA) over different inputs (books, charts, science diagrams) to captioning UI components, among others.
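For reference, the Pix2Struct classes shipped in 4.28.0 can be exercised roughly as follows; the checkpoint name is an assumption based on publicly shared Pix2Struct weights on the Hub, and the image path is a placeholder.

```python
# Hedged sketch: image captioning with a fine-tuned Pix2Struct checkpoint.
from transformers import Pix2StructForConditionalGeneration, Pix2StructProcessor
from PIL import Image

checkpoint = "google/pix2struct-textcaps-base"  # assumed Hub checkpoint
processor = Pix2StructProcessor.from_pretrained(checkpoint)
model = Pix2StructForConditionalGeneration.from_pretrained(checkpoint)

image = Image.open("chart.png")  # placeholder input image
inputs = processor(images=image, return_tensors="pt")
generated_ids = model.generate(**inputs, max_new_tokens=50)
print(processor.decode(generated_ids[0], skip_special_tokens=True))
```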

Mega

MEGA proposes a new approach to self-attention with each encoder layer having a multi-headed exponential moving average in addition to a single head of standard dot-product attention, giving the attention mechanism stronger positional biases. This allows MEGA to perform competitively to Transformers on standard benchmarks including LRA while also having significantly fewer parameters. MEGA’s compute efficiency allows it to scale to very long sequences, making it an attractive option for long-document NLP tasks.
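A minimal sketch of using the new MEGA implementation as a plain encoder follows; the checkpoint name is an assumption based on publicly shared MEGA weights.

```python
# Hedged sketch: encoding text with the MEGA architecture added in 4.28.0.
from transformers import AutoModel, AutoTokenizer

checkpoint = "mnaylor/mega-base-wikitext"  # assumed Hub checkpoint
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModel.from_pretrained(checkpoint)

inputs = tokenizer("MEGA combines a moving average with single-head attention.", return_tensors="pt")
hidden_states = model(**inputs).last_hidden_state
print(hidden_states.shape)  # (batch, sequence_length, hidden_size)
```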

GPTBigCode

GPTBigCode is an optimized GPT-2 model with support for Multi-Query Attention.
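As a rough illustration, a GPTBigCode checkpoint can be driven through the standard causal-LM API; the model name below is an assumption based on the BigCode organization's public repositories.

```python
# Hedged sketch: code generation with a GPTBigCode checkpoint via the Auto classes.
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "bigcode/gpt_bigcode-santacoder"  # assumed Hub checkpoint
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint)

inputs = tokenizer("def fibonacci(n):", return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output_ids[0]))
```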

NLLB-MoE

The mixture of experts version of the NLLB release has been added to the library.
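A hedged sketch of the translation API for the new model class follows; the checkpoint name is an assumption, and the 54B mixture-of-experts weights need far more memory than a typical machine provides, so treat this purely as an API illustration.

```python
# Hedged sketch: English-to-French translation with the NLLB-MoE checkpoint.
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

checkpoint = "facebook/nllb-moe-54b"  # assumed Hub checkpoint, extremely large
tokenizer = AutoTokenizer.from_pretrained(checkpoint, src_lang="eng_Latn")
model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint)

inputs = tokenizer("The mixture of experts version of NLLB has been added.", return_tensors="pt")
output_ids = model.generate(
    **inputs, forced_bos_token_id=tokenizer.lang_code_to_id["fra_Latn"], max_new_tokens=40
)
print(tokenizer.batch_decode(output_ids, skip_special_tokens=True)[0])
```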

Serializing 8bit models

You can now push 8-bit models to the Hub and/or load 8-bit models directly from it, saving memory and loading your 8-bit models faster! An example repo is available here.
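A minimal sketch of that workflow, assuming bitsandbytes is installed and a CUDA GPU is available; the model and repository names are placeholders:

```python
# Hedged sketch: serializing an 8-bit quantized model, newly supported in 4.28.0.
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-350m", load_in_8bit=True, device_map="auto"
)
model.push_to_hub("your-username/opt-350m-8bit")  # pushes the quantized weights as-is

# The saved config records the quantization settings, so the 8-bit weights can be
# loaded back directly from the Hub:
reloaded = AutoModelForCausalLM.from_pretrained(
    "your-username/opt-350m-8bit", device_map="auto"
)
```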

Breaking Changes

Ordering of height and width for the BLIP image processor

Notes from the PR:

The BLIP image processor incorrectly passed in the dimensions to resize in the order (width, height). This is reordered to be correct.

In most cases, this won't have an effect, as the default height and width are the same. However, it is not backwards compatible for custom configurations with different height and width settings, or for direct calls to the resize method with different height and width values.
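A small sketch of how the fix surfaces in practice for a non-square size; the processor arguments and image dimensions here are illustrative assumptions, not values taken from the PR:

```python
# Illustrative sketch (assumed values): with a non-square size, the processed tensor
# now comes out as (batch, channels, height, width) as configured; earlier releases
# effectively swapped height and width during resize.
import numpy as np
from PIL import Image
from transformers import BlipImageProcessor

processor = BlipImageProcessor(size={"height": 224, "width": 448})
image = Image.fromarray(np.zeros((600, 800, 3), dtype=np.uint8))

pixel_values = processor(image, return_tensors="pt").pixel_values
print(pixel_values.shape)  # torch.Size([1, 3, 224, 448]) with transformers ~=4.28.0
```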

... (truncated)

Commits


Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting `@dependabot rebase`.


Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:

- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it
- `@dependabot merge` will merge this PR after your CI passes on it
- `@dependabot squash and merge` will squash and merge this PR after your CI passes on it
- `@dependabot cancel merge` will cancel a previously requested merge and block automerging
- `@dependabot reopen` will reopen this PR if it is closed
- `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
- `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
codecov-commenter commented 1 year ago

Codecov Report

Patch and project coverage have no change.

Comparison is base (04d865f) 94.23% compared to head (2e9cdfb) 94.23%.


Additional details and impacted files

```diff
@@           Coverage Diff           @@
##           master     #123   +/-   ##
=======================================
  Coverage   94.23%   94.23%
=======================================
  Files           7        7
  Lines         312      312
=======================================
  Hits          294      294
  Misses         18       18
```

:umbrella: View full report in Codecov by Sentry.