Bump transformers from 4.31.0 to 4.34.1

Bumps transformers from 4.31.0 to 4.34.1.

Release notes

Patch release: v4.34.1

A patch release was made for the following three commits:

Add add_generation_prompt argument to apply_chat_template (huggingface/transformers#26573)

Fix backward compatibility of Conversation (huggingface/transformers#26741)

[Tokenizer] Fix slow and fast serialization (huggingface/transformers#26570)

v4.34: Mistral, Persimmon, Prompt templating, Flash Attention 2, Tokenizer refactor

New models

Mistral

Mistral-7B-v0.1 is a decoder-based LM with the following architectural choices:

Sliding Window Attention - Trained with 8k context length and fixed cache size, with a theoretical attention span of 128K tokens

GQA (Grouped Query Attention) - allowing faster inference and lower cache size.

Byte-fallback BPE tokenizer - ensures that characters are never mapped to out-of-vocabulary tokens.

[Mistral] Mistral-7B-v0.1 support by @Bam4d in #26447

Persimmon

The authors introduced Persimmon-8B, a decoder model based on the classic transformers architecture, with query and key normalization. Persimmon-8B is a fully permissively licensed model with approximately 8 billion parameters, released under the Apache license. Some of the key attributes of Persimmon-8B are long context size (16K), performance, and capabilities for multimodal extensions.

[Persimmon] Add support for persimmon by @ArthurZucker in #26042

BROS

BROS stands for BERT Relying On Spatiality. It is an encoder-only Transformer model that takes a sequence of tokens and their bounding boxes as inputs and outputs a sequence of hidden states. BROS encode relative spatial information instead of using absolute spatial information.

Add BROS by @jinhopark8345 in #23190

ViTMatte

ViTMatte leverages plain Vision Transformers for the task of image matting, which is the process of accurately estimating the foreground object in images and videos.

Add ViTMatte by @NielsRogge in #25843

Nougat

Nougat uses the same architecture as Donut, meaning an image Transformer encoder and an autoregressive text Transformer decoder to translate scientific PDFs to markdown, enabling easier access to them.

Add Nougat by @NielsRogge and @molbap in #25942

Prompt templating

We've added a new template feature for chat models. This allows the formatting that a chat model was trained with to be saved with the model, ensuring that users can exactly reproduce that formatting when they want to fine-tune the model or use it for inference. For more information, see our template documentation.

Overhaul Conversation class and prompt templating by @Rocketknight1 in #25323

🚨🚨 Tokenizer refactor

... (truncated)

Commits

acc394c Release v4.34.1
0c4b637 [Tokenizer] Fix slow and fast serialization (#26570)
75c4250 Fix backward compatibility of Conversation (#26741)
3e425b9 Add add_generation_prompt argument to apply_chat_template (#26573)
9c27587 Release: v4.34.0
31543dd [Nougat] from transformers import * (#26562)
2aef9a9 [PEFT] Final fixes (#26559)
ae9a344 [Mistral] Add Flash Attention-2 support for mistral (#26464)
1a2e966 Nit-added-tokens (#26538)
245da7e [Doctest] Add configuration_encoder_decoder.py (#26519)
Additional commits viewable in compare view

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

megagonlabs / bunkai

Bump transformers from 4.31.0 to 4.34.1 #325

Patch release: v4.34.1

v4.34: Mistral, Persimmon, Prompt templating, Flash Attention 2, Tokenizer refactor

New models

Mistral

Persimmon

BROS

ViTMatte

Nougat

Prompt templating

🚨🚨 Tokenizer refactor