This PR contains the following updates:

transformers: ==4.38.0 -> ==4.50.0

GitHub Vulnerability Alerts

CVE-2024-11392

Hugging Face Transformers MobileViTV2 Deserialization of Untrusted Data Remote Code Execution Vulnerability. This vulnerability allows remote attackers to execute arbitrary code on affected installations of Hugging Face Transformers. User interaction is required to exploit this vulnerability in that the target must visit a malicious page or open a malicious file.
The specific flaw exists within the handling of configuration files. The issue results from the lack of proper validation of user-supplied data, which can result in deserialization of untrusted data. An attacker can leverage this vulnerability to execute code in the context of the current user. Was ZDI-CAN-24322.

CVE-2024-11394

Hugging Face Transformers Trax Model Deserialization of Untrusted Data Remote Code Execution Vulnerability. This vulnerability allows remote attackers to execute arbitrary code on affected installations of Hugging Face Transformers. User interaction is required to exploit this vulnerability in that the target must visit a malicious page or open a malicious file.
The specific flaw exists within the handling of model files. The issue results from the lack of proper validation of user-supplied data, which can result in deserialization of untrusted data. An attacker can leverage this vulnerability to execute code in the context of the current user. Was ZDI-CAN-25012.

CVE-2024-11393

Hugging Face Transformers MaskFormer Model Deserialization of Untrusted Data Remote Code Execution Vulnerability. This vulnerability allows remote attackers to execute arbitrary code on affected installations of Hugging Face Transformers. User interaction is required to exploit this vulnerability in that the target must visit a malicious page or open a malicious file.
The specific flaw exists within the parsing of model files. The issue results from the lack of proper validation of user-supplied data, which can result in deserialization of untrusted data. An attacker can leverage this vulnerability to execute code in the context of the current user. Was ZDI-CAN-25191.

CVE-2024-12720

A Regular Expression Denial of Service (ReDoS) vulnerability was identified in the huggingface/transformers library, specifically in the file tokenization_nougat_fast.py. The vulnerability occurs in the post_process_single() function, where a regular expression processes specially crafted input. The issue stems from the regex exhibiting exponential time complexity under certain conditions, leading to excessive backtracking. This can result in significantly high CPU usage and potential application downtime, effectively creating a Denial of Service (DoS) scenario. The affected version is v4.46.3.

CVE-2025-1194

A Regular Expression Denial of Service (ReDoS) vulnerability was identified in the huggingface/transformers library, specifically in the file tokenization_gpt_neox_japanese.py of the GPT-NeoX-Japanese model. The vulnerability occurs in the SubWordJapaneseTokenizer class, where regular expressions process specially crafted inputs. The issue stems from a regex exhibiting exponential complexity under certain conditions, leading to excessive backtracking. This can result in high CPU usage and potential application downtime, effectively creating a Denial of Service (DoS) scenario. The affected version is v4.48.1 (latest).

CVE-2025-2099

A vulnerability in the preprocess_string() function of the transformers.testing_utils module in huggingface/transformers version v4.48.3 allows for a Regular Expression Denial of Service (ReDoS) attack. The regular expression used to process code blocks in docstrings contains nested quantifiers, leading to exponential backtracking when processing input with a large number of newline characters. An attacker can exploit this by providing a specially crafted payload, causing high CPU usage and potential application downtime, effectively resulting in a Denial of Service (DoS) scenario.
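All three ReDoS advisories describe the same failure mode: a regular expression with nested quantifiers that backtracks exponentially on input that almost matches. The snippet below is a minimal, self-contained illustration of that mechanism using a deliberately toy pattern; it is not the regex shipped in transformers.

```python
import re
import time

# Toy nested-quantifier pattern -- the classic ReDoS shape. This is NOT the
# expression used in transformers; it only demonstrates the mechanism.
pattern = re.compile(r"^(a+)+$")

for n in (16, 20, 24):
    payload = "a" * n + "!"  # almost matches, forcing the engine through ~2^n backtracking paths
    start = time.perf_counter()
    pattern.match(payload)
    elapsed = time.perf_counter() - start
    print(f"n={n}: {elapsed:.3f}s")  # runtime roughly doubles with each extra 'a'
```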
Release Notes

huggingface/transformers (transformers)
Compare Source
Release v4.50.0
New Model Additions
Model-based releases
Starting with version v4.49.0, we have been doing model-based releases in addition to our traditional, software-based monthly releases. These model-based releases provide a tag from which models may be installed.
Unlike our software releases, these are not pushed to PyPI; they are kept on our GitHub. Each release has a tag attributed to it, such as:
v4.49.0-Gemma-3
v4.49.0-AyaVision
⚠️ As bugs are identified and fixed on each model, the release tags are updated so that installing from that tag always gives the best experience possible with that model.
Each new model release will always be based on the current state of the main branch at the time of its creation. This ensures that new models start with the latest features and fixes available.
For example, if two models—Gemma-3 and AyaVision—are released from main, and then a fix for gemma3 is merged, it will look something like this:
```
          o---- v4.49.0-Gemma-3 (includes AyaVision, plus main fixes)
         /
---o--o--o--o--o-- (fix for gemma3) --o--o--o main
         \
          o---- v4.49.0-AyaVision
```
We strive to merge model specific fixes on their respective branches as fast as possible!
Gemma 3

Gemma 3 is heavily referenced in the following model-based release, and we recommend reading it if you want all the information relative to that model.
The Gemma 3 model was proposed by Google. It is a vision-language model composed of a SigLIP vision encoder and a Gemma 2 language decoder linked by a multimodal linear projection.
It cuts an image into a fixed number of tokens, in the same way as SigLIP, as long as the image does not exceed a certain aspect ratio. For images that exceed the given aspect ratio, it crops the image into multiple smaller patches and concatenates them with the base image embedding.
One particularity is that the model uses bidirectional attention on all the image tokens. The model also interleaves sliding-window local attention with full causal attention in the language backbone, where every sixth layer is a full causal attention layer.
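For orientation, here is a minimal inference sketch through the image-text-to-text pipeline. The checkpoint name (google/gemma-3-4b-it) and the test image URL are illustrative choices rather than anything mandated by this release.

```python
from transformers import pipeline

# Any Gemma 3 instruction-tuned checkpoint should work here; the 4B variant is used as an example.
pipe = pipeline("image-text-to-text", model="google/gemma-3-4b-it")

messages = [
    {
        "role": "user",
        "content": [
            {"type": "image", "url": "http://images.cocodataset.org/val2017/000000039769.jpg"},
            {"type": "text", "text": "Describe this image in one sentence."},
        ],
    }
]
outputs = pipe(text=messages, max_new_tokens=40)
print(outputs)
```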
ShieldGemma 2
ShieldGemma 2, built on Gemma 3, is a 4 billion (4B) parameter model that checks the safety of both synthetic and natural images against key categories to help you build robust datasets and models. With this addition to the Gemma family of models, researchers and developers can now easily minimize the risk of harmful content in their models across key areas of harm as defined below:
- No Sexually Explicit content: The image shall not contain content that depicts explicit or graphic sexual acts (e.g., pornography, erotic nudity, depictions of rape or sexual assault).
- No Dangerous Content: The image shall not contain content that facilitates or encourages activities that could cause real-world harm (e.g., building firearms and explosive devices, promotion of terrorism, instructions for suicide).
- No Violence/Gore content: The image shall not contain content that depicts shocking, sensational, or gratuitous violence (e.g., excessive blood and gore, gratuitous violence against animals, extreme injury or moment of death).
We recommend using ShieldGemma 2 as an input filter to vision language models, or as an output filter of image generation systems. To train a robust image safety model, we curated training datasets of natural and synthetic images and instruction-tuned Gemma 3 to demonstrate strong performance.
Aya Vision
AyaVision is heavily referenced in the following model-based release, and we recommend reading it if you want all the information relative to that model.

The Aya Vision 8B and 32B models are state-of-the-art multilingual multimodal models developed by Cohere For AI. They build on the Aya Expanse recipe to handle both visual and textual information without compromising on the strong multilingual textual performance of the original model.
Aya Vision 8B combines the Siglip2-so400-384-14 vision encoder with the Cohere CommandR-7B language model, further post-trained with the Aya Expanse recipe, creating a powerful vision-language model capable of understanding images and generating text across 23 languages. Aya Vision 32B, in contrast, uses Aya Expanse 32B as the language model.
Key features of Aya Vision include:
- Multimodal capabilities in 23 languages
- Strong text-only multilingual capabilities inherited from CommandR-7B post-trained with the Aya Expanse recipe and Aya Expanse 32B
- High-quality visual understanding using the Siglip2-so400-384-14 vision encoder
- Seamless integration of visual and textual information in 23 languages.
Mistral 3.1
Mistral 3.1 is heavily referenced in the following model-based release, and we recommend reading it if you want all the information relative to that model.

Building upon Mistral Small 3 (2501), Mistral Small 3.1 (2503) adds state-of-the-art vision understanding and enhances long context capabilities up to 128k tokens without compromising text performance. With 24 billion parameters, this model achieves top-tier capabilities in both text and vision tasks.
It is ideal for:
- Fast-response conversational agents.
- Low-latency function calling.
- Subject matter experts via fine-tuning.
- Local inference for hobbyists and organizations handling sensitive data.
- Programming and math reasoning.
- Long document understanding.
- Visual understanding.
Smol VLM 2
SmolVLM2 is heavily referenced in the following model-based release, and we recommend reading it if you want all the information relative to that model.

SmolVLM2 is an adaptation of the Idefics3 model with two main differences:
- It uses SmolLM2 for the text model.
- It supports multi-image and video inputs
SigLIP-2
SigLIP-2 is heavily referenced in the following model-based release, and we recommend reading it if you want all the information relative to that model.

The SigLIP2 model was proposed in SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features by Michael Tschannen, Alexey Gritsenko, Xiao Wang, Muhammad Ferjad Naeem, Ibrahim Alabdulmohsin,
Nikhil Parthasarathy, Talfan Evans, Lucas Beyer, Ye Xia, Basil Mustafa, Olivier Hénaff, Jeremiah Harmsen,
Andreas Steiner and Xiaohua Zhai.
The model comes in two variants:
- FixRes - model works with fixed resolution images (backward compatible with SigLIP v1)
- NaFlex - model works with variable image aspect ratios and resolutions (SigLIP2 in transformers)
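For context, a SigLIP 2 checkpoint can be exercised through the existing zero-shot image classification pipeline. The checkpoint name below is an assumption based on the Hub naming of the FixRes variants; swap in whichever SigLIP 2 checkpoint you intend to use.

```python
from transformers import pipeline

# Assumed FixRes SigLIP 2 checkpoint name; any siglip2 checkpoint should slot in here.
classifier = pipeline(
    task="zero-shot-image-classification",
    model="google/siglip2-base-patch16-224",
)
result = classifier(
    "http://images.cocodataset.org/val2017/000000039769.jpg",
    candidate_labels=["two cats resting on a couch", "a dog in a park", "an empty room"],
)
print(result)
```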
Prompt Depth Anything
PromptDepthAnything is a high-resolution, accurate metric depth estimation model that leverages prompting, inspired by its success in vision-language models (VLMs) and large language models (LLMs). Using iPhone LiDAR as a prompt, the model generates precise depth maps at up to 4K resolution, unlocking the potential of depth foundation models.

New tool: attention visualization
We add a new tool to transformers to visualize the attention layout of a given model. It only requires a model ID as input, and will load the relevant tokenizer/model and display what the attention mask looks like. Some examples:

```python
from transformers.utils.attention_visualizer import AttentionMaskVisualizer

visualizer = AttentionMaskVisualizer("meta-llama/Llama-3.2-3B-Instruct")
visualizer("A normal attention mask")

visualizer = AttentionMaskVisualizer("mistralai/Mistral-Small-24B-Instruct-2501")
visualizer("A normal attention mask with a long text to see how it is displayed, and if it is displayed correctly")

visualizer = AttentionMaskVisualizer("google/paligemma2-3b-mix-224")
visualizer("<img> You are an assistant.", suffix="What is on the image?")

visualizer = AttentionMaskVisualizer("google/gemma-2b")
visualizer("You are an assistant. Make sure you print me")  # we should have sliding and non-sliding side by side

visualizer = AttentionMaskVisualizer("google/gemma-3-27b-it")
visualizer("<img>You are an assistant. Make sure you print me")  # we should have sliding and non-sliding side by side
```

Deprecating transformers.agents in favor of smolagents
We are deprecating transformers.agents in favour of the smolagents library. Read more about smolagents here.
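For anyone migrating, the hello-world below follows the smolagents quickstart as it looked around this release; class names such as HfApiModel have been renamed in later smolagents versions, so treat this as a sketch rather than a pinned recipe.

```python
# pip install smolagents
from smolagents import CodeAgent, HfApiModel  # HfApiModel was the default model wrapper at the time

# add_base_tools=True equips the agent with smolagents' default toolbox.
agent = CodeAgent(tools=[], model=HfApiModel(), add_base_tools=True)
agent.run("How many seconds are there in a leap year?")
```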
Quantization
We support adding custom quantization methods by using the @register_quantization_config and @register_quantizer decorators:

```python
@register_quantization_config("custom")
class CustomConfig(QuantizationConfigMixin):
    pass


@register_quantizer("custom")
class CustomQuantizer(HfQuantizer):
    pass


quantized_model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-350m", quantization_config=CustomConfig(), torch_dtype="auto"
)
```
AMD is developing its in-house quantizer named Quark, released under the MIT license, which supports a broad range of quantization pre-processing, algorithms, dtypes and target hardware. You can now load a model quantized with the Quark library:

```python
# pip install amd-quark
from transformers import AutoModelForCausalLM

model_id = "EmbeddedLLM/Llama-3.1-8B-Instruct-w_fp8_per_channel_sym"
model = AutoModelForCausalLM.from_pretrained(model_id)
model = model.to("cuda")
```
Torchao is augmented with autoquant support, CPU quantization, as well as new AOBaseConfig object instances for more advanced configuration.
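As a rough sketch of the torchao path (assuming torchao is installed): the string-based quant_type below has been supported for a while, and on recent torchao versions an AOBaseConfig instance can be passed in its place for the more advanced configuration mentioned above.

```python
# pip install torchao
from transformers import AutoModelForCausalLM, TorchAoConfig

# int4 weight-only quantization generally targets GPU inference; an AOBaseConfig
# object can be passed instead of the string on recent torchao releases.
quant_config = TorchAoConfig("int4_weight_only", group_size=128)
model = AutoModelForCausalLM.from_pretrained(
    "facebook/opt-350m", quantization_config=quant_config, torch_dtype="auto", device_map="auto"
)
```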
Tensor Parallelism implementation changes
At loading time, the parallelization is now applied module by module, so that no memory overhead is incurred beyond what the final weight distribution requires.
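A minimal sketch of what this looks like from the user side, assuming a multi-GPU node and a checkpoint that ships a tensor-parallel plan; the script would be launched with torchrun, e.g. `torchrun --nproc-per-node 4 tp_demo.py` (tp_demo.py being a hypothetical file name).

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-3.2-3B-Instruct"  # illustrative checkpoint; any model with a TP plan works

# tp_plan="auto" shards each module across the torchrun processes as it is created,
# which is where the "no memory overhead at load time" property comes from.
model = AutoModelForCausalLM.from_pretrained(model_id, tp_plan="auto", torch_dtype=torch.bfloat16)
tokenizer = AutoTokenizer.from_pretrained(model_id)

inputs = tokenizer("Tensor parallel loading shards modules as they are created.", return_tensors="pt")
outputs = model(inputs.input_ids.to(model.device))
print(outputs.logits.shape)
```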
Generation
This release includes two speed upgrades to generate:
- Assisted generation now works with ANY model as an assistant, even with do_sample=True;

```python
from transformers import pipeline
import torch

prompt = "Alice and Bob"
checkpoint = "google/gemma-2-9b"
assistant_checkpoint = "double7/vicuna-68m"

pipe = pipeline("text-generation", model=checkpoint, assistant_model=assistant_checkpoint, do_sample=True)
pipe_output = pipe(prompt, max_new_tokens=50, do_sample=True)
print(pipe_output[0]["generated_text"])
```
- Beam search was vectorized, and should be significantly faster with a large num_beams. The speedup is more visible on smaller models, where model.forward doesn't dominate the total run time (see the sketch after this list).
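For reference, the vectorized beam search sits behind the same generate() arguments as before; a call of the following shape (small model, relatively large num_beams) is where the speedup is most visible. The checkpoint choice is illustrative.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

checkpoint = "gpt2"  # a small model, where model.forward does not dominate the run time
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForCausalLM.from_pretrained(checkpoint)

inputs = tokenizer("Alice and Bob", return_tensors="pt")
outputs = model.generate(**inputs, num_beams=16, num_return_sequences=4, max_new_tokens=32)
for sequence in outputs:
    print(tokenizer.decode(sequence, skip_special_tokens=True))
```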
Documentation
A significant redesign of our documentation has wrapped up. The goal was to greatly simplify the transformers documentation, making it much easier to navigate. Let us know what you think!
Notable repo maintenance
The research examples folder that was hosted in transformers is no more. We have moved it out of transformers and into the following repo: github.com/huggingface/transformers-research-projects/
We have updated our flex attention support to bring it on par with our Flash Attention 2 support.
More models support flex attention now thanks to @qubvel:
- Refactor Attention implementation for ViT-based models by @qubvel in #36545
First integration of hub kernels for deformable detr!
Bugfixes and improvements
- EsmModelIntegrationTest::test_inference_bitsandbytes by @faaany in #36225
- LlavaForConditionalGenerationModelTest::test_config after #36077 by @ydshieh in #36230
- /generation by @gante in #36235
- test_export_to_onnx by @gante in #36241
- test_fast_is_faster_than_slow by @ydshieh in #36240
- Speech2TextFeatureExtractor API by @KarelVesely84 in #34638
- pt_tf equivalence tests by @gante in #36253
- test_from_pretrained_low_cpu_mem_usage_equal less flaky by @gante in #36255
- GenerationTesterMixin inheritance is correct 🐛 🔫 by @gante in #36180
- main by @ydshieh in #36375
- is_causal fail with compile by @Cyrilvallez in #36374
- benchmark.yml by @ydshieh in #36402
- CandidateGenerator by @keyboardAnt in #35029
- contents: write by @ydshieh in #36445
- torch.distributed-compatible DynamicCache by @gante in #36373
- src/transformers/image_utils.py by @hmellor in #36435
- hub_retry by @ydshieh in #36449
- TRUST_REMOTE_CODE for RealmRetriever for security by @ydshieh in #36511
- input_ids passed to PrefixConstrainedLogitsProcessor is zero by @HiDolen in #36489
- DataCollatorForLanguageModeling by @capemox in #36457
- [HybridCache] disable automatic compilation by @gante in #36620
- make fix-copies by @gante in #36664
- from_pretrained by @Cyrilvallez in #36033
- meta device by @gante in #36543
- gc.collect() if only 1 shard is used by @gante in #36721
- test_eager_matches_sdpa_inference by @gante in #36650
- generation_config, overwrite default values with the model's base generation_config by @gante in #36684

Configuration
📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Never, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
This PR was generated by Mend Renovate. View the repository job log.