Qwen

From Wikipedia, the free encyclopedia

Family of large language models by Alibaba

Qwen

Screenshot of an example of a Qwen 3 answer describing Wikipedia, with the "Thinking" feature enabled
Developer	Alibaba Cloud
Initial release	April 2023
Stable release	Qwen3-Max-Thinking / January 26, 2026; Qwen3-235B-A22B / July 25, 2025; Qwen3-Next-80B-A3B / September 11, 2025; Qwen3-Coder-Next (based on Qwen3-Next-80B-A3B) / February 2, 2026

Written in	Python
Operating system	Web app, Android
Type	Large language model, chatbot
License	Apache-2.0, Qwen Research License, Qwen License
Website	chat.qwen.ai
Repository	github.com/QwenLM/Qwen

Tongyi Qianwen
Traditional Chinese	通義千問
Simplified Chinese	通义千问
Literal meaning	to comprehend the meaning, [and to answer] a thousand kinds of questions
Hanyu Pinyin (Standard Mandarin)	Tōngyì Qiānwèn

Qwen (also known as Tongyi Qianwen; Chinese: 通义千问; pinyin: Tōngyì Qiānwèn) is a family of large language models developed by Alibaba Cloud. Many Qwen variants are distributed as open-weight models under the Apache-2.0 license, while others are served through Alibaba Cloud.^[1]

In July 2024, the South China Morning Post reported that the benchmarking platform SuperCLUE ranked Qwen2-72B-Instruct behind OpenAI's GPT-4o and Anthropic's Claude 3.5 Sonnet and ahead of other Chinese models.^[2]

Models

An AI-generated image by Qwen3-Max (using Qwen-Image), based on Wikipe-tan. The prompt was: `Transform this image into painting in the style of Picasso and Juan Gris`

Alibaba launched a beta of Qwen in April 2023 under the name Tongyi Qianwen, then opened it for public use in September 2023 after regulatory clearance.^[3]^[4]

The model's architecture was based on the Llama architecture developed by Meta AI.^[5]^[6] In December 2023, Alibaba released its 72B and 1.8B models for download; the Qwen 7B weights had been released in August.^[7]^[8] The models are sometimes described as open source, but the training code has not been released and the training data has not been documented, and they do not meet the terms of either the Open Source AI Definition or the Model Openness Framework from the Linux Foundation.

In June 2024, Alibaba launched Qwen2, and in September it released some of its models with open weights while keeping its most advanced models proprietary.^[9]^[10] Qwen2 contains both dense and sparse models.^[11]

In November 2024, QwQ-32B-Preview, a reasoning-focused model similar to OpenAI's o1, was released under the Apache 2.0 license, although only the weights were released, not the dataset or training method.^[12]^[13] QwQ has a 32K-token context length and performs better than o1 on some benchmarks.^[14]

The Qwen-VL series is a line of visual language models that combine a vision transformer with an LLM.^[5]^[15] Alibaba released Qwen2-VL with variants of 2 billion and 7 billion parameters.^[16]^[17]^[18]

In January 2025, Qwen2.5-VL was released with variants of 3, 7, 32, and 72 billion parameters.^[19] All models except the 72B variant are licensed under the Apache 2.0 license.^[20] Qwen-VL-Max is Alibaba's flagship vision model as of 2024, and is sold by Alibaba Cloud at a cost of US$0.41 per million input tokens.^[21]
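To make the per-token price concrete, the sketch below converts an input-token count into an approximate charge. Only the US$0.41 rate comes from the text above; the function name and the sample token count are hypothetical, and the calculation ignores output tokens, discounts, and any later price changes.

```python
# Illustrative arithmetic only: the rate is the figure quoted in the article
# for Qwen-VL-Max; everything else here is a hypothetical example.
PRICE_PER_MILLION_INPUT_TOKENS_USD = 0.41


def input_cost_usd(
    input_tokens: int,
    price_per_million: float = PRICE_PER_MILLION_INPUT_TOKENS_USD,
) -> float:
    """Return the approximate cost in US dollars of sending `input_tokens` input tokens."""
    if input_tokens < 0:
        raise ValueError("input_tokens must be non-negative")
    return input_tokens / 1_000_000 * price_per_million


# Hypothetical workload of 250,000 input tokens.
print(f"US${input_cost_usd(250_000):.4f}")
```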

Alibaba has released several other model types such as Qwen-Audio and Qwen2-Math.^[22] In total, it has released more than 100 open-weight models, with its models having been downloaded more than 40 million times.^[10] Fine-tuned versions of Qwen have been developed by enthusiasts, such as "Liberated Qwen", developed by San Francisco-based Abacus AI, which is a version that responds to any user request without content restrictions.^[23]

On January 29, 2025, Alibaba launched Qwen2.5-Max.^[24]^[25]

On March 24, 2025, Alibaba launched Qwen2.5-VL-32B-Instruct as a successor to the Qwen2.5-VL model. It was released under the Apache 2.0 license.^[26]^[27]

On March 26, 2025, Qwen2.5-Omni-7B was released under the Apache 2.0 license and made available through chat.qwen.ai, as well as platforms like Hugging Face, GitHub, and ModelScope.^[28] The Qwen2.5-Omni model accepts text, images, videos, and audio as input and can generate both text and audio as output, allowing it to be used for real-time voice chatting, similar to OpenAI's GPT-4o.^[28]

On April 28, 2025, the Qwen3 model family was released,^[29] with all models licensed under the Apache 2.0 license. The Qwen3 model family includes both dense (0.6B, 1.7B, 4B, 8B, 14B, and 32B parameters) and sparse models (30B with 3B activated parameters, 235B with 22B activated parameters). They were trained on 36 trillion tokens in 119 languages and dialects.^[30]
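As a rough illustration of what the sparse sizes imply, the sketch below computes the share of parameters that is activated in each sparse Qwen3 model. It assumes that "activated parameters" means the parameters used for each token, as is usual for mixture-of-experts models; the dictionary labels and function name are hypothetical, and only the parameter counts come from the text above.

```python
# Sparse Qwen3 sizes quoted in the article, in billions of parameters:
# label -> (total parameters, activated parameters). Labels are illustrative.
QWEN3_SPARSE = {
    "30B-A3B": (30, 3),
    "235B-A22B": (235, 22),
}


def active_fraction(total_b: float, active_b: float) -> float:
    """Return the fraction of a sparse model's parameters that is activated."""
    if total_b <= 0 or not 0 <= active_b <= total_b:
        raise ValueError("expected 0 <= active_b <= total_b and total_b > 0")
    return active_b / total_b


for name, (total, active) in QWEN3_SPARSE.items():
    print(f"{name}: {active_fraction(total, active):.1%} of parameters activated")
```

Under that assumption, both sparse models activate roughly a tenth of their parameters at a time, which is why their inference cost is closer to that of a much smaller dense model than their total size suggests.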

On September 5, 2025, Alibaba launched Qwen3-Max.^{[citation needed]}

On September 10, 2025, Qwen3-Next was released under the Apache 2.0 license and made available through chat.qwen.ai, as well as platforms like Hugging Face and ModelScope.^[31]^{[non-primary source needed]}

On September 22, 2025, Qwen3-Omni was released under the Apache 2.0 license and made available through chat.qwen.ai, as well as platforms like Hugging Face and ModelScope. Qwen3-Omni is a multimodal model that accepts text, images, audio, and video as input and can generate text and speech.^[32]^{[non-primary source needed]}

On January 27, 2026, Qwen3-Max-Thinking was released. The model can generate text, pictures, or video.^[33]

List of models
Version	Release date	Ref.
Tongyi Qianwen	September 2023	^[34]
Qwen-VL	August 2023	^[35]
Qwen2	June 2024	^[10]
Qwen2-Audio	August 2024	^[36]
Qwen2-VL	August 2024	^[16]
Qwen2.5	September 2024	^[37]
Qwen2.5-Coder	November 2024	^[38]
QvQ	December 2024	^[39]
Qwen2.5-VL	January 2025	^[40]
QwQ-32B	March 2025	^[41]
Qwen2.5-Omni	March 2025	^[28]
Qwen3	April 2025	^[29]
Qwen3-Coder (AKA Qwen3-Coder-480B-A35B); Qwen3-Coder-Flash (AKA Qwen3-Coder-30B-A3B)	July 2025	^[42]
Qwen3-Max	September 2025	^{[citation needed]}
Qwen3-Next	September 2025	^[43]
Qwen3-Omni	September 2025	^[32]
Qwen3-VL	September 2025	^[44]
Qwen3-Coder-Next	February 2026	^[45]

References

1. Mo, Liam; Hall, Casey (19 September 2024). "Alibaba accelerates AI push by releasing new open-source models, text-to-video". Reuters.
2. Jiang, Ben (11 July 2024). "Alibaba's open-source AI model tops Chinese rivals, ranks 3rd globally". South China Morning Post. Archived from the original on 4 March 2025. Retrieved 29 November 2024.
3. Horwitz, Josh; Ye, Josh (11 April 2023). "Alibaba to roll out generative AI across apps". Reuters.
4. Hall, Casey (13 September 2023). "Alibaba opens AI model Tongyi Qianwen to the public". Reuters.
5. Bai, Jinze; et al. (28 September 2023). "Qwen Technical Report". arXiv:2309.16609 [cs.CL].
6. "Qwen/techmemo-draft.md". GitHub. August 3, 2023. Archived from the original on March 7, 2025. Retrieved March 5, 2025.
7. Fan, Feifei (2023-12-01). "Alibaba unveils new Tongyi Qianwen AI language model". global.chinadaily.com.cn.
8. Ye, Josh (August 3, 2023). "Alibaba rolls out open-sourced AI model to take on Meta's Llama 2". Reuters. Archived from the original on 2023-10-10. Retrieved 2024-11-29.
9. Jiang, Ben (7 June 2024). "Alibaba says new AI model Qwen2 bests Meta's Llama 3 in tasks like maths and coding". South China Morning Post.
10. Kharpal, Arjun (19 September 2024). "China's Alibaba launches over 100 new open-source AI models, releases text-to-video generation tool". CNBC.
11. Yang, An; et al. (10 September 2024). "Qwen2 Technical Report". arXiv:2407.10671 [cs.CL].
12. Dickson, Ben (29 November 2024). "Alibaba releases Qwen with Questions, an open reasoning model that beats o1-preview". VentureBeat. Archived from the original on 18 January 2025. Retrieved 1 December 2024.
13. 故渊 (2024-11-28). "阿里通义千问 QwQ 登场：开源 AI 推理新王，MATH 测试超 OpenAI o1 模型 - IT之家". ITHome (in Chinese).
14. Wiggers, Kyle (27 November 2024). "Alibaba releases an 'open' challenger to OpenAI's o1 reasoning model". TechCrunch.
15. Browne, Ryan (31 December 2024). "Alibaba slashes prices on large language models by up to 85% as China AI rivalry heats up". CNBC.
16. Franzen, Carl (29 August 2024). "Alibaba releases new AI model Qwen2-VL that can analyze videos more than 20 minutes long". VentureBeat. Archived from the original on 6 August 2025. Retrieved 29 April 2025.
17. 沛霖 (2024-08-30). "阿里通义千问推出 Qwen2-VL：开源 2B / 7B 参数 AI 大模型，处理任意分辨率图像无需分割成块". ITHome (in Chinese).
18. Wang, Peng; Bai, Shuai; Tan, Sinan; Wang, Shijie; Fan, Zhihao; Bai, Jinze; Chen, Keqin; Liu, Xuejing; Wang, Jialin; Ge, Wenbin; Fan, Yang; Dang, Kai; Du, Mengfei; Ren, Xuancheng; Men, Rui; Liu, Dayiheng; Zhou, Chang; Zhou, Jingren; Lin, Junyang (September 18, 2024). "Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution". arXiv:2409.12191 [cs.CV].
19. "Qwen2.5 VL! Qwen2.5 VL! Qwen2.5 VL!". Qwen. 2025-01-26. Retrieved 2025-04-28.
20. "Qwen/Qwen2.5-VL-72B-Instruct · Hugging Face". huggingface.co. 2025-04-28. Retrieved 2025-04-28.
21. Jiang, Ben (31 December 2024). "Alibaba Cloud cuts AI visual model price by 85% on last day of the year". South China Morning Post.
22. Franzen, Carl (8 August 2024). "Alibaba claims no. 1 spot in AI math models with Qwen2-Math". VentureBeat. Archived from the original on 18 January 2025. Retrieved 29 November 2024.
23. Mims, Christopher (April 19, 2024). "Here Come the Anti-Woke AIs". WSJ. Archived from the original on April 23, 2024. Retrieved November 29, 2024.
24. "Qwen2.5-Max: Exploring the Intelligence of Large-scale MoE Model". GitHub. 29 January 2025.
25. Baptista, Eduardo (January 29, 2025). "Alibaba releases AI model it says surpasses DeepSeek". Reuters.
26. "Qwen2.5-VL-32B: Smarter and Lighter". Qwen. 2025-03-24. Retrieved 2025-03-25.
27. Nikhil (2025-03-24). "Qwen Releases the Qwen2.5-VL-32B-Instruct: A 32B Parameter VLM that Surpasses Qwen2.5-VL-72B and Other Models like GPT-4o Mini". MarkTechPost. Retrieved 2025-03-25.
28. Dotson, Kyt (27 March 2025). "Alibaba releases new open-source AI model to power intelligent voice applications". SiliconANGLE.
29. Ara Shaikh, Jasmeen (April 28, 2025). "Alibaba unveils advanced Qwen 3 AI as Chinese tech rivalry intensifies". Reuters.
30. Wiggers, Kyle (28 April 2025). "Alibaba unveils Qwen3, a family of 'hybrid' AI reasoning models". TechCrunch. Archived from the original on 29 April 2025. Retrieved 29 April 2025.
31. "Qwen3-Next: Towards Ultimate Training & Inference Efficiency". Qwen Blog. September 10, 2025. Archived from the original on September 11, 2025. Retrieved September 13, 2025.
32. "Qwen/Qwen3-Omni-30B-A3B-Instruct · Hugging Face". huggingface.co. 2025-09-22. Retrieved 2025-09-23.
33. Cheng, Evelyn (2026-01-28). "One year after DeepSeek, Chinese AI firms from Alibaba to Moonshot race to release new models". CNBC. Retrieved 2026-02-13.
34. Jiang, Ben (13 September 2023). "Alibaba opens Tongyi Qianwen model to public as new CEO embraces AI". South China Morning Post.
35. Kharpal, Arjun (25 August 2023). "Alibaba launches AI model that can understand images and have more complex conversations". CNBC.
36. 沛霖 (2024-08-13). "阿里通义千问开源 Qwen2-Audio 7B 语音交互大模型：自由互动，无需输入文本". ITHome (in Chinese).
37. "Alibaba accelerates AI push by releasing new open-source models, text-to-video". Reuters. September 19, 2024.
38. Nuñez, Michael (12 November 2024). "Qwen2.5-Coder just changed the game for AI programming—and it's free". VentureBeat.^{[dead link]}
39. Dotson, Kyt (26 December 2024). "Alibaba announces advanced experimental visual reasoning QVQ-72B AI model". SiliconANGLE.
40. Wiggers, Kyle (27 January 2025). "Alibaba's Qwen team releases AI models that can control PCs and phones". TechCrunch.
41. Franzen, Carl (5 March 2025). "Alibaba's new open source model QwQ-32B matches DeepSeek-R1 with way smaller compute requirements". VentureBeat. Archived from the original on 16 May 2025. Retrieved 29 April 2025.
42. "Alibaba rolls out new AI coding model Qwen3-Coder, says it's their most powerful". Computerworld. Retrieved 2025-07-24.
43. "Qwen/Qwen3-Next-80B-A3B-Instruct · Hugging Face". huggingface.co. 2025-09-11. Retrieved 2025-09-13.
44. "Qwen3-VL: Sharper Vision, Deeper Thought, Broader Action". qwen.ai. 2025-09-22. Retrieved 2025-11-10.
45. "Qwen3-Coder-Next: Pushing Small Hybrid Models on Agentic Coding". qwen.ai. 2026-02-02. Retrieved 2026-02-05.
