Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Huawei PanGu

From Wikipedia, the free encyclopedia
Large language model developed by Huawei

Huawei PanGu
DeveloperHuawei
Initial release1.0, July 2021; 4 years ago (2021-07)
Stable release
5.5 / June 20, 2025; 5 months ago (2025-06-20)
Repositorygitcode.com/ascend-tribe
Available inChinese,English,Russian
TypeLarge language model
LicenseOpen source

Huawei PanGu,PanGu,PanGu-Σ,PanGu-π also known asopenPangu (Chinese:盘古大模型;pinyin:pángǔ dà móxíng) is amultimodallarge language model developed byHuawei. It was officially launched on July 2021.[1]

The name of the large learning language model,PanGu, was derived from the Chinese mythology and folklore ofPangu, a primordial character related to the creation of the world.[2]

History

[edit]

Early development

[edit]

In April 2023, Huawei released a paper detailing the development of PanGu-Σ, a colossal language model featuring 1.085 trillion parameters. Developed within Huawei'sMindSpore 5 framework, PanGu-Σ underwent training for over 100 days on a cluster system equipped with 512 Ascend 910 AI accelerator chips, processing 329 billion tokens in more than 40natural andprogramming languages.[3]

PanGu-Σ incorporates Random Routed Experts (RRE) and the Transformer decoder architecture, allowing easy extraction of sub-models for various applications like conversation, translation, code production, and natural language interpretation. The model achieves 6.3 times faster training throughput compared toMoE models with the same hyper-parameters. In the Chinese domain, it outperforms previous state-of-the-art models across 16 tasks in a zero-shot setting. Trained on datasets from 40 domains, including Chinese, English, Bilingual, and code, PanGu-Σ excels infew-shotnatural-language understanding, open-domain discussion, question answering, machine translation, and code creation.[4][5]

Launch

[edit]

During the Huawei Developer Conference on July 7, 2023, Huawei introduced PanGu 3.0, a large language model (LLM), tailored for sectors like government, finance, manufacturing, mining, and meteorology utilizingHuawei Cloud [zh] solutions. In the subsequent month, Huawei launched theCelia Virtual Assistant with advanced AI features, capable of generating long text replies based on user voice commands and set to release withHarmonyOS 4.0 for eligible devices.[6][7]

The LLM was designed for enterprises seeking advantages in the AI industry, focusing on task execution over creative work, unlike traditional models used for general purposes like chatbots, poetry, and visual content creation.[8]

Using the same technology asChatGPT, Huawei's LLM features a hierarchical architecture, allowing customers to adapt the model to various tasks and train it on their own datasets, making it versatile across various industries.[9]

Updates

[edit]

On August 5, 2023,Huawei partnered withEuropean Centre for Medium-Range Weather Forecasts (ECMWF) to launch a global weather forecasting AI model. This model used Huawei Cloud solutions and the PanGu-Weather Model withMindSpore. It is accessible on the ECMWF website and aims to provide accurate weather data.[10][11]

On December 19, 2023, Huawei announced its financial services on the PanGu-powered AI Finance platform for the global market. The tech giant introduced this product at the 2023 Huawei Cloud Fintech Summit, aiming to reshape the digital finance industry with efficient features to boost Fintech firms worldwide. The platform incorporated a variety of advanced technologies, including AI, big data analytics, and blockchain.[12]

On June 21, 2024, at HDC 2024, Huawei announced upgraded PanGu 5.0 alongsideHarmonyOS NEXT. This version integrated withHarmony Intelligence, which features a smarterCelia (Xiaoyi) and focuses on generative AI updates to itsLLM platform for creating new content, such as text, code, or images. Aiming to make PanGu accessible to a wide range of developers and businesses, it offered scalable options: smaller models requiring less computational power for those with limited resources, and larger models with increased capacities for complex tasks requiring more processing power.[13]

On June 20, 2025 at Huawei Developer Conference, the company released Pangu Models 5.5 version, a 718-billion parameter industrial focused AI platform.[14]

On June 30, 2025 Huawei has open-sourced its Pangu models as openPangu AI models, including a 7-billion-parameter openPangu model and a 72-billion-parameter openPangu Pro MoE (Mixture-of-Experts) model. The release also featured model inference technology optimized for Huawei’s Ascend AI accelerator chips.[15]

At Huawei Connect 2025 event, September 29, 2025, it announced a roadmap to fully open source its open-source AI software stack withMindSpore toolchains, openPangu models and CANN interfaces by December 31, 2025.[16]

Technical specifications

[edit]

PanGu Large Model 3.0, designed for industry use, was structured with a 5+N+X three-tier architecture.[17]

  • First Layer (L0): Comprises PanGu's five basic large models to provide a variety of capabilities for different industry scenarios. These include Natural Language Processing (NLP) models, Visual models, Multimodal models, Prediction models, and Scientific Computing models.
  • Second Layer (L1): Consists of N large industry-specific models. These models are trained using public data from various industries, such as government, finance, manufacturing, mining, and weather. Additionally, it uses customers' own data from L0 and L1 to train proprietary models tailored for each customer.
  • Third Layer (L2): Provides customers with detailed scenario-specific models. This layer focuses on specific applications or business needs, offering ready-to-use model services.

The updated Huawei PanGu Model 5.0 by Huawei Cloud business division offered three key features: adaptability for different business scenarios, multi-style modeling, and advanced intelligence. Huawei divided the AI model platform into four series, each with different parameter scales:[18]

  • PanGu E Series: The Embedded version supports smart apps on phones, tablets, PCs, and other devices, with a parameter scale of 1 billion.
  • PanGu P Series: The Professional version features a 10-billion parameter scale, ideal for low-latency and low-cost reasoning conditions.
  • PanGu U Series: The Ultra version comes in two variants, with 135 billion and 230 billion parameters, capable of handling complex tasks and serving as a base for large models.
  • PanGu S Series: The Super PanGu is the top-tier edition, featuring trillion-level parameters, designed to manage advanced AI technology scenarios such as cross-domain or multi-tasking applications.

Controversy

[edit]

On July 4, 2025, some researchers alleged on GitHub that there is an extremely high similarity in the attention parameter distribution between the Pangu Pro MoE model andAlibaba'sQwen model, using "model fingerprinting" technology. The next day, Huawei Noah's Ark Lab, the development team, responded that Pangu is a foundational large model self-developed onAscend hardware and not incrementally trained on other models. They added that they had made compliant attributions in strict accordance with open-source licenses, a common practice in the community. The original repository with the accusation has since been deleted.[19][20][21]

See also

[edit]

References

[edit]
  1. ^"Reshaping Industries with AI: Huawei Cloud launches Pangu Models 3.0 and Ascend AI Cloud services".CITI Newsroom. July 9, 2023. RetrievedFebruary 13, 2024.
  2. ^Nair, Arya M. (July 8, 2023)."Huawei rolls out latest version of its deep learning AI model, Pangu - GCC Business News".GCC Business News. RetrievedMay 29, 2024.
  3. ^Upadhyay, Shyam Nandan (April 3, 2023)."Huawei Researchers Develop LLM With 1.085 Trillion Parameters".AnalyticsIndiaMag. RetrievedFebruary 13, 2024.
  4. ^"Huawei Researchers Unveil Pangu-Σ: Trillion-Parameter Language Model with Sparse Architecture".Multiplatform.ai. RetrievedFebruary 13, 2024.
  5. ^Tickoo, Aneesh."Huawei Researchers Develop Pangu-Σ: A Large Language Model With Sparse Architecture And 1.085 Trillion Parameters".marktechpost.com. RetrievedFebruary 13, 2024.
  6. ^"Huawei Pangu AI models for Government, finance, manufacturing, mining, meteorology".HC Newsroom. July 23, 2023. RetrievedFebruary 13, 2024.
  7. ^Sarkar, Amy (August 4, 2023)."Huawei launches Voice Assistant with large Pangu AI model".HC Newsroom. RetrievedFebruary 13, 2024.
  8. ^"Revolutionizing Global AI Landscape: Huawei's PanGu Megamodel Set to Transform Industries Worldwide".LinkedIn. Grosso Link Sàrl. RetrievedFebruary 13, 2024.
  9. ^Jarrett, Miranda (July 7, 2023)."Huawei to revolutionise applications of AI with new Pangu model".Dao Insights. RetrievedFebruary 13, 2024.
  10. ^Li, Deng (August 5, 2023)."Huawei Pangu-Weather Model debuts European ECMWF website".HC Newsroom. RetrievedFebruary 13, 2024.
  11. ^Mishra, Yash (October 9, 2023)."Huawei Cloud will build large-scale high-precision regional weather forecast Pangu model".HC Newsroom. RetrievedFebruary 13, 2024.
  12. ^Birch, Scott (December 19, 2023)."Huawei Cloud and Pangu AI model reshaping finance industry".FinTech Magazine. RetrievedFebruary 13, 2024.
  13. ^Staff Writer (June 22, 2024)."Huawei Unveils New Harmony OS And AI Model In Continued Drive For Tech Self-reliance".Elnion. RetrievedJuly 7, 2024.
  14. ^Law, Marcus."What Huawei Pangu 5.5 Models Mean for Industrial AI".Technology Magazine. Technology Magazine. RetrievedNovember 10, 2025.
  15. ^"Huawei open-sources Pangu AI models, optimized for Ascend chips".techinasia.com. techinasia.com. RetrievedNovember 10, 2025.
  16. ^"Huawei Open-Sources AI Stack: CANN, MindSpore & openPangu at Connect 2025".poniak. poniak. RetrievedNovember 10, 2025.
  17. ^"Huawei launches latest AI model, Pangu 3.0".Business Today (Malaysia). July 8, 2023. RetrievedFebruary 13, 2024.
  18. ^Matsui, Emiko (June 21, 2024)."Huawei Cloud unveils Pangu Large Model 5.0".Huawei Central. RetrievedJuly 7, 2024.
  19. ^钛媒体 (July 5, 2025)."华为团队回应盘古开源AI模型抄袭争议:并非基于其他模型增量训练,已严格遵循开源许可".Sina Fiance. RetrievedJuly 6, 2025.
  20. ^Guancha."华为盘古团队声明:严格遵循开源要求".Sina Fiance. RetrievedJuly 6, 2025.
  21. ^"Huawei's AI lab denies that one of its Pangu models copied Alibaba's Qwen".Reuters. July 7, 2025. RetrievedJuly 7, 2025.
Smart devices
Phones
Ascend
P/Pura
series
Mate
series
Foldable
series
Nova
series
G
series
Y
series
Tablets
Laptops
Wearables
CPU/NPU
Operating
systems
Other,
software
Huawei logo
Communication
infrastructure
Services
People
Other
Concepts
Applications
Implementations
Audio–visual
Text
Decisional
People
Architectures
Retrieved from "https://en.wikipedia.org/w/index.php?title=Huawei_PanGu&oldid=1323597646"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp