megatron
Here are 10 public repositories matching this topic...
Language:All
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, MiniCPM-V-2.6, GLM4v, Xcomposer2.5, DeepSeek-VL2, Phi4, GOT-OCR2, ...).
- Updated
Apr 28, 2025 - Python
Megatron was a telegram file management bot that helped a lot of users, specially movie channel managers to upload their files to telegram by just providing a link to it. The project initially started as roanuedhuru_bot which lately retired and came back as Megatron which was a side project of the famous Maldivian Telegram community -@baivaru u…
- Updated
Jun 27, 2021 - Python
Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*
- Updated
Dec 14, 2023 - Python
A LLaMA1/LLaMA12 Megatron implement.
- Updated
Dec 13, 2023 - Python
Megatron was a telegram file management bot that helped a lot of users, specially movie channel managers to upload their files to telegram by just providing a link to it. The project initially started as roanuedhuru_bot which lately retired and came back as Megatron which was a side project of the famous Maldivian Telegram community -
- Updated
Apr 21, 2021 - Python
Tiny-Megatron, a minimalistic re-implementation of the Megatron library
- Updated
Aug 6, 2024 - Python
Wrapped Megatron: As User-Friendly as HuggingFace, As Powerful as Megatron-LM | Megatron封装:和HuggingFace一样方便,和Megatron-LM一样强大
- Updated
Mar 22, 2025 - Python
- Updated
Jun 22, 2024 - JavaScript
Improve this page
Add a description, image, and links to themegatron topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with themegatron topic, visit your repo's landing page and select "manage topics."