transformer-architecture
Here are 379 public repositories matching this topic...
An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/
- Updated Feb 11, 2024 - Python
A comprehensive paper list for Vision Transformer/Attention, including papers, code, and related websites
- Updated Jul 30, 2024
Inference Llama 2 in one file of pure 🔥
- Updated Nov 23, 2025 - Mojo
Self-contained Machine Learning and Natural Language Processing library in Go
- Updated Apr 1, 2025 - Go
Code for CRATE (Coding RAte reduction TransformEr).
- Updated Oct 23, 2024 - Python
Sequence-to-sequence framework with a focus on Neural Machine Translation based on PyTorch
- Updated Oct 24, 2024 - Python
Implementation of the Swin Transformer in PyTorch.
- Updated Mar 29, 2021 - Python
Minimalist NMT for educational purposes
- Updated Jan 29, 2024 - Python
Build high-performance AI models with modular building blocks
- Updated Nov 7, 2025 - Python
The repository for ET-BERT, a network traffic classification model for encrypted traffic. The work was accepted as a paper at The Web Conference (WWW) 2022.
- Updated Nov 6, 2025 - Python
🌕 [BMVC 2022] You Only Need 90K Parameters to Adapt Light: A Light Weight Transformer for Image Enhancement and Exposure Correction. SOTA for low light enhancement, 0.004 seconds try this for pre-processing.
- Updated Feb 27, 2024 - Python
[IGARSS'22]: A Transformer-Based Siamese Network for Change Detection
- Updated Jan 31, 2024 - Python
CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation
- Updated Dec 30, 2022 - Python
[ECCV 2022] Official repository for "MaxViT: Multi-Axis Vision Transformer". SOTA foundation models for classification, detection, segmentation, image quality, and generative modeling...
- Updated Jun 2, 2023 - Jupyter Notebook
A novel implementation of fusing ViT with Mamba into a fast, agile, and high performance Multi-Modal Model. Powered by Zeta, the simplest AI framework ever.
- Updated Nov 3, 2025 - Python
This repo contains the updated version of all the assignments/labs (done by me) of Deep Learning Specialization on Coursera by Andrew Ng. It includes building various deep learning models from scratch and implementing them for object detection, facial recognition, autonomous driving, neural machine translation, trigger word detection, etc.
- Updated Feb 9, 2024 - Jupyter Notebook
Attention Is All You Need | a PyTorch Tutorial to Transformers
- Updated Feb 22, 2024 - Python
SeqFormer: Sequential Transformer for Video Instance Segmentation (ECCV 2022 Oral)
- Updated Aug 2, 2022 - Python
Notes about "Attention is all you need" video (https://www.youtube.com/watch?v=bCz4OMemCcA)
- Updated May 28, 2023
PyContinual (An Easy and Extendible Framework for Continual Learning)
- Updated Jan 29, 2024 - Python