nvidia-nemo
Here are 21 public repositories matching this topic...
Language:All
Sort:Most stars
Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan
- Updated
Dec 5, 2023 - Python
Free AI & Community powered Learning Experience
- Updated
Nov 14, 2024 - TypeScript
Training NVIDIA NeMo Megatron Large Language Model (LLM) using NeMo Framework on Google Kubernetes Engine
- Updated
Apr 28, 2025 - HCL
LLM tutorial materials include but not limited to NVIDIA NeMo, TensorRT-LLM, Triton Inference Server, and NeMo Guardrails.
- Updated
Jun 26, 2025 - Python
This repository combines `WavLM`, a powerful speech representation model from Microsoft, with `MSDD` (Multi-Scale Diarization Decoder), a state-of-the-art approach for speaker diarization from Nvidia.
- Updated
Jun 17, 2025 - Jupyter Notebook
Extractive Question-Answering with BERT on SQuAD v2.0 (Stanford Question Answering Dataset) using NVIDIA PyTorch Lightning
- Updated
Apr 18, 2023 - Jupyter Notebook
Post-training quantization on Nvidia Nemo ASR model
- Updated
Aug 23, 2023 - Jupyter Notebook
📄 SmartSRT is a command-line tool for generating accurate subtitles with per-word timestamps. It uses WhisperAI for speech transcription, NVIDIA NeMo for diarization, and OpenCV for face recognition. The program is good at creating high accuracy subtitles. 🎧💻⚙️
- Updated
Feb 15, 2023
Automatic transcriber made with the Nvidia NeMo AI toolkit. Used to transcribe speech to text in real-time from any source. Requires CUDA capable GPU to run on the local machine, if setup using virtual audio cables can transcribe the audio that is being played in real-time without any other requirements.
- Updated
Oct 18, 2020 - Python
The simplest & most comprehensible tutorial on speaker identification with NVIDIA's `Nemo`.
- Updated
Aug 5, 2021 - Python
This bootcamp is designed to give NLP researchers an end-to-end overview on the fundamentals of NVIDIA NeMo framework, complete solution for building large language models. It will also have hands-on exercises complimented by tutorials, code snippets, and presentations to help researchers kick-start with NeMo LLM Service and Guardrails.
- Updated
Mar 7, 2024 - Jupyter Notebook
Training and Tunning a Text to speech model with Nvidia NeMo and Weights and Biases
- Updated
Dec 8, 2022 - Jupyter Notebook
FastAPI-based Hindi ASR app using NVIDIA NeMo + ONNX, with Docker support for easy deployment.
- Updated
May 25, 2025 - Python
Module for russian speech recognition using NVIDIA Nemo.
- Updated
Feb 12, 2023 - Python
Implementation of a Kazakh Speech-to-Text Model using the NVIDIA NeMo toolkit for efficient transcription of spoken Kazakh speech into text.
- Updated
Jan 22, 2024 - Python
End-to-End MLOps pipeline for Vietnamese ASR using NVIDIA NeMo. Features custom ETL for YouTube data, robust data validation, and a hybrid Local-to-Cloud workflow for deploying Conformer-CTC models on GPU.
- Updated
Dec 16, 2025 - Python
Diarizer system that takes as input .wav files and it transcribe the audio saying who spoke and when. This has been done using the NVIDIA Nemo Framework and pre-trained models.
- Updated
Nov 11, 2025 - Python
Audio profanity detector desktop app developed with PyQt5 using NVidia-Nemo tech.
- Updated
Dec 4, 2021 - Python
Autonomous Data Agent that cleans, analyzes, and models datasets using Python, RAPIDS, PyTorch, TensorFlow, XGBoost, LightGBM, CatBoost, SHAP/LIME, NeMo, and Streamlit, delivering GPU-accelerated, explainable insights.
- Updated
Oct 3, 2025 - Python
🤖 Intelligent Wikipedia research assistant powered by NVIDIA's NeMo Agent Toolkit. Features ReAct reasoning, multiple interfaces (CLI, interactive, Python), and comprehensive examples. Get started in minutes with NVIDIA NIM models.
- Updated
Aug 11, 2025 - Shell
Improve this page
Add a description, image, and links to thenvidia-nemo topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thenvidia-nemo topic, visit your repo's landing page and select "manage topics."