Ethan He ethanhe42

🚀

Focusing

Achievements

ethanhe42/README.md

I'm an engineer at xAI focusing on multimodal, video generation and world models. My ultimate goal is to build multimodal AGI[0],[1],[2]

🤗 Open Source Projects:

🎙️ Invited Talks

NVIDIA-NeMo/NeMoNVIDIA-NeMo/NeMoPublic
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
Python 16.8k 3.3k
NVIDIA/Megatron-LMNVIDIA/Megatron-LMPublic
Ongoing research training transformer models at scale
Python 15.2k 3.6k
NVIDIA-NeMo/DFMNVIDIA-NeMo/DFMPublic
State-of-the-art framework for fast, large-scale training and inference of diffusion models
Python 30 3
NVIDIA/Cosmos-TokenizerNVIDIA/Cosmos-TokenizerPublic archive
A suite of image and video neural tokenizers
Jupyter Notebook 1.7k 86
channel-pruningchannel-pruningPublic
Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)
Python 1.1k 308
KL-LossKL-LossPublic
Bounding Box Regression with Uncertainty for Accurate Object Detection (CVPR'19)
Python 721 105