Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
@ethanhe42
ethanhe42
Follow
View ethanhe42's full-sized avatar
🚀
Focusing

Ethan He ethanhe42

🚀
Focusing

Block or report ethanhe42

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more aboutblocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more aboutreporting abuse.

Report abuse
ethanhe42/README.md

I'm an engineer atxAI focusing on multimodal, video generation and world models. My ultimate goal is to build multimodal AGI[0],[1],[2]

🤗 Open Source Projects:

  • Cosmos: state-of-the-art generative world models
  • NeMo DFM: large-scale training and inference framework for diffusion models
  • Megatron-LM MoE: Scaling up mixture of experts
  • NeMo: scalable training framework for LLMs transformers
  • LongVILA: Long-Context VLM for long videos (ICLR'25)
  • ActGPT: browser-use agent
  • Channel Pruning: Accelerating Very Deep Neural Networks (ICCV'17)
  • Epipolar Transformers: Accurate multi-camera pose understanding (CVPR'20)
  • AMC: AutoML for model compression (ECCV'18)
  • KL Loss: Accurate Object Detection (CVPR'19)
  • FSAF: single-shot object detection (CVPR'19)

🎙️ Invited Talks

🤓Grok Heavy Tungsten Cube

PinnedLoading

  1. NVIDIA-NeMo/NeMoNVIDIA-NeMo/NeMoPublic

    A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

    Python 16.8k 3.3k

  2. NVIDIA/Megatron-LMNVIDIA/Megatron-LMPublic

    Ongoing research training transformer models at scale

    Python 15.2k 3.6k

  3. NVIDIA-NeMo/DFMNVIDIA-NeMo/DFMPublic

    State-of-the-art framework for fast, large-scale training and inference of diffusion models

    Python 30 3

  4. NVIDIA/Cosmos-TokenizerNVIDIA/Cosmos-TokenizerPublic archive

    A suite of image and video neural tokenizers

    Jupyter Notebook 1.7k 86

  5. channel-pruningchannel-pruningPublic

    Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)

    Python 1.1k 308

  6. KL-LossKL-LossPublic

    Bounding Box Regression with Uncertainty for Accurate Object Detection (CVPR'19)

    Python 721 105


[8]ページ先頭

©2009-2026 Movatter.jp