- Level AI
- India
- https://shreyansh26.github.io
- @shreyansh_26
Highlights
I’m aPrincipal ML Engineer atLevel AI, where I focus on building and scaling large language models (LLMs) specifically for conversational AI. With over four years of experience in applied AI and research, I’ve worked extensively on end-to-end solutions in NLP and ML systems.
Before Level AI, I worked as aData Scientist atMastercard AI Garage, where I developed AI models to enhance transaction security and intelligence. I graduated in 2020 with a degree in Computer Science from theIndian Institute of Technology (BHU) Varanasi.
My technical interests include Natural Language Processing, ML Systems Engineering—including CUDA and Triton for high-performance computing, Privacy-preserving ML, and Cryptography.
I’m always working on side projects, many of which involve implementing and experimenting with ideas from research papers, efficient kernels and other low-level stuff in LLM training/inference regime. You can find these projects here.
- 𝕏 Twitter/X:@shreyansh_26
- 👥 LinkedIn:Shreyansh Singh
- 💻 Website:https://shreyansh26.github.io
- Deep dive into CUDA Scan Kernels: Hierarchical and Single-Pass Variants
- Paper Summary #14 - Physics of Language Models: Part 3.1, Knowledge Storage and Extraction
- Understanding Multi-Head Latent Attention (MLA)
- Deriving the Gradient for the Backward Pass of Layer Normalization
- Notes from GTC’25: CUDA Techniques to Maximize Compute and Instruction Throughput
PinnedLoading
- FlashAttention-PyTorch
FlashAttention-PyTorch PublicImplementation of FlashAttention in PyTorch
- Extracting-Training-Data-from-Large-Langauge-Models
Extracting-Training-Data-from-Large-Langauge-Models PublicA re-implementation of the "Extracting Training Data from Large Language Models" paper by Carlini et al., 2020
- Speculative-Sampling
Speculative-Sampling PublicImplementation of Speculative Sampling as described in "Accelerating Large Language Model Decoding with Speculative Sampling" by Deepmind
- LLM-Sampling
LLM-Sampling PublicA collection of various LLM sampling methods implemented in pure Pytorch
- Linux-Malware-Detection-Research
Linux-Malware-Detection-Research PublicA collection of Linux Malware Detection projects (research paper implementations) done by me.
If the problem persists, check theGitHub status page orcontact support.
Uh oh!
There was an error while loading.Please reload this page.


