Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

IST-DASLab

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
@IST-DASLab

IST Austria Distributed Algorithms and Systems Lab

Popular repositoriesLoading

  1. gptqgptqPublic

    Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

    Python 2.1k 164

  2. sparsegptsparsegptPublic

    Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

    Python 776 102

  3. marlinmarlinPublic

    FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

    Python 771 62

  4. PanzaMailPanzaMailPublic

    Python 284 17

  5. qmoeqmoePublic

    Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

    Python 271 22

  6. QUIKQUIKPublic

    Repository for the QUIK project, enabling the use of 4bit kernels for generative inference - EMNLP 2024

    C++ 177 14

Repositories

Loading
Type
Select type
Language
Select language
Sort
Select order
Showing 10 of 56 repositories
  • torch_cgx Public

    Pytorch distributed backend extension with compression support

    IST-DASLab/torch_cgx’s past year of commit activity
    C++ 15AGPL-3.00 4 0 UpdatedMar 21, 2025
  • QuEST Public

    Work in progress.

    IST-DASLab/QuEST’s past year of commit activity
    Jupyter Notebook 50MIT 4 2 0 UpdatedMar 17, 2025
  • gemm-int8 Public

    High Performance Int8 GEMM Kernels for SM80 and later GPUs.

    IST-DASLab/gemm-int8’s past year of commit activity
    Python 6MIT0 0 0 UpdatedMar 11, 2025
  • DarwinLM Public

    Official Pytorch Implementation of Paper "DarwinLM: Evolutionary Structured Pruning of Large Language Models"

    IST-DASLab/DarwinLM’s past year of commit activity
    Python 9 2 0 0 UpdatedFeb 21, 2025
  • IST-DASLab/ISTA-DASLab-Optimizers’s past year of commit activity
    Python 8Apache-2.00 0 0 UpdatedFeb 19, 2025
  • ScalableMNN Public

    Official Repository for "Scalable Mechanistic Neural Networks" (ICLR 2025)

    IST-DASLab/ScalableMNN’s past year of commit activity
    Python 1MIT0 0 0 UpdatedFeb 19, 2025
  • SPADE Public

    Code of SPADE: Sparsity Guided Debugging for Deep Neural Networks

    IST-DASLab/SPADE’s past year of commit activity
    Jupyter Notebook 1 3 1 0 UpdatedFeb 18, 2025
  • PanzaMail Public
    IST-DASLab/PanzaMail’s past year of commit activity
    Python 284Apache-2.0 17 4 5 UpdatedFeb 17, 2025
  • HALO Public

    HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation ofhttps://arxiv.org/abs/2501.02625

    IST-DASLab/HALO’s past year of commit activity
    Python 11MIT0 1 0 UpdatedFeb 17, 2025
  • EvoPress Public
    IST-DASLab/EvoPress’s past year of commit activity
    Python 19 2 0 0 UpdatedFeb 13, 2025

Top languages

Loading…

Most used topics

Loading…


[8]ページ先頭

©2009-2025 Movatter.jp