efficient-llm
Here are 7 public repositories matching this topic...
A curated list for Efficient Large Language Models
- Updated Mar 14, 2025 - Python
MobiLlama : Small Language Model tailored for edge devices
- Updated Mar 3, 2024 - Python
[ICML 2024] CLLMs: Consistency Large Language Models
- Updated Nov 16, 2024 - Python
[ICLR 2025] LLaVA-MoD: Making LLaVA Tiny via MoE-Knowledge Distillation
- Updated Jan 22, 2025 - Python
A summary repo for the Efficient AI direction. If you want to contribute to this repo, feel free to open a pull request!
- Updated May 29, 2024
[NAACL' 25 main] Lillama: Large Language Model Compression via Low-Rank Feature Distillation
- Updated Feb 10, 2025 - Python
A Curated Paper List for Efficient Large Models
- Updated Sep 18, 2024