This repository was archived by the owner on Nov 1, 2024. It is now read-only.

facebookresearch/metaseq (public archive)

Repo for external large-scale work

Folders and files

.circleci
.github
cpu_tests
docs
gpu_tests
metaseq
preprocessing
projects
tests
third_party
.flake8
.gitignore
.gitmodules
.pre-commit-config.yaml
CHANGELOG.md
CODEOWNERS
CODE_OF_CONDUCT.md
Dockerfile
LICENSE
README.md
mypy.ini
setup.py

Metaseq

A codebase for working with Open Pre-trained Transformers, originally forked from fairseq.

Community Integrations

Using OPT with 🤗 Transformers

The OPT 125M--66B models are now available in Hugging Face Transformers. You can access them under the facebook organization on the Hugging Face Hub.
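As an illustration, loading one of these checkpoints might look like the sketch below. It assumes the `transformers` and `torch` packages are installed and that the checkpoints follow the `facebook/opt-<size>` naming used on the Hub. The size list and the helper names `opt_model_id` and `generate` are illustrative and not part of metaseq.

```python
# Sizes in the 125M--66B range; this list is an assumption based on the
# checkpoints published under the facebook organization on the Hub.
OPT_HUB_SIZES = ("125m", "350m", "1.3b", "2.7b", "6.7b", "13b", "30b", "66b")


def opt_model_id(size: str) -> str:
    """Map a size such as '125m' to a Hub id such as 'facebook/opt-125m'."""
    size = size.lower()
    if size not in OPT_HUB_SIZES:
        raise ValueError(f"unknown OPT size {size!r}; expected one of {OPT_HUB_SIZES}")
    return f"facebook/opt-{size}"


def generate(prompt: str, size: str = "125m", max_new_tokens: int = 20) -> str:
    """Download the checkpoint (network access required) and continue the prompt."""
    # Imported lazily so opt_model_id() works without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    model_id = opt_model_id(size)
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id)
    inputs = tokenizer(prompt, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate("Hello, my name is")` would download the 125M checkpoint on first use. The larger sizes need considerably more memory and disk space.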

Using OPT-175B with Alpa

The OPT 125M--175B models are now supported in the Alpa project, which enables serving OPT-175B with more flexible parallelism on older generations of GPUs, such as 40GB A100, V100, T4, M60, etc.
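To see why serving the largest model requires splitting it across devices, a rough back-of-envelope estimate can help. The sketch below counts weight storage only, assuming 2 bytes per parameter (fp16) and a nominal 175 billion parameters. It ignores activations, the KV cache, and framework overhead, so real deployments need more memory. The function names are illustrative and do not come from Alpa or metaseq.

```python
import math


def weight_memory_gb(num_params: int, bytes_per_param: int = 2) -> float:
    """Memory for the weights alone, in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


def min_devices_for_weights(
    num_params: int, device_memory_gb: float, bytes_per_param: int = 2
) -> int:
    """Lower bound on device count: weights only, no activations or KV cache."""
    return math.ceil(weight_memory_gb(num_params, bytes_per_param) / device_memory_gb)


OPT_175B_PARAMS = 175_000_000_000  # nominal parameter count (assumption)

print(weight_memory_gb(OPT_175B_PARAMS))             # 350.0
print(min_devices_for_weights(OPT_175B_PARAMS, 40))  # 9 (40GB A100)
```

Under these assumptions, the fp16 weights alone occupy about 350 GB, so at least nine 40GB devices are needed before any runtime memory is counted.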

Using OPT with Colossal-AI

The OPT models are now supported in Colossal-AI, which helps users deploy OPT training and inference quickly and efficiently, reducing the budget for large AI models and the labor cost of learning and deployment.

Using OPT with CTranslate2

The OPT 125M--66B models can be executed with CTranslate2, which is a fast inference engine for Transformer models. The project integrates the SmoothQuant technique to allow 8-bit quantization of OPT models. See the usage example to get started.

Using OPT with FasterTransformer

The OPT models can be served with FasterTransformer, a highly optimized inference framework written and maintained by NVIDIA. We provide instructions to convert OPT checkpoints into FasterTransformer format and a usage example with some benchmark results.

Using OPT with DeepSpeed

The OPT models can be finetuned using DeepSpeed. See the DeepSpeed-Chat example to get started.

Getting Started in Metaseq

Follow setup instructions here to get started.

Documentation on workflows

Background Info

Support

If you have any questions, bug reports, or feature requests regarding either the codebase or the models released in the projects section, please don't hesitate to post on our GitHub Issues page.

Please remember to follow our Code of Conduct.

Contributing

We welcome PRs from the community!

You can find information about contributing to metaseq in our Contributing document.

The Team

Metaseq is currently maintained by the CODEOWNERS: Susan Zhang, Naman Goyal, Punit Singh Koura, Moya Chen, Kurt Shuster, David Esiobu, Igor Molybog, Peter Albert, Andrew Poulton, Nikolay Bashlykov, Binh Tang, Uriel Singer, Yuchen Zhang, Armen Aghajanyan, Lili Yu, and Adam Polyak.

License

The majority of metaseq is licensed under the MIT license; however, portions of the project are available under separate license terms:

Megatron-LM is licensed under the Megatron-LM license.
