hyparam/hyllamaPublic

NotificationsYou must be signed in to change notification settings
Fork3
Star43

llama.cpp gguf file parser for javascript

License

MIT license

43 stars 3 forks Branches Tags Activity

Star

Notifications

You must be signed in to change notification settings

Branches Tags

Folders and files

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
.github/workflows		.github/workflows
src		src
test		test
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
eslint.config.js		eslint.config.js
hyllama.jpg		hyllama.jpg
index.html		index.html
package.json		package.json
tsconfig.json		tsconfig.json

Repository files navigation

hyllama

Javascript parser forllama.cpp gguf files.

This library makes it easy to parse metadata from GGUF files.

llama.cpp was originally an implementation of meta's llama model in C++, particularly on apple m-series chips.But it has quickly evolved into a powerful tool for running various trained LLM models, on cpu or gpu.The runtime has minimal dependencies and so is easy to deploy.Model files are frequently distributed as .gguf files which contain all the info needed to run a model including architecture and weights.Hugging Face provides a great collection of serialized gguf model files, at varying levels of quantization.

Model files are often very large.A goal of this library is to parse the file efficiently, without loading the entire file.

Dependency free since 2023!

Installation

npm install hyllama

Usage

Node.js

If you're in a node.js environment, you can load a .gguf file with the following example:

const{ ggufMetadata}=awaitimport('hyllama')constfs=awaitimport('fs')// Read first 10mb of gguf fileconstfd=fs.openSync('example.gguf','r')constbuffer=newUint8Array(10_000_000)fs.readSync(fd,buffer,0,10_000_000,0)fs.closeSync(fd)// Parse metadata and tensor infoconst{ metadata, tensorInfos}=ggufMetadata(buffer.buffer)

Browser

If you're in a browser environment, you'll probably get .gguf file data from either a drag-and-dropped file from the user, or downloaded from the web.

To load .gguf data in the browser from a remoteurl, it is recommended that you use anHTTP range request to get just the first bytes:

import{ggufMetadata}from'hyllama'constheaders=newHeaders({Range:'bytes=0-10000000'})constres=awaitfetch(url,{ headers})constarrayBuffer=awaitres.arrayBuffer()const{ metadata, tensorInfos}=ggufMetadata(arrayBuffer)

To parse .gguf files from a user drag-and-drop action, see example inindex.html.

File Size

Since .gguf files are typically very large, it is recommended that you only load the start of the file that contains the metadata.How many bytes you need for the metadata depends on the gguf file.In practice, most .gguf files have metadata that takes up a few megabytes.If you get an error "RangeError: Offset is outside the bounds of the DataView" then you probably didn't fetch enough bytes.

References

Contributions

Contributions are welcome!

Hyparquet development is supported by an open-source grant from Hugging Face 🤗

About

llama.cpp gguf file parser for javascript

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

License

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

hyllama

Installation

Usage

Node.js

Browser

File Size

References

Contributions

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages

Uh oh!

Contributors2

Languages

Movatterモバイル変換

License

hyparam/hyllama

Folders and files

Latest commit

History

Repository files navigation

hyllama

Installation

Usage

Node.js

Browser

File Size

References

Contributions

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages0

Uh oh!

Contributors2

Languages

Packages