Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings

Welcome to bitnet.cpp Discussions!#16

Oct 18, 2024· 7 comments· 3 replies
Discussion options

👋 Welcome!

We’re using Discussions as a place to connect with other members of our community. We hope that you:

  • Ask questions you’re wondering about.
  • Share ideas.
  • Engage with other community members.
  • Welcome others and are open-minded. Remember that this is a community we
    build together 💪.

To get started, comment below with an introduction of yourself and tell us about what you do with this community.

You must be logged in to vote

Replies: 7 comments 3 replies

Comment options

what does 1-bit means

You must be logged in to vote
2 replies
@Gabrielfernandes7
Comment options

In this context, "1-bit" refers to the precision or the number of bits used to represent each parameter or value within the model. Traditionally, AI models use higher precision (such as 16 or 32 bits per value), but in this case, BitNet uses a 1-bit representation, which is much more compact. This allows the model to be significantly more efficient in terms of memory usage and speed, while maintaining the precision of the inference process as "lossless."

@surajssc1232
Comment options

how can i use it and what is the minimum specs

Comment options

bonsoir
You must be logged in to vote
0 replies
Comment options

Hey, I'm a full-stack dev, highly interested in the potential of 1-bit LLM's to deliver major electricity savings and performance optimizations, I've thrown together aBitNet Electron app to run the inference executable without using a developer terminal, hoping to gain more dev experience where I can!

Thanks for creating this technology, keep up the good work!

You must be logged in to vote
1 reply
@Gabrielfernandes7
Comment options

Good job!

Comment options

it is interesting to get connect with great developer

You must be logged in to vote
0 replies
Comment options

👋 Welcome!

We’re using Discussions as a place to connect with other members of our community. We hope that you:

  • Ask questions you’re wondering about.
  • Share ideas.
  • Engage with other community members.
  • Welcome others and are open-minded. Remember that this is a community we
    build together 💪.

To get started, comment below with an introduction of yourself and tell us about what you do with this community.

You must be logged in to vote
0 replies
Comment options

I ask a first question.
I submitted three PRs in this repository last month regarding urgent issues.(#163,#164)
However, these have never been reviewed yet. What do you think I should do?
I would like to get feedbacks for these.

You must be logged in to vote
0 replies
Comment options

Hello,

I'm currently working with the BitNet repository and have successfully downloaded the config.json and model.safetensors weight files.
However, I'm finding that the Python code defining the BitNetForCausalLM model class (typically found in a file like modeling_bitnet.py or similar) appears to be missing from the repository.
Without this core model definition, it's not possible to properly instantiate the BitNet architecture in PyTorch and load the model.safetensors weights. This prevents further exploration and use of the pre-trained model.
Could you please provide the necessary Python code (the modeling_bitnet.py file or equivalent) that defines the BitNetForCausalLM class and its components, so that the model.safetensors weights can be loaded correctly?

You must be logged in to vote
0 replies
Sign up for freeto join this conversation on GitHub. Already have an account?Sign in to comment
Labels
None yet
9 participants
@shumingma@grctest@y-vectorfield@Gabrielfernandes7@preethi-github-29@surajssc1232@AnandKumarSinha07@Wiwuth@Theajoseph

[8]ページ先頭

©2009-2025 Movatter.jp