[ONNX] Fix how shapes are computed for float4 #156353
Conversation
Signed-off-by: Justin Chu <justinchuby@users.noreply.github.com>
pytorch-bot bot commented Jun 18, 2025 • edited
🔗 Helpful Links 🧪 See artifacts and rendered test results at hud.pytorch.org/pr/156353
Note: Links to docs will display an error until the docs builds have been completed. ❌ 1 New Failure, 13 Pending as of commit b095aea with merge base c74fd35. NEW FAILURE - The following job has failed:
This comment was automatically generated by Dr. CI and updates every 15 minutes.
justinchuby commented Jun 18, 2025
@pytorchbot merge -i
pytorchmergebot commented Jun 18, 2025
Merge started. Your change will be merged while ignoring the following 1 check: Lint / lintrunner-clang / linux-job. Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
justinchuby commented Jun 18, 2025
@pytorchbot merge -f "all relevant tests passed"
pytorchmergebot commented Jun 18, 2025
The merge job was canceled or timed out. This most often happens if two merge requests were issued for the same PR, or if the merge job was waiting for more than 6 hours for tests to finish. In the latter case, please do not hesitate to reissue the merge command.
pytorchmergebot commented Jun 18, 2025
Merge started. Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team.
Changed the way we compute shapes for unpacked float4. Previously we always appended a last dimension [2] to the existing shape, but this doesn't really make sense because it prevents us from representing any shape other than those ending in a dimension of size 2. I updated the logic to `[*shape[:-1], shape[-1]*2]`, which doubles the last dimension instead. This is more in line with what we see in practice when people use 4-bit types, and it allows us to represent any shape with an even last dimension, which is much more reasonable in my opinion. Also clarified in #148791 (comment).
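To make the rule concrete, here is a minimal sketch (hypothetical helper names, not the actual PyTorch implementation) of how the unpacked float4 shape is derived from the packed uint8 shape under the new rule, contrasted with the old append-[2] behavior:

```python
# Sketch only: illustrates the shape rule described above, assuming the packed
# representation stores two 4-bit elements per byte along the last dimension.

def unpacked_float4_shape(packed_shape):
    """New rule: double the last dimension of the packed (uint8) shape."""
    if not packed_shape:
        raise ValueError("packed float4 data must have at least one dimension")
    *leading, last = packed_shape
    return [*leading, last * 2]


def old_unpacked_float4_shape(packed_shape):
    """Old rule: append a trailing dimension of 2, which can only describe
    shapes that end in a dimension of size 2."""
    return [*packed_shape, 2]


if __name__ == "__main__":
    print(unpacked_float4_shape([4, 8]))      # [4, 16]
    print(old_unpacked_float4_shape([4, 8]))  # [4, 8, 2]
```

With the new rule, any shape whose last dimension is even can be round-tripped between its packed and unpacked forms, rather than only shapes ending in [2].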