Docs: Add precision support reference page #2111

Open
adeljo-amd wants to merge 1 commit into develop from prec_support

Conversation

adeljo-amd

resolves #___

Summary:

Add a precision support reference page that details the data types supported in Tensile.

Outcomes:

Affects only the documentation side of this project.

Notable changes:

Adds one RST file and modifies two files to add a link to it.

@adeljo-amd requested a review from SwRaw on March 21, 2025 13:16

@SwRaw (Contributor) left a comment

Nicely done.

.. _precision-support:

********************************
Precision Support

Suggested change
Precision Support
Precision support

********************************

Tensile supports a rich variety of data types for matrix multiplication operations, enabling optimized performance
across different precision requirements. This document outlines the supported data types and precision formats

Suggested change
across different precision requirements. This document outlines the supported data types and precision formats
across different precision requirements. This topic outlines the supported data types and precision formats

across different precision requirements. This document outlines the supported data types and precision formats
used in Tensile's GEMM implementations.

Data Types

Suggested change
Data Types
Data types

- ``__hip_fp8_e5m2`` / ``__hip_fp8_e5m2_fnuz``
- 8-bit
- | Brain float8 format with 5 exponent bits, 2 mantissa bits, and 1 sign bit. Provides greater dynamic range than
| F8 at the cost of reduced precision.

Suggested change
| F8 at the cost of reduced precision.
F8 at the cost of reduced precision.
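
For readers checking the table row quoted above, the 1 sign / 5 exponent / 2 mantissa layout can be illustrated with a small decoder. This is only a sketch: it assumes the IEEE-style interpretation of the layout with an exponent bias of 15 (the same as the upper byte of a 16-bit half float), and the function name `decode_e5m2` is hypothetical. The `_fnuz` variant uses different special-value encodings and is not covered here.

```python
def decode_e5m2(byte: int) -> float:
    """Decode one 8-bit value laid out as 1 sign, 5 exponent, 2 mantissa bits.

    Assumption: IEEE-style encoding with exponent bias 15. The *_fnuz
    variant is NOT handled by this sketch.
    """
    if not 0 <= byte <= 0xFF:
        raise ValueError("expected a value in the range 0..255")

    sign = -1.0 if (byte >> 7) & 0x1 else 1.0
    exponent = (byte >> 2) & 0x1F   # 5 exponent bits
    mantissa = byte & 0x3           # 2 mantissa bits

    if exponent == 0x1F:
        # All-ones exponent: infinity when the mantissa is zero, otherwise NaN.
        return sign * float("inf") if mantissa == 0 else float("nan")
    if exponent == 0:
        # Subnormal: no implicit leading one.
        return sign * (mantissa / 4.0) * 2.0 ** (1 - 15)
    # Normal: implicit leading one.
    return sign * (1.0 + mantissa / 4.0) * 2.0 ** (exponent - 15)


if __name__ == "__main__":
    for value in (0x3C, 0xC0, 0x7B, 0x01):
        print(f"0x{value:02X} -> {decode_e5m2(value)}")
```

With only two mantissa bits, neighbouring representable values differ by 25% of the leading power of two, which is the precision trade-off the description refers to.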

* - X
- N/A
- 32-bit
- | Tensorfloat equivalent with custom bit distribution for enhanced precision in specific computation patterns

Suggested change
- | Tensorfloat equivalent with custom bit distribution for enhanced precision in specific computation patterns
- | Tensorfloat equivalent to custom bit distribution. Used for enhanced precision in specific computation patterns

- # SGEMM
- {M: 5504, N: 5504, K: 5504, transposeA: false, transposeB: true, dataType: S}

**Half-Precision with Single-Precision Accumulation**

Suggested change
**Half-Precision with Single-Precision Accumulation**
**Half-precision with single-precision accumulation**

- # GEMM_EX (HHS)
- {M: 5504, N: 5504, K: 5504, transposeA: false, transposeB: true, dataType: H, destDataType: H, computeDataType: S}

**BFloat16 Input with Float32 Output**

Suggested change
**BFloat16 Input with Float32 Output**
**BFloat16 input with Float32 output**

- # GEMM_EX (BSS)
- {M: 4096, N: 4096, K: 4096, transposeA: false, transposeB: true, dataType: B, destDataType: S, computeDataType: S}

**8-bit Integer Operations**

Suggested change
**8-bit Integer Operations**
**8-bit integer operations**

- # GEMM_EX (I8II)
- {M: 4096, N: 4096, K: 4096, transposeA: false, transposeB: true, dataType: I8, destDataType: I, computeDataType: I}

**Mixed F8/B8 Input with Half Precision Output**

Suggested change
**Mixed F8/B8 Input with Half Precision Output**
**Mixed F8/B8 input with half precision output**

- # GEMM_EX
- {M: 5504, N: 5504, K: 5504, transposeA: false, transposeB: true, dataType: F8B8, destDataType: H, computeDataType: S}
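
The problem-size entries quoted in this review combine up to three fields: `dataType`, `destDataType`, and `computeDataType`. The sketch below shows one way to read them. The code meanings are inferred from the headings quoted in this review and are assumptions, as are the names `TYPE_CODES` and `describe`. It also assumes that an omitted `destDataType` or `computeDataType` falls back to `dataType`, as the SGEMM entry suggests; none of this is taken from Tensile's own code.

```python
# Hypothetical lookup table. The meanings are inferred from the example
# headings in this review, not from Tensile's source.
TYPE_CODES = {
    "S": "float32 (single precision)",
    "H": "float16 (half precision)",
    "B": "bfloat16",
    "I8": "int8",
    "I": "int32",
    "F8B8": "float8 for matrix A, bfloat8 for matrix B",
}


def describe(entry: dict) -> dict:
    """Return readable input/output/accumulation types for one entry.

    Assumption: missing destDataType / computeDataType default to dataType.
    """
    data = entry["dataType"]
    dest = entry.get("destDataType", data)
    compute = entry.get("computeDataType", data)

    for code in (data, dest, compute):
        if code not in TYPE_CODES:
            raise ValueError(f"unknown data type code: {code!r}")

    return {
        "input": TYPE_CODES[data],
        "output": TYPE_CODES[dest],
        "accumulate": TYPE_CODES[compute],
    }


if __name__ == "__main__":
    hhs = {"M": 5504, "N": 5504, "K": 5504,
           "dataType": "H", "destDataType": "H", "computeDataType": "S"}
    print(describe(hhs))
```

Read this way, the HHS entry takes half-precision inputs, writes half-precision output, and accumulates in single precision, which matches its heading.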

Library Logic File Naming

Suggested change
Library Logic File Naming
Library logic file naming

- S
- Matrix A is bfloat8, Matrix B is float8, with half precision output

Configuration in Tensile

Is this heading apt? How about "Data types in configuration files"?

Reviewers

@SwRaw left review comments

@neon60 approved these changes

@babakpst: awaiting requested review (code owner)

@yoichiyoshida: awaiting requested review (code owner)

@bragadeesh: awaiting requested review (code owner)

@AlexBrownAMD: awaiting requested review (code owner)

Assignees

@adeljo-amd

Labels
ci:docs-only (Docs only changes), Documentation

3 participants
@adeljo-amd, @neon60, @SwRaw
