********************************

		Tensile supports a rich variety of data types for matrix multiplication operations, enabling optimized performance
		across different precision requirements. This document outlines the supported data types and precision formats

Copy link

Contributor

SwRawMar 25, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Suggested change

	across different precision requirements. Thisdocument outlines the supported data types and precision formats
	across different precision requirements. Thistopic outlines the supported data types and precision formats

docs/src/reference/precision-support.rst

		across different precision requirements. This document outlines the supported data types and precision formats
		used in Tensile's GEMM implementations.

		Data Types

Copy link

Contributor

SwRawMar 26, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Suggested change

	DataTypes
	Datatypes

docs/src/reference/precision-support.rst

		- ``__hip_fp8_e5m2`` / ``__hip_fp8_e5m2_fnuz``
		- 8-bit
		- \| Brain float8 format with 5 exponent bits, 2 mantissa bits, and 1 sign bit. Provides greater dynamic range than
		\| F8 at the cost of reduced precision.

Copy link

Contributor

SwRawMar 26, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Suggested change

	\|F8 at the cost of reduced precision.
	F8 at the cost of reduced precision.

docs/src/reference/precision-support.rst

		* - X
		- N/A
		- 32-bit
		- \| Tensorfloat equivalent with custom bit distribution for enhanced precision in specific computation patterns

Copy link

Contributor

SwRawMar 26, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Suggested change

	- \| Tensorfloat equivalentwith custom bit distribution for enhanced precision in specific computation patterns
	- \| Tensorfloat equivalentto custom bit distribution. Used for enhanced precision in specific computation patterns

docs/src/reference/precision-support.rst

		- # SGEMM
		- {M: 5504, N: 5504, K: 5504, transposeA: false, transposeB: true, dataType: S}

		Half-Precision with Single-Precision Accumulation

Copy link

Contributor

SwRawMar 26, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Suggested change

	Half-Precision withSingle-Precision Accumulation
	Half-precision withsingle-precision accumulation

docs/src/reference/precision-support.rst

		- # GEMM_EX (HHS)
		- {M: 5504, N: 5504, K: 5504, transposeA: false, transposeB: true, dataType: H, destDataType: H, computeDataType: S}

		BFloat16 Input with Float32 Output

Copy link

Contributor

SwRawMar 26, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Suggested change

	BFloat16Input with Float32Output
	BFloat16input with Float32output

docs/src/reference/precision-support.rst

		- # GEMM_EX (BSS)
		- {M: 4096, N: 4096, K: 4096, transposeA: false, transposeB: true, dataType: B, destDataType: S, computeDataType: S}

		8-bit Integer Operations

Copy link

Contributor

SwRawMar 26, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Suggested change

	8-bitInteger Operations
	8-bitinteger operations

docs/src/reference/precision-support.rst

		- # GEMM_EX (I8II)
		- {M: 4096, N: 4096, K: 4096, transposeA: false, transposeB: true, dataType: I8, destDataType: I, computeDataType: I}

		Mixed F8/B8 Input with Half Precision Output

Copy link

Contributor

SwRawMar 26, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others.Learn more.

Suggested change

	Mixed F8/B8Input withHalf Precision Output
	Mixed F8/B8input withhalf precision output

docs/src/reference/precision-support.rst

		- # GEMM_EX
		- {M: 5504, N: 5504, K: 5504, transposeA: false, transposeB: true, dataType: F8B8, destDataType: H, computeDataType: S}

		Library Logic File Naming