Commit 6948419

gderossi authored and pytorchmergebot committed

Fix scaled_matmul_cuda tests (#169834)

This PR fixes a few test failures in `test_scaled_matmul_cuda.py` by adding Thor to a list of devices not compatible with SM carveout and by updating an SM version check to include all devices with SM >= 10.x instead of just devices with SM == 10.x.

Based on commit history, it looks like the `dprops->major == 10` was just a typo introduced when upgrading to the new `scaled_mm_v2` API, but if it was intentional I can look into alternative fixes to these tests.

Fixes #169833

Pull Request resolved: #169834

Approved by: https://github.com/slayton58

1 parent f73345c, commit 6948419

File tree

2 files changed, 2 additions and 2 deletions

- aten/src/ATen/native/cuda/ScaledBlas.cpp
- test/test_scaled_matmul_cuda.py

`aten/src/ATen/native/cuda/ScaledBlas.cpp`

Lines changed: 1 addition & 1 deletion

Original file line number	Diff line number	Diff line change
`@@ -739,7 +739,7 @@ _scaled_rowwise_rowwise(`
`739`	`739`	`auto dprops = at::cuda::getCurrentDeviceProperties();`
`740`	`740`	`if (((dprops->major < 9 || CUBLAS_VERSION < 120900 || cublasLtGetVersion() < 120900)`
`741`	`741`	`// cuBLAS only supports tiled 1D factor layout for 1D block scaling, no 2D block scales`
`742`		`- || (dprops->major == 10 && (!scale_a.sizes().empty() || !scale_b.sizes().empty())))) {`
	`742`	`+ || (dprops->major >= 10 && (!scale_a.sizes().empty() || !scale_b.sizes().empty())))) {`
`743`	`743`	`TORCH_CHECK_VALUE(out.dtype() == kBFloat16 || out.dtype() == kHalf, "Only bf16 and fp16 high precision output types are supported for row-wise scaling.");`
`744`	`744`	`at::cuda::detail::f8f8bf16_rowwise(`
`745`	`745`	`mat_a,`
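
To make the effect of the one-character change concrete, the condition in the hunk above can be modelled in plain Python. This is only a sketch of the predicate as it appears in the diff, not the ATen implementation: the function name `takes_rowwise_branch` and the flag `match_sm10_only` are hypothetical, and the compile-time `CUBLAS_VERSION` and runtime `cublasLtGetVersion()` checks are collapsed into a single `cublas_version` argument for brevity.

```python
from typing import Sequence

# Version threshold copied from the diff (120900).
MIN_CUBLAS_VERSION = 120900


def takes_rowwise_branch(
    major: int,
    cublas_version: int,
    scale_a_shape: Sequence[int],
    scale_b_shape: Sequence[int],
    *,
    match_sm10_only: bool = False,
) -> bool:
    """Hypothetical model of the `if` condition guarding f8f8bf16_rowwise.

    match_sm10_only=True mimics the old check (`major == 10`);
    the default mimics the fixed check (`major >= 10`).
    """
    # First clause: device or cuBLAS too old.
    too_old = major < 9 or cublas_version < MIN_CUBLAS_VERSION
    # Mirrors `!scale_a.sizes().empty() || !scale_b.sizes().empty()`.
    has_non_scalar_scale = len(scale_a_shape) > 0 or len(scale_b_shape) > 0
    sm_matches = (major == 10) if match_sm10_only else (major >= 10)
    return too_old or (sm_matches and has_non_scalar_scale)


if __name__ == "__main__":
    shape = (128, 1)  # illustrative non-scalar scale shape (an assumption)
    for major in (9, 10, 11, 12):
        old = takes_rowwise_branch(major, 120900, shape, shape, match_sm10_only=True)
        new = takes_rowwise_branch(major, 120900, shape, shape)
        print(f"major={major}: old check -> {old}, new check -> {new}")
```

Under this model, the old and new checks agree for major versions 9 and 10 and differ only for major versions above 10, which is consistent with the commit message's description of the fix.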

`test/test_scaled_matmul_cuda.py`

Lines changed: 1 addition & 1 deletion

Original file line number	Diff line number	Diff line change
`@@ -1790,7 +1790,7 @@ def test_honor_sm_carveout(self) -> None:`
`1790`	`1790`
`1791`	`1791`	`self.assertEqual(no_carveout, no_carveout_again)`
`1792`	`1792`	`capability = torch.cuda.get_device_capability()`
`1793`		`- if capability in {(10, 0), (10, 3), (12, 0), (12, 1)}:`
	`1793`	`+ if capability in {(10, 0), (10, 3), (11, 0), (12, 0), (12, 1)}:`
`1794`	`1794`	`# expected failure`
`1795`	`1795`	`# CUTLASS only supports SM carveout via green contexts on SM100`
`1796`	`1796`	`self.assertEqual(no_carveout, carveout_66)`
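
The test change amounts to a set-membership check on the `(major, minor)` tuple returned by `torch.cuda.get_device_capability()`. A minimal standalone sketch of that check follows; it does not require a GPU, and the names `CARVEOUT_UNSUPPORTED_OLD`, `CARVEOUT_UNSUPPORTED_NEW`, and `expects_carveout_noop` are hypothetical helpers, not identifiers from the test file. The `(11, 0)` entry is the one this commit adds, which the commit message describes as Thor.

```python
from typing import FrozenSet, Tuple

Capability = Tuple[int, int]

# Capability sets copied from the two sides of the diff.
CARVEOUT_UNSUPPORTED_OLD: FrozenSet[Capability] = frozenset(
    {(10, 0), (10, 3), (12, 0), (12, 1)}
)
CARVEOUT_UNSUPPORTED_NEW: FrozenSet[Capability] = CARVEOUT_UNSUPPORTED_OLD | {(11, 0)}


def expects_carveout_noop(
    capability: Capability,
    unsupported: FrozenSet[Capability] = CARVEOUT_UNSUPPORTED_NEW,
) -> bool:
    """Return True when the test expects SM carveout to have no effect.

    In the test, this is the branch that asserts
    `no_carveout == carveout_66` (marked "expected failure").
    """
    return tuple(capability) in unsupported


if __name__ == "__main__":
    for cap in [(9, 0), (10, 0), (11, 0), (12, 1)]:
        print(cap, expects_carveout_noop(cap))
```

In the real test the tuple would come from `torch.cuda.get_device_capability()`; passing it in explicitly here keeps the sketch runnable on machines without CUDA.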

0 commit comments
