[AMDGPU] Fix unreachable reg bit width #122107

Shoreshen · 2025-01-08T13:17:25Z

Add register class bit width for SReg_256_XNULL and SReg_128_XNULL

github-actions · 2025-01-08T13:17:44Z

Thank you for submitting a Pull Request (PR) to the LLVM Project!

This PR will be automatically labeled and the relevant teams will be notified.

If you wish to, you can add reviewers by using the "Reviewers" section on this page.

If this is not working for you, it is probably because you do not have write permissions for the repository. In which case you can instead tag reviewers by name in a comment by using @ followed by their GitHub username.

If you have received no comments on your PR for a week, you can request a review by "ping"ing the PR by adding a comment “Ping”. The common courtesy "ping" rate is once a week. Please remember that you are asking for valuable time from other developers.

If you have further questions, they may be answered by the LLVM GitHub User Guide.

You can also ask questions in a comment on this PR, on the LLVM Discord or on the forums.

llvmbot · 2025-01-08T13:18:23Z

@llvm/pr-subscribers-backend-amdgpu

Author: None (Shoreshen)

Changes

Add register class bit width for SReg_256_XNULL and SReg_128_XNULL

Full diff: /~https://github.com/llvm/llvm-project/pull/122107.diff

1 Files Affected:

(modified) llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp (+2)

diff --git a/llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp b/llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
index 319ada3b27bd5a..d9c0aa300855fc 100644
--- a/llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
+++ b/llvm/lib/Target/AMDGPU/Utils/AMDGPUBaseInfo.cpp
@@ -2487,6 +2487,7 @@ unsigned getRegBitWidth(unsigned RCID) {
   case AMDGPU::AReg_128_Align2RegClassID:
   case AMDGPU::AV_128RegClassID:
   case AMDGPU::AV_128_Align2RegClassID:
+  case AMDGPU::SReg_128_XNULLRegClassID:
     return 128;
   case AMDGPU::SGPR_160RegClassID:
   case AMDGPU::SReg_160RegClassID:
@@ -2523,6 +2524,7 @@ unsigned getRegBitWidth(unsigned RCID) {
   case AMDGPU::AReg_256_Align2RegClassID:
   case AMDGPU::AV_256RegClassID:
   case AMDGPU::AV_256_Align2RegClassID:
+  case AMDGPU::SReg_256_XNULLRegClassID:
     return 256;
   case AMDGPU::SGPR_288RegClassID:
   case AMDGPU::SReg_288RegClassID:

arsenm

Tests?

Shoreshen · 2025-01-08T13:47:26Z

Tests?

Hi there is another PR depending on this so I created the PR first. Will add tests latter (need to minimize)

…t-width-for-SReg_256_XNULL-and-SReg_128_XNULL

…ge main

arsenm · 2025-01-10T12:25:15Z

llvm/test/CodeGen/AMDGPU/add-xnull-regclass-bitwidth.mir

+# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py UTC_ARGS: --version 5
+# RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx90a -run-pass=early-machinelicm -run-pass=postmisched -o - %s | FileCheck %s
+---
+name:            test_xnull_256


Also the 128 case.

Hi @arsenm , to trigger the unreachable during postmisched pass it has to be MIMG instruction, but I cannot find MIMG instruction uses SReg_128_XNULL

I also tried to find other places that may use the bit width function:

used in selectCOPY

used in foldOperand, but for immediate only

used in canInsertSelect test if it is ok to insert a select instruction

used in buildSpillLoadStore but it seems like the target registers are all caller/callee saved regs

used in printRegularOperand but only for Op.isDFPImm()

used in getRegOperandSize, which is used in validateMIMGAddrSize and validateMIMGDataSize

The only possibility is getRegSplitParts, but it is used by many function, I need help on this since I do not familiar with the fucntion.

I would expect the selectCOPY case would be most straightforward

Hi @arsenm , to successfully trigger bit width function in selectCOPY, the dst operand must:

pass isVCC function for the dst register of COPY

fails isVCC function for src register of the COPY

I think the bit width of src and dst must be the same, otherwise the copy mismatch type verification error will trigger

Thus to use SReg_128_XNULL as src reg class, the dst must also be 128 bit width

To pass the isVCC function for dst register, it must be:

Non physical register (which I cannot simply use $vcc as dst)

If it is assigned register class, the bit width must be 1

If it is register bank, the bank id must equals to AMDGPU::VCCRegBankID

For COPY not being assigned for reg class, I think the dst of COPY must not be used for any instruction.

Because each instruction's each input should have a reg class bind with it, and ti will try to assign reg class accordingly.

But if COPY is not used by any instruction, then it will not be selected, since it most probably will not pass !isTriviallyDead(MI, MRI) function check.

So it seems like I ran out of my ways to produce a 128 case, would there be any other possibilities?

Thanks a lot!

BTW, I also tried to remove the AMDGPU::getRegBitWidth(SrcRC->getID()) == 16 in selectCOPY and tried to find any test failed after then. but it seems even without the judgement, all tests passed.

The original problem was triggered by SReg_256_XNULL. So maybe we just fix it for that reg class?

arsenm · 2025-01-10T12:25:43Z

llvm/test/CodeGen/AMDGPU/add-xnull-regclass-bitwidth.mir

@@ -0,0 +1,12 @@
+# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py UTC_ARGS: --version 5
+# RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx90a -run-pass=early-machinelicm -run-pass=postmisched -o - %s | FileCheck %s


Can do this in one run-pass with , separated pass names. but I'm not sure why you need to run 2 passes, or how either ne stresses this function

arsenm · 2025-01-10T13:01:11Z

llvm/test/lit.cfg.py

@@ -463,7 +463,7 @@ def have_cxx_shared_library():
        print("could not exec llvm-readobj")
        return False

-    readobj_out = readobj_cmd.stdout.read().decode("ascii")
+    readobj_out = readobj_cmd.stdout.read().decode("utf-8")


unrelated change

…56_XNULL-and-SReg_128_XNULL

arsenm · 2025-01-22T03:04:17Z

llvm/test/CodeGen/AMDGPU/add-xnull-regclass-bitwidth.mir

@@ -0,0 +1,15 @@
+# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py UTC_ARGS: --version 5


Rename test file

github-actions · 2025-01-22T03:07:25Z

@Shoreshen Congratulations on having your first Pull Request (PR) merged into the LLVM Project!

Your changes will be combined with recent changes from other authors, then tested by our build bots. If there is a problem with a build, you may receive a report in an email or a comment on this PR.

Please check whether problems have been caused by your change specifically, as the builds can include changes from many authors. It is not uncommon for your change to be included in a build that fails due to someone else's changes, or infrastructure issues.

How to do this, and the rest of the post-merge process, is covered in detail here.

If your change does cause a problem, it may be reverted, or you can revert it yourself. This is a normal part of LLVM development. You can fix your changes and open a new PR to merge them again.

If you don't get any reports, no action is required from you. Your changes are working as expected, well done!

fix unreachable reg bit width. need add test case latter

cfbc8ad

llvmbot added the backend:AMDGPU label Jan 8, 2025

arsenm reviewed Jan 8, 2025

View reviewed changes

ronlieb approved these changes Jan 8, 2025

View reviewed changes

Shoreshen changed the title ~~Fix unreachable reg bit width~~ [AMDGPU] Fix unreachable reg bit width Jan 9, 2025

Shoreshen added 3 commits January 10, 2025 17:24

Merge remote-tracking branch 'origin/main' into Add-register-class-bi…

a083e08

…t-width-for-SReg_256_XNULL-and-SReg_128_XNULL

add test case, hard to find case forind case for SReg_128_XNULL & mer…

163fcc1

…ge main

fix lit.cfg.py

89e1047

arsenm reviewed Jan 10, 2025

View reviewed changes

run single pass

b1eddc3

arsenm reviewed Jan 10, 2025

View reviewed changes

Shoreshen added 2 commits January 10, 2025 21:05

fix lit

64378c4

Merge branch 'llvm:main' into Add-register-class-bit-width-for-SReg_2…

61cde04

…56_XNULL-and-SReg_128_XNULL

Shoreshen requested a review from arsenm January 16, 2025 05:25

Shoreshen added 2 commits January 20, 2025 08:54

Merge branch 'llvm:main' into Add-register-class-bit-width-for-SReg_2…

b971e70

…56_XNULL-and-SReg_128_XNULL

Merge branch 'llvm:main' into Add-register-class-bit-width-for-SReg_2…

5706df1

…56_XNULL-and-SReg_128_XNULL

Shoreshen mentioned this pull request Jan 21, 2025

Request Commit Access For Shoreshen #119686

Open

add FIXME in test case

6b9cfce

arsenm approved these changes Jan 22, 2025

View reviewed changes

arsenm reviewed Jan 22, 2025

View reviewed changes

Rename test

614c83d

arsenm merged commit e8811ad into llvm:main Jan 22, 2025
5 of 7 checks passed

Shoreshen deleted the Add-register-class-bit-width-for-SReg_256_XNULL-and-SReg_128_XNULL branch January 22, 2025 07:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AMDGPU] Fix unreachable reg bit width #122107

[AMDGPU] Fix unreachable reg bit width #122107

Shoreshen commented Jan 8, 2025

github-actions bot commented Jan 8, 2025

llvmbot commented Jan 8, 2025

arsenm left a comment

Shoreshen commented Jan 8, 2025 •

edited

Loading

arsenm Jan 10, 2025

Shoreshen Jan 10, 2025 •

edited

Loading

arsenm Jan 10, 2025

Shoreshen Jan 13, 2025 •

edited

Loading

jwanggit86 Jan 16, 2025

arsenm Jan 10, 2025

arsenm Jan 10, 2025

arsenm Jan 22, 2025

github-actions bot commented Jan 22, 2025

		@@ -0,0 +1,12 @@
		# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py UTC_ARGS: --version 5
		# RUN: llc -mtriple=amdgcn-amd-amdhsa -mcpu=gfx90a -run-pass=early-machinelicm -run-pass=postmisched -o - %s \| FileCheck %s

		@@ -0,0 +1,15 @@
		# NOTE: Assertions have been autogenerated by utils/update_mir_test_checks.py UTC_ARGS: --version 5

[AMDGPU] Fix unreachable reg bit width #122107

[AMDGPU] Fix unreachable reg bit width #122107

Conversation

Shoreshen commented Jan 8, 2025

github-actions bot commented Jan 8, 2025

llvmbot commented Jan 8, 2025

arsenm left a comment

Choose a reason for hiding this comment

Shoreshen commented Jan 8, 2025 • edited Loading

arsenm Jan 10, 2025

Choose a reason for hiding this comment

Shoreshen Jan 10, 2025 • edited Loading

Choose a reason for hiding this comment

arsenm Jan 10, 2025

Choose a reason for hiding this comment

Shoreshen Jan 13, 2025 • edited Loading

Choose a reason for hiding this comment

jwanggit86 Jan 16, 2025

Choose a reason for hiding this comment

arsenm Jan 10, 2025

Choose a reason for hiding this comment

arsenm Jan 10, 2025

Choose a reason for hiding this comment

arsenm Jan 22, 2025

Choose a reason for hiding this comment

github-actions bot commented Jan 22, 2025

Shoreshen commented Jan 8, 2025 •

edited

Loading

Shoreshen Jan 10, 2025 •

edited

Loading

Shoreshen Jan 13, 2025 •

edited

Loading