
arm64: Refactor mov/movprfx for embedded masked operations#126398

Open
ylpoonlg wants to merge 4 commits into dotnet:main from ylpoonlg:github-movprfx_refactor_2

Conversation

@ylpoonlg
Contributor

@ylpoonlg ylpoonlg commented Apr 1, 2026

This PR is the second part of #115508, following #123717.

The mov/movprfx logic for embedded masked operations is moved from codegen into the emit functions, in a similar way to #123717. The main difference is that embedded masked operations can use a predicated movprfx (zeroing or merging), depending on the false argument of the wrapped conditional select. This information is passed into the emitInsSve_Mov helper via a new option, mopt, which defaults to INS_SVE_MOV_OPTS_UNPRED.

@dotnet/arm64-contrib @a74nh @dhartglassMSFT

* Add an option for SVE mov/movprfx to differentiate between unpredicated,
  zeroing, and merging operations in the emitInsSve_Mov helper function.

* Clean up codegen for embedded masked operations.

* Fix SIMD&FP scalar register name in SVE emit display.
@github-actions github-actions bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Apr 1, 2026
@dotnet-policy-service dotnet-policy-service bot added the community-contribution Indicates that the PR has been added by a community member label Apr 1, 2026
@dotnet-policy-service
Contributor

Tagging subscribers to this area: @JulieLeeMSFT, @jakobbotsch
See info in area-owners.md if you want to be subscribed.

@dhartglassMSFT
Contributor

Hi @a74nh are you physically able to kick off a "jitstressregs" pipeline? Or, run locally on your side?

(Specifically referring to the test scenario that discovered this bug #126379 )

It should at least be possible to run the tests locally; it's controlled by DOTNET* env variables.

We've had at least a couple of jitstress-discovered issues in arm64 codegen over the last few weeks; I'm hoping that if that leg is run proactively we can cut down on those.

(That said, it shouldn't be run until #126434 goes in.)

@a74nh
Contributor

a74nh commented Apr 7, 2026

> Hi @a74nh are you physically able to kick off a "jitstressregs" pipeline? Or, run locally on your side?
>
> (Specifically referring to the test scenario that discovered this bug #126379 )
>
> It should at least be possible to run the tests locally; it's controlled by DOTNET* env variables.
>
> We've had at least a couple of jitstress-discovered issues in arm64 codegen over the last few weeks; I'm hoping that if that leg is run proactively we can cut down on those.
>
> (That said, it shouldn't be run until #126434 goes in.)

We can't run the pipeline on GitHub as we don't have the permissions.

We do have a script that Kunal wrote, which just runs the command line you give it using many different stress scenarios.

Running the hwintrinsic tests using it should be good enough.

@ylpoonlg - could you rebase and test all the hwintrinsics using that script, please?

@ylpoonlg
Contributor Author

ylpoonlg commented Apr 7, 2026

Fixed a similar issue to #126434. The jitstress tests now pass when running the hwintrinsic tests with the script.

@dhartglassMSFT
Contributor

Do you have an SPMI asmdiffs result like for the first PR?

Also, I had to use your jitstress changes to fix a higher pri issue last week, so you'll probably get merge conflicts : /

@ylpoonlg
Contributor Author

> Do you have an SPMI asmdiffs result like for the first PR?

Some examples from the SPMI asmdiffs summary:

```diff
@@ -20,16 +20,15 @@ G_M37464_IG01:        ; bbWeight=1, gcrefRegs=0000 {}, byrefRegs=0000 {}, byref,
 G_M37464_IG02:        ; bbWeight=1, gcrefRegs=0000 {}, byrefRegs=0000 {}, byref
             ptrue   p0.s
             movi    v16.4s, #0
-            movprfx z0, z0
             addp    z0.s, p0/m, z0.s, z1.s
             sel     z0.s, p0, z0.s, z16.s
-                                               ;; size=20 bbWeight=1 PerfScore 7.50
+                                               ;; size=16 bbWeight=1 PerfScore 5.50
```
Removes unnecessary movprfx where destination and source registers are the same.

```diff
@@ -49,7 +49,7 @@ G_M63337_IG02:        ; bbWeight=1, gcrefRegs=80000 {x19}, byrefRegs=0000 {}, by
             movk    x4, #0xD1FFAB1E LSL #16
             movk    x4, #0xD1FFAB1E LSL #32
             ldr     w1, [x4]
-            mov     x4, x1
+            mov     w4, w1
             sqincb  x4, w4, vl8, mul #2
             mov     x0, x19
             ; gcrRegs +[x0]
```

A small change to the scalar *qinc/*qdec instructions: they read the 32-bit source register and write a 64-bit result to the same register, so semantically only the 32-bit input needs to be moved. This doesn't affect the behavior or results, and is done to generalize the logic for the emit size of mov instructions.

@ylpoonlg
Contributor Author

ylpoonlg commented Apr 15, 2026

> Also, I had to use your jitstress changes to fix a higher pri issue last week, so you'll probably get merge conflicts : /

Seems to be clean with main so far, but I can do a rebase anyway. Sorry for breaking the jitstress tests and thanks for fixing them.
