S390x: emit new instructions added in z17 #12319

theotherjimmy · 2026-01-12T15:57:09Z

Z17 (arch15) includes some instructions that allow us to encode some more complicated operations in fewer instructions. This PR adds support to cranelift-codegen to emit these newer instructions when appropriate.

Further, Z17 includes a VBLEND instruction that mimics the same instruction on x64. Since this is no longer an x64-exclusive instruction type, I've renamed the appropriate stuff within cranelift codegen to reflect that this is not ISA-specific anymore.

github-actions · 2026-01-12T17:47:34Z

Subscribe to Label Action

cc @cfallin, @fitzgen

Details

This issue or pull request has been labeled: "cranelift", "cranelift:area:aarch64", "cranelift:area:x64", "cranelift:meta", "isle"

Thus the following users have been cc'd because of the following labels:

cfallin: isle
fitzgen: isle

To subscribe or unsubscribe from this label, edit the .github/subscribe-to-label.json configuration file.

Learn more.

alexcrichton · 2026-01-12T20:58:57Z

I'm going to shift review of this over to @uweigand

alexcrichton · 2026-01-12T20:59:25Z

or, well, I can't officially do that, but @uweigand I'm happy to rubber-stamp once you've approved

uweigand

Looks mostly good to me, but see inline comments.

In addition to what is implemented here, we now could implement vector integer division for 32-bit and 64-bit integer vectors - but there is currently no ISLE to even express this.

cranelift/codegen/src/isa/s390x/inst/emit_tests.rs

uweigand · 2026-01-13T14:57:05Z

cranelift/codegen/src/isa/s390x/lower.isle

                      (cmov_imm $I64 (intcc_as_cond (IntCC.Equal)) 0 x)))

+;; Implement `sdiv` for 128-bit integers on z17 (only).
+;; FIXME: integer-overflow check


I think we need to fix this before committing. (Also, there probably should be run-time test validating the correct behaviour like for other divison operations.)

uweigand · 2026-01-13T15:10:19Z

cranelift/codegen/src/isa/s390x/lower.isle

+(rule 16 (lower (has_type (and (vxrs_ext3_enabled) (vr128_ty ty)) (band (band y z) (bnot x))))
+      (vec_eval ty 0b00000010 x y z))
+(rule 17 (lower (has_type (and (vxrs_ext3_enabled) (vr128_ty ty)) (band (band x (bnot y)) z)))
+      (vec_eval ty 0b00000010 y x z))


Not sure what if any canonicalization is done at the ISLE level here, but these four don't cover all possible combinations. E.g. (band x (band (bnot y) z)) is not covered.

I guess a more fundamental question is which combinations we should be covering. For example, why cover and-not with three inputs but not or-not?

I stopped because I realized this would add hundreds of rules to get correct, and ran out of steam pretty quickly. I think it's an open question: what should we encode? what 3-input binary operations are actually used?

uweigand · 2026-01-13T15:13:33Z

cranelift/filetests/filetests/isa/s390x/vec-bitwise-arch15.clif

+; block0: ; offset 0x0
+;   .byte 0xe7, 0x8a
+;   .byte 0x80, 0x02
+;   .byte 0x9f, 0x88


We should also add z17 insns to the disassembler, but that is of course a different patch.

crates/cranelift/src/func_environ.rs

This emits & tests a bunch of instructions: * from Miscellaneous-Instruction-Extensions Facility 4: * CLZ, 64bit * CTZ, 64bit * from Vector-Enhancements Facility 3: * 32x4, 64x2 & 128x1 variants of the following: * Divide * Remainder * 64x2 & 128x1 multiply variants * 128x1 vaiants of: * Compare * CLZ * CTZ * Max * Min * Average * Negation * Evaluate (3-input and, 3-input or, atm) Co-authored-by: Jimmy Brisson <[email protected]>

Now that s390x implements blendv as well, we should refer to the instruction without the x86 prefix.

theotherjimmy requested a review from a team as a code owner January 12, 2026 15:57

theotherjimmy requested review from alexcrichton and removed request for a team January 12, 2026 15:57

theotherjimmy force-pushed the s390x-z17 branch 3 times, most recently from ae1c56a to fac9929 Compare January 12, 2026 16:26

uweigand reviewed Jan 13, 2026

View reviewed changes

uweigand and others added 3 commits January 13, 2026 12:22

s390x: Emit vector blend on z17

1ce6513

Rename x86_blendv to blendv

73c5595

Now that s390x implements blendv as well, we should refer to the instruction without the x86 prefix.

theotherjimmy force-pushed the s390x-z17 branch from fac9929 to 73c5595 Compare January 13, 2026 18:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

S390x: emit new instructions added in z17 #12319

S390x: emit new instructions added in z17 #12319

Uh oh!

theotherjimmy commented Jan 12, 2026

Uh oh!

github-actions bot commented Jan 12, 2026

Uh oh!

alexcrichton commented Jan 12, 2026

Uh oh!

alexcrichton commented Jan 12, 2026

Uh oh!

uweigand left a comment

Uh oh!

Uh oh!

uweigand Jan 13, 2026

Uh oh!

uweigand Jan 13, 2026

Uh oh!

theotherjimmy Jan 13, 2026

Uh oh!

uweigand Jan 13, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

S390x: emit new instructions added in z17 #12319

Are you sure you want to change the base?

S390x: emit new instructions added in z17 #12319

Uh oh!

Conversation

theotherjimmy commented Jan 12, 2026

Uh oh!

github-actions bot commented Jan 12, 2026

Subscribe to Label Action

Uh oh!

alexcrichton commented Jan 12, 2026

Uh oh!

alexcrichton commented Jan 12, 2026

Uh oh!

uweigand left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

uweigand Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

uweigand Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

theotherjimmy Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

uweigand Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants