Winch: packed integer basic arithmetic for x64 #10147

MarinPostma · 2025-01-29T13:03:24Z

Implements the following instructions for Winch x64:

i8x16.add
i8x16.add_sat_u
i8x16.add_sat_s
i16x8.add
i16x8.add_sat_u
i16x8.add_sat_s
i32x4.add
i64x2.add
i8x16.sub
i8x16.sub_sat_u
i8x16.sub_sat_s
i16x8.sub
i16x8.sub_sat_u
i16x8.sub_sat_s
i32x4.sub
i64x2.sub
i16x8.mul
i32x4.mul
i64x2.mul
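
For readers less familiar with the saturating variants in this list, here is a minimal scalar sketch of the lane-wise semantics for the 8-bit case. It models behaviour only and is not the code Winch emits; the function names are hypothetical and do not appear in the PR.

```rust
/// Model of `i8x16.add`: 16 independent 8-bit lanes, each wrapping on overflow.
fn i8x16_add(a: [u8; 16], b: [u8; 16]) -> [u8; 16] {
    std::array::from_fn(|i| a[i].wrapping_add(b[i]))
}

/// Model of `i8x16.add_sat_u`: unsigned lanes clamp at 255 instead of wrapping.
fn i8x16_add_sat_u(a: [u8; 16], b: [u8; 16]) -> [u8; 16] {
    std::array::from_fn(|i| a[i].saturating_add(b[i]))
}

/// Model of `i8x16.add_sat_s`: signed lanes clamp to the range -128..=127.
fn i8x16_add_sat_s(a: [i8; 16], b: [i8; 16]) -> [i8; 16] {
    std::array::from_fn(|i| a[i].saturating_add(b[i]))
}

/// Model of `i8x16.sub_sat_u`: unsigned lanes clamp at 0 instead of wrapping.
fn i8x16_sub_sat_u(a: [u8; 16], b: [u8; 16]) -> [u8; 16] {
    std::array::from_fn(|i| a[i].saturating_sub(b[i]))
}
```

The 16-bit variants follow the same pattern with eight lanes. The list has no saturating forms for the 32-bit and 64-bit lanes, so only the wrapping model applies there.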

MarinPostma · 2025-01-29T17:11:14Z

winch/codegen/src/masm.rs

+
+    /// Perform a vector add between `lsh` and `rhs`, placing the result in `dst`, where each lane
+    /// is interpreted to be `size` long.
+    fn v128_add(


Naming-wise, I'm not completely sure what to call these. Maybe vector_add is more appropriate?

I think the naming is fine here; as far as I can tell, we don't have any other vector naming convention.

saulecabrera · 2025-01-29T17:19:51Z

I can take this review.

saulecabrera · 2025-01-30T15:14:15Z

winch/codegen/src/isa/x64/masm.rs

+        let mul_avx512 = |this: &mut Self, op| {
+            this.ensure_has_avx512vl()?;
+            this.ensure_has_avx512dq()?;
+            this.asm.xmm_rm_rvex3(op, lhs, rhs, dst);
+            Ok(())


I don't think we should make this a hard requirement, given that our baseline is AVX.

Intel suggests that there's no penalty for mixing AVX with AVX512 instructions, so we could emit AVX512 instructions when they are available. When they aren't, however, we still need to emit a fallback to avoid bumping our baseline for this operation. For reference: https://github.com/bytecodealliance/wasmtime/blob/main/cranelift/codegen/src/isa/x64/lower.isle#L1121

My bad, I misunderstood your DM about what we expected to support. I have ported Cranelift's fallback implementation.

Hold on, I just found a bug.
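
For context on the fallback discussed in this thread: without AVX512, x64 has no packed 64-bit multiply, so an `i64x2.mul` lane is usually assembled from 32x32 to 64-bit multiplies. The sketch below is a scalar model of that decomposition, which is my understanding of the shape of the Cranelift lowering linked above. It is not the code from this PR, and all names in it are hypothetical.

```rust
/// Scalar model of one 64-bit lane of the fallback. With
/// lhs = lhs_hi * 2^32 + lhs_lo and rhs = rhs_hi * 2^32 + rhs_lo:
///
///   lhs * rhs mod 2^64
///     = lhs_lo * rhs_lo + ((lhs_hi * rhs_lo + lhs_lo * rhs_hi) << 32)
///
/// The lhs_hi * rhs_hi term is scaled by 2^64 and so drops out entirely.
fn mul_lane_via_32bit_halves(lhs: u64, rhs: u64) -> u64 {
    let mask = 0xFFFF_FFFFu64;
    let (lhs_lo, lhs_hi) = (lhs & mask, lhs >> 32);
    let (rhs_lo, rhs_hi) = (rhs & mask, rhs >> 32);

    // Each product has 32-bit operands, so it always fits in 64 bits.
    // The sum of the cross terms may wrap; that is harmless because only
    // its low 32 bits survive the shift below.
    let cross = (lhs_hi * rhs_lo).wrapping_add(lhs_lo * rhs_hi);
    let low = lhs_lo * rhs_lo;

    low.wrapping_add(cross << 32)
}

/// Apply the lane model to both lanes of an i64x2 value.
fn i64x2_mul_fallback(lhs: [u64; 2], rhs: [u64; 2]) -> [u64; 2] {
    [
        mul_lane_via_32bit_halves(lhs[0], rhs[0]),
        mul_lane_via_32bit_halves(lhs[1], rhs[1]),
    ]
}
```

Each lane should agree with `u64::wrapping_mul`, which makes that a convenient oracle when testing a fallback like this against the AVX512 path.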

saulecabrera · 2025-01-30T15:14:46Z

winch/codegen/src/masm.rs

+
+    /// Perform a vector add between `lsh` and `rhs`, placing the result in `dst`, where each lane
+    /// is interpreted to be `size` long.
+    fn v128_add(


I think the naming is fine here; as far as I can tell, we don't have any other vector naming convention.

saulecabrera

LGTM, thanks!

MarinPostma requested review from a team as code owners January 29, 2025 13:03

MarinPostma requested review from fitzgen and removed request for a team January 29, 2025 13:03

saulecabrera requested review from saulecabrera and removed request for a team and fitzgen January 29, 2025 17:19

saulecabrera reviewed Jan 30, 2025

MarinPostma force-pushed the packed-integer-arithmetic branch 5 times, most recently from 8e99df3 to 6d9af62, January 31, 2025 18:01

MarinPostma added 10 commits January 31, 2025 23:35

8aa9cc5 packed integer add
7170e63 packed integer sub
e5b677c packed integer mul
a05f46e packed integer saturating add
87c6759 packed integer saturating sub
cec7a09 fix missing error codes for avx
c783cda change size to lane_width
afa5294 fmt
55046ec i64x2 mul fallback
8a9cde4 add fallback test.

MarinPostma force-pushed the packed-integer-arithmetic branch from da32743 to 8a9cde4, January 31, 2025 22:39

MarinPostma mentioned this pull request Feb 3, 2025

Winch: implement v128 neg and shifts for x64 #10170

Merged

saulecabrera approved these changes Feb 3, 2025

saulecabrera added this pull request to the merge queue Feb 3, 2025

Merged via the queue into bytecodealliance:main with commit 70c93c6 Feb 3, 2025
39 checks passed
