trunc: Use an assembly implementation on i586 by tgross35 · Pull Request #1152 · rust-lang/compiler-builtins

tgross35 · 2026-03-30T21:34:15Z

The trunc implementation uses integer operations so currently works
fine on i586. However, we already have the other three easy operations
based on frndint, so add trunc and complete the set.

ci: skip-extensive

tgross35 · 2026-03-30T21:34:26Z

Based on #1142 to avoid conflicts.

quaternic · 2026-04-05T12:54:49Z

I don't think we should have this just for the sake of consistency. It has strictly worse performance.

tgross35 · 2026-04-07T02:03:04Z

I don't think we should have this just for the sake of consistency. It has strictly worse performance.

What makes this slow - is frndint that much slower than soft ops? Looking at https://rust.godbolt.org/z/s6GjGsf6E I have no idea whether the latency of 100 is correct or just a worst case estimate since I can't find it cited anywhere.

Any idea why LLVM inserts the wait and why eax gets pushed? Thought that should be caller-saved.

The `trunc` implementation uses integer operations so currently works fine on i586. However, we already have the other three easy operations based on `frndint`, so add `trunc` and complete the set.

rustbot · 2026-04-07T05:06:30Z

This PR was rebased onto a different main commit. Here's a range-diff highlighting what actually changed.

Rebasing is a normal part of keeping PRs up to date, so no action is needed—this note is just to help reviewers.

quaternic · 2026-04-07T15:40:50Z

What makes this slow - is frndint that much slower than soft ops? Looking at https://rust.godbolt.org/z/s6GjGsf6E I have no idea whether the latency of 100 is correct or just a worst case estimate since I can't find it cited anywhere.

I'll admit I only tested by comparing against the x86-64 implementation. The 32-bit code does look more complex.

Measuring frndint latency (on https://en.wikipedia.org/wiki/Nehalem_(microarchitecture)):

|x| in [0, 2^63) -> ~21 cycles
|x| in [2^63, Inf) -> ~40 cycles
Inf -> ~230 cycles
NaN -> ~250 cycles

Looks like Agner Fog does provide measurements for x87 instructions too, see 4. Instruction tables in:
https://www.agner.org/optimize/#manuals

For Nehalem the listed latency for frndint is 22, so I'll assume the measurements don't consider potential input dependence, but AMD seems to have had a faster implementation since 2011. Intel does not.

tgross35 mentioned this pull request Mar 30, 2026

roundeven: Use an assembly implementation on i586 #1142

Merged

tgross35 force-pushed the i586-trunc-asm branch from 4b15979 to a0d2380 Compare March 30, 2026 22:23

This comment has been minimized.

Sign in to view

tgross35 force-pushed the i586-trunc-asm branch 3 times, most recently from e37a067 to 5aa3f86 Compare April 3, 2026 21:30

This comment has been minimized.

Sign in to view

tgross35 force-pushed the i586-trunc-asm branch from 5aa3f86 to e83cf4a Compare April 3, 2026 21:59

This comment has been minimized.

Sign in to view

trunc: Use an assembly implementation on i586

bc70120

The `trunc` implementation uses integer operations so currently works fine on i586. However, we already have the other three easy operations based on `frndint`, so add `trunc` and complete the set.

tgross35 force-pushed the i586-trunc-asm branch from e83cf4a to bc70120 Compare April 7, 2026 05:06

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

trunc: Use an assembly implementation on i586#1152

trunc: Use an assembly implementation on i586#1152
tgross35 wants to merge 1 commit intorust-lang:mainfrom
tgross35:i586-trunc-asm

tgross35 commented Mar 30, 2026

Uh oh!

tgross35 commented Mar 30, 2026

Uh oh!

This comment has been minimized.

This comment has been minimized.

quaternic commented Apr 5, 2026

Uh oh!

This comment has been minimized.

tgross35 commented Apr 7, 2026

Uh oh!

rustbot commented Apr 7, 2026

Uh oh!

quaternic commented Apr 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

tgross35 commented Mar 30, 2026

Uh oh!

tgross35 commented Mar 30, 2026

Uh oh!

This comment has been minimized.

This comment has been minimized.

quaternic commented Apr 5, 2026

Uh oh!

This comment has been minimized.

tgross35 commented Apr 7, 2026

Uh oh!

rustbot commented Apr 7, 2026

Uh oh!

quaternic commented Apr 7, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants