Use fused multiply-add instructions when available by JSorngard · Pull Request #268 · JSorngard/lambert_w

JSorngard · 2025-11-07T22:00:55Z

This PR makes the polynomial function use fused multiply-add instructions when the target CPU supports them. This speeds up function execution slightly in those cases.

JSorngard · 2025-11-07T22:02:33Z

This means some tests started failing, because the exact errors have been shifted around by the difference in round-off error.

As a result they were made slightly less stringent.

JSorngard · 2025-11-07T22:06:24Z

In the original Fortran code by Fukushima the compiler is allowed to pick which instructions to use. E.g. if the user sets "-march=native" it may use fused multiply-add instructions. As a result their use shouldn't conflict with how the algorithm is set up. Though it is possible that the minimax algorithm that determined the coefficients for the polynomials would have determined different coefficients if it ran with fma instrucitons (I don't know if it did).

JSorngard · 2025-11-11T12:34:53Z

I will need to determine if the fma instructions break the minimax approximation before I merge this.

JSorngard added 3 commits November 7, 2025 19:52

Use fused multiply-add instructions when available

e992b14

Re-add the max_relative to the readme test

5897f87

cargo fmt

721fbcc

JSorngard added the enhancement New feature or request label Nov 7, 2025

Add changes to log

dc3b597

JSorngard added the tests Related to the test suite of the library label Nov 7, 2025

JSorngard self-assigned this Nov 27, 2025

Merge branch 'main' into mul_add

0b85045

JSorngard marked this pull request as draft December 4, 2025 20:29

JSorngard removed their assignment Dec 10, 2025

JSorngard added 2 commits December 13, 2025 20:58

Merge branch 'main' into mul_add

785fae2

Move the mul_add note to the latest unreleased version

6411954

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use fused multiply-add instructions when available#268

Use fused multiply-add instructions when available#268
JSorngard wants to merge 7 commits intomainfrom
mul_add

JSorngard commented Nov 7, 2025 •

edited

Loading

Uh oh!

JSorngard commented Nov 7, 2025 •

edited

Loading

Uh oh!

JSorngard commented Nov 7, 2025 •

edited

Loading

Uh oh!

JSorngard commented Nov 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

JSorngard commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JSorngard commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JSorngard commented Nov 7, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

JSorngard commented Nov 11, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

JSorngard commented Nov 7, 2025 •

edited

Loading

JSorngard commented Nov 7, 2025 •

edited

Loading

JSorngard commented Nov 7, 2025 •

edited

Loading