feat: add `Float.toBits` and `Float.fromBits` #6094

leodemoura · 2024-11-15T19:23:05Z

This PR adds raw transmutation of floating-point numbers to and from `UInt64`. Floats and UInts share the same endianness across all supported platforms. The IEEE 754 standard precisely specifies the bit layout of floats. Note that `Float.toBits` is distinct from `Float.toUInt64`, which attempts to preserve the numeric value rather than the bitwise value.

closes #6071
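
To make the distinction in the description concrete, here is a minimal sketch. It assumes a toolchain that includes this PR and uses the function names from the PR title; the comments describe the IEEE 754 encoding of `1.0` rather than literal tool output.

```lean
-- 1.0 is encoded with sign bit 0, biased exponent 0x3FF and a zero fraction,
-- so its raw bit pattern is 0x3FF0000000000000.
#eval (1.0 : Float).toBits

-- `toUInt64` converts the numeric value instead, so 1.0 becomes the integer 1.
#eval (1.0 : Float).toUInt64
```

The two results differ because one reinterprets the 64 bits and the other converts the value.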

seanmcl

Wow that was fast. Thanks!

digama0 · 2024-11-15T19:44:05Z

I don't think this is sound unless you clear NaN payloads first, or take a proof that the float is a canonical NaN. Float implementations are nondeterministic in NaN payload bits, and this violates the expectation that opaque functions are functional.

leanprover-community-bot · 2024-11-15T19:46:29Z

Mathlib CI status (docs):

❗ Batteries/Mathlib CI will not be attempted unless your PR branches off the `nightly-with-mathlib` branch. Try `git rebase 688ee4c88722ca191981246f7732f25caed81cac --onto 9a8543347796e52070ff7936661ae48fcebfea60`. (2024-11-15 19:46:29)

seanmcl · 2024-11-15T19:49:33Z

> I don't think this is sound unless you clear NaN payloads first

Canonicalization would be costly. Is what's missing that no pair of NaNs should compare equal, even with identical bit patterns?

digama0 · 2024-11-15T20:18:06Z

> > I don't think this is sound unless you clear NaN payloads first
>
> Canonicalization would be costly. Is what's missing that no pair of NaNs should compare equal, even with identical bit patterns?

The opposite: two NaNs with different payloads should be equal according to Lean's `=` relation. This is impossible if you have `Float.toBits`, because `UInt64` obviously can distinguish them, unless you make this function `unsafe`. Effectively, `Float` is really a quotient type and `toBits` is a variation on `Quot.unquot`.

seanmcl · 2024-11-15T20:21:10Z

Why should they be equal? IEEE-754 semantics is encodable in Lean.

digama0 · 2024-11-15T20:25:25Z

If NaNs are not treated as Lean-equal, then you have much worse consequences: `Float.add` and friends cannot be functions of type `Float -> Float -> Float` at all, because they can produce different results given the same input, which is not something Lean functions can do (provably). Taking a quotient makes the nondeterminism permitted by the IEEE spec unobservable in Lean, which restores soundness, but at the cost of making `Float.toBits` unsafe unless it targets not `UInt64` itself but a quotient of it (where the quotient relation is "these two bit patterns are equal, or they are both encodings of NaN").
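
The quotient relation described above can be sketched in Lean roughly as follows. The names `isNaNBits`, `bitsRel` and `FloatBits` are hypothetical and are not part of this PR; the NaN test assumes the standard binary64 layout (11 exponent bits, 52 fraction bits).

```lean
-- Hypothetical helper: `b` encodes a NaN when all 11 exponent bits are set
-- and the 52-bit fraction is nonzero.
def isNaNBits (b : UInt64) : Bool :=
  ((b >>> 52) &&& 0x7FF) == 0x7FF && (b &&& 0x000FFFFFFFFFFFFF) != 0

-- The relation quoted above: equal bit patterns, or both NaN encodings.
def bitsRel (a b : UInt64) : Prop :=
  a = b ∨ (isNaNBits a = true ∧ isNaNBits b = true)

-- Hypothetical target type for a `toBits` that cannot observe NaN payloads.
def FloatBits : Type := Quot bitsRel

-- Any two NaN encodings become equal once mapped into the quotient.
example (a b : UInt64) (ha : isNaNBits a = true) (hb : isNaNBits b = true) :
    Quot.mk bitsRel a = Quot.mk bitsRel b :=
  Quot.sound (Or.inr ⟨ha, hb⟩)
```

A function out of `FloatBits` must respect `bitsRel`, so no consumer can tell two NaN payloads apart.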

IMO the most pragmatic solution which solves people's immediate needs here is to just provide this function as `unsafe` and let people wrap it with a safe function if they decide they don't want to care about this issue.
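
As a rough illustration only, one way a wrapper could remain logically pure is to clear NaN payloads before exposing the bits, as suggested earlier in the thread. In this sketch `rawBits` is a stand-in parameter for the unsafe primitive and `toBitsCanonical` is a hypothetical name; neither is taken from the PR.

```lean
-- `rawBits` stands in for the unsafe primitive. It is a parameter here because
-- this sketch does not assume any particular name for that primitive.
def toBitsCanonical (rawBits : Float → UInt64) (x : Float) : UInt64 :=
  -- Every NaN maps to one fixed quiet-NaN pattern, so payload bits never leak.
  if x.isNaN then 0x7FF8000000000000 else rawBits x
```

The extra `isNaN` check is the canonicalization cost raised earlier in the discussion.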

nomeata · 2024-11-15T20:55:00Z

> I don't think this is sound

The function is defined as `opaque`. Doesn't that mean that logically, it's an unspecified function, and soundness is not under threat? Or am I missing something here?

digama0 · 2024-11-15T21:32:02Z

Even `opaque` functions have logical constraints, because you can prove for any Lean function that `x = y -> f x = f y`. In this case, the requirement is that when you call a Lean function with the same input you get the same output every time, i.e. it's a logically pure function. This is not true if the underlying computation is nondeterministic. IEEE-754 is generally bit-precise, but it is deliberately underspecified around NaN payload bits, and a combination of compiler optimizations and hardware behavior means that it is possible to observe actual nondeterminism in practice, although it requires some work to get it to appear directly in tests.
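
The congruence property mentioned above can be stated directly in Lean. This is a minimal sketch in which `f` is an arbitrary function, so it covers `opaque` definitions as well:

```lean
-- Holds for every function `f`, whether or not its body is visible:
-- equal inputs must give equal outputs.
example (f : Float → UInt64) (x y : Float) (h : x = y) : f x = f y :=
  congrArg f h
```

If two NaNs with different payloads are equal as `Float` values, this forces any `Float → UInt64` function to agree on them, which a raw bit cast cannot guarantee.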

nomeata · 2024-11-15T21:59:54Z

OK, I think I get the issue now. Hmm, tricky.

shigoel · 2024-11-15T22:29:40Z

This may be of interest: double-float support in the ACL2 theorem prover:
https://www.cs.utexas.edu/~moore/publications/double-float.pdf

seanmcl · 2024-11-15T23:20:52Z

> `Float.add` and friends cannot be functions of type `Float -> Float -> Float` at all, because they can produce different results given the same input

So an example of this concern would be having distinct NaN bit patterns, `nan1` and `nan2`, where some expression like `nan + 7` returns `nan1` or `nan2` nondeterministically, as allowed by the IEEE spec. I'm not an expert. Is there evidence this really happens on the same processor?

digama0 · 2024-11-16T08:14:49Z

Two common reasons you might encounter nondeterminism like that:

1. The processor puts information regarding the instruction on which the failure happened into the payload bits as a kind of exception handling. This is not stable since you can easily perform the same operation on the same inputs in two different places.
2. One of the NaN operations was evaluated by the hardware and the other one was evaluated by the compiler via constant propagation, and they differ on NaN handling. This was a recent source of issues in the Rust language, which tries to be correct around this kind of thing, and you can see that the spec they landed on is explicitly nondeterministic because they were not able to promise more than this under the circumstances. (I recommend reading that RFC, it is a good introduction to these issues.)

digama0 · 2024-11-16T08:25:54Z

Note that this is not the first time this issue has come up; see #1459 for nondeterminism leakage through the `Float.toString` function.

nomeata · 2024-11-16T10:59:12Z

Does #6097 address the concerns raised here?

digama0 · 2024-11-16T14:59:48Z

Yes it does. Was there also a function exposed for converting float arrays to byte arrays? That might need the same treatment.

leodemoura added the changelog-library Library label Nov 15, 2024

leodemoura requested a review from kim-em as a code owner November 15, 2024 19:23

leodemoura enabled auto-merge November 15, 2024 19:27

seanmcl approved these changes Nov 15, 2024

github-actions bot temporarily deployed to lean-lang.org/lean4/doc November 15, 2024 19:44 Inactive

leodemoura added this pull request to the merge queue Nov 15, 2024

github-actions bot added the toolchain-available A toolchain is available for this PR, at leanprover/lean4-pr-releases:pr-release-NNNN label Nov 15, 2024

Merged via the queue into master with commit ecbaeff Nov 15, 2024
18 checks passed
