pre-commit / pre-commit-hooks

Some out-of-the-box hooks for pre-commit
MIT License
5.39k stars 709 forks source link

`pretty-format-json` modifies floating point numbers that have too many digits of precision #780

Open dougthor42 opened 2 years ago

dougthor42 commented 2 years ago

pretty-format-json modifies floating point numbers that have too many digits of precision.

Background

I have to work with JSON files that may be generated by non-python programs. These files, for whatever reason, have numbers that have up to 16 digits after the decimal place.

(It's absurd, really. The people taking these measurements are somehow able to measure 0.1μHz on a 10GHz scale?? Yeah, they're saving values like 5.9257052820783001 GHz. Someone needs to teach them about significant figures... but that's beside the point. The point is I have to deal with this data :unamused:)

Steps to Reproduce

  1. Create the following json file:
    {"foo": 4.4257052820783003}
  2. Run pretty-format-json on it.

Expected Output:

{
  "foo": 4.4257052820783003
}

Actual Output:

{
  "foo": 4.4257052820783
}

The diff from expected is:

 {
-  "foo": 4.4257052820783003
+  "foo": 4.4257052820783
 }

Version Info

Discussion

This might be something that has to be fixed within the python builtin json package. A custom JSON encoder/decoder that wraps things using decimal.Decimal might work too.

I've created a test case for this. See my high-precision-numbers branch or the diff.

I'll see if I have time to actually fix this, but I don't expect to :frowning_face:.

asottile commented 2 years ago

yeah I suspect this isn't really fixable due to json -- but feel free to take a stab at it

liortct commented 2 years ago

As @asottile said the problem is in json standard library. image Therefore, this issue isn't fixable in the pre-commit hook and I advice to close this issue.

SilverTux commented 2 years ago

Hi, I have a PR #818 to solve this issue. What do you think about it, can you check it please? @dougthor42 @asottile I've already included the tests for this from @dougthor42's diff

dougthor42 commented 2 years ago

@asottile How would you feel about a custom json encoder/decoder that handles floats using decimal.Decimal?

json.loads also has a parse_float arg that might be usable instead or in addition.

riccardobucco commented 2 years ago

Python has no built-in arbitrary-precision floats. Here is an example:

>>> float(4.4257052820783003)
4.4257052820783

So it doesn't matter what you use, you can't have a float object with arbitrary precision.