OHDSI / WhiteRabbit

WhiteRabbit is a small application that can be used to analyse the structure and contents of a database as preparation for designing an ETL. It comes with RabbitInAHat, an application for interactive design of an ETL to the OMOP Common Data Model with the help of the scan report generated by WhiteRabbit.
http://ohdsi.github.io/WhiteRabbit
Apache License 2.0

Java int type leads to overflow for large tables #391

Open janblom opened 1 year ago

janblom commented 1 year ago

Describe the bug The WhiteRabbit scan uses the Java type int for row-based statistics, which limits it to a maximum of 2147483647 (Integer.MAX_VALUE) processable rows.
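A minimal sketch of the failure mode, not taken from the WhiteRabbit source: the class and method names below are hypothetical and only illustrate how an int row counter wraps around once it passes 2147483647, while a long counter keeps counting.

```java
// Hypothetical illustration; none of these names come from the WhiteRabbit code base.
class IntRowCounterDemo {

    // Simulates counting one more row with an int counter.
    // At Integer.MAX_VALUE the addition wraps silently to a negative value.
    static int nextIntCount(int current) {
        return current + 1;
    }

    // The same step with a long counter, which has room for far larger tables.
    static long nextLongCount(long current) {
        return current + 1L;
    }

    public static void main(String[] args) {
        int intCount = Integer.MAX_VALUE;   // 2147483647 rows counted so far
        long longCount = Integer.MAX_VALUE; // same starting point, wider type

        System.out.println(nextIntCount(intCount));   // prints -2147483648
        System.out.println(nextLongCount(longCount)); // prints 2147483648
    }
}
```

Java does not raise an error on int overflow, so a scan affected by this would most likely report wrong (possibly negative) counts rather than fail outright.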

Expected behavior WhiteRabbit should be able to handle larger row counts. In essence, this means switching from int to long, while staying aware of related issues such as calculations that overflow.
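The kind of related issue meant here can be sketched as follows. This is an illustration under assumptions, not the actual WhiteRabbit code: all class and method names are hypothetical. It shows that widening only the result type is not enough when an intermediate calculation still runs in int, and that `Math.addExact` and `Math.toIntExact` (standard since Java 8) can make any remaining overflow fail loudly instead of silently.

```java
// Hypothetical illustration; none of these names come from the WhiteRabbit code base.
class LongRowStatsDemo {

    // Broken: the multiplication is evaluated in int and overflows
    // before the result is widened to long for the return value.
    static long percentBroken(int matching, int total) {
        return matching * 100 / total;
    }

    // Safer: all operands are long, so the intermediate product fits
    // for any row count that started life as an int-sized value.
    static long percentSafe(long matching, long total) {
        return matching * 100L / total;
    }

    // Adds a batch of rows to a running total and throws
    // ArithmeticException instead of wrapping if long itself overflows.
    static long addRows(long count, long batch) {
        return Math.addExact(count, batch);
    }

    // Narrows a long count for an API that still needs an int,
    // throwing ArithmeticException when the value does not fit.
    static int toIntChecked(long count) {
        return Math.toIntExact(count);
    }

    public static void main(String[] args) {
        // 1.5 billion matching rows out of 2 billion: the expected answer is 75.
        System.out.println(percentSafe(1_500_000_000L, 2_000_000_000L)); // prints 75
        // The int version yields a different, incorrect value.
        System.out.println(percentBroken(1_500_000_000, 2_000_000_000));
        // A total beyond the int range is handled without trouble.
        System.out.println(addRows(2_147_483_647L, 1_000L)); // prints 2147484646
    }
}
```

Checked arithmetic costs a little per operation, so a reasonable compromise might be to use it only at the boundaries where counts are aggregated or narrowed.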

Desktop (please complete the following information): All platforms/versions

Additional context First reported on the OHDSI forums.