h2oai / h2o-3

H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
http://h2o.ai
Apache License 2.0
6.87k stars 2k forks source link

Frame.toString(off, len) does not report NA properly #13919

Open exalate-issue-sync[bot] opened 1 year ago

exalate-issue-sync[bot] commented 1 year ago
  1. Load attached dataset with NA
  2. call frame.toString(0, numRows-1)
  3. try to find NAs - they will be represented with 0 C1 C2 C3 C4 C5 C6 C7 C8 C9 C10 C11 C12 C13 C14 C15 C16 C17 C18 C19 C20 min 2006 1 11 600 714 93 1 774 412 42 37 39.5 0.0 0 0 17.5 0 -30 mean 2006 1 11 1270 1493 148 613 1871 803 44 37 40.87931034482759 0.0413793103448276 0 0 24.396551724137932 0 29 stddev 0 0 0 450 488 40 547 655 392 5 0 3.075406977753958 0.019221293610962236 0 0 2.8074496253743857 0 35 max 2006 1 12 2140 2331 220 1695 2960 1515 57 38 47.5 0.05 0 0 25.5 0 89 missing 1 1 1 1 1 1 1 1 1 1 1 1.0 1.0 1 1 1.0 1 1 0 2006 1 11 1550 1656 126 US 7 1579 CLT 599 42 37 39.5 0.05 0 0 25.5 0 68 1 2006 1 11 920 1028 128 US 216 1646 CLT 599 42 37 39.5 0.05 0 0 25.5 0 39 2 2006 1 11 2140 2249 129 US 382 1708 CLT 599 42 37 39.5 0.05 0 0 25.5 0 -20 3 2006 1 11 755 903 128 US 689 977 CLT 599 42 37 39.5 0.05 0 0 25.5 0 47 4 2006 1 11 1425 1532 127 US 975 1674 CLT 599 42 37 39.5 0.05 0 0 25.5 0 79 5 2006 1 11 700 804 124 US 1695 2904 DCA 612 42 37 39.5 0.05 0 0 25.5 0 9 6 2006 1 11 939 1516 217 US 720 2349 LAS 1515 42 37 39.5 0.05 0 0 25.5 0 60 7 2006 1 11 1608 2148 220 US 721 2219 LAS 1515 42 37 39.5 0.05 0 0 25.5 0 -15 8 2006 1 11 1129 1705 216 US 722 2323 LAS 1515 42 37 39.5 0.05 0 0 25.5 0 54 9 2006 1 11 1335 1459 144 US 236 977 PHL 678 42 37 39.5 0.05 0 0 25.5 0 65 10 0 1 11 1155 1323 148 US 331 1614 PHL 678 42 37 39.5 0.05 0 0 25.5 0 4 11 2006 0 11 600 714 134 US 357 1841 PHL 678 42 37 39.5 0.05 0 0 25.5 0 -13 12 2006 1 0 1005 1138 153 US 591 1886 PHL 678 42 37 39.5 0.05 0 0 25.5 0 26 13 2006 1 11 0 2115 140 US 762 2779 PHL 678 42 37 39.5 0.05 0 0 25.5 0 81 14 2006 1 11 1710 0 139 US 1603 2894 PHL 678 42 37 39.5 0.05 0 0 25.5 0 89 15 2006 1 11 805 931 0 US 1659 2899 PHL 678 42 37 39.5 0.05 0 0 25.5 0 29 16 2006 1 11 830 1248 198 1 2112 PHX 1440 42 37 39.5 0.05 0 0 25.5 0 30 17 2006 1 11 949 1414 205 US 0 2144 PHX 1440 42 37 39.5 0.05 0 0 25.5 0 14 18 2006 1 11 1312 1732 200 US 5 0 PHX 1440 42 37 39.5 0.05 0 0 25.5 0 81 19 2006 1 11 1545 2007 202 US 7 2336 1440 42 37 39.5 0.05 0 0 25.5 0 48 20 2006 1 11 1912 2331 199 US 9 2254 PHX 0 42 37 39.5 0.05 0 0 25.5 0 -5 21 2006 1 11 800 834 94 US 105 1693 PIT 412 0 37 39.5 0.05 0 0 25.5 0 7 22 2006 1 11 2040 2116 96 US 733 1146 PIT 412 42 0 39.5 0.05 0 0 25.5 0 7 23 2006 1 11 1340 1414 94 US 1404 930 PIT 412 42 37 1.0E-323 0.05 0 0 25.5 0 58 24 2006 1 11 1555 1628 93 US 1605 2960 PIT 412 42 37 39.5 1.0E-323 0 0 25.5 0 64 25 2006 1 12 1550 1656 126 US 7 1579 CLT 599 57 38 47.5 0.0 0 0 17.5 0 -9 26 2006 1 12 920 1028 128 US 216 839 CLT 599 57 38 47.5 0.0 0 0 17.5 0 -14 27 2006 1 12 2140 2249 129 US 382 1713 CLT 599 57 38 47.5 0.0 0 0 1.0E-323 0 -30 28 2006 1 12 755 903 128 US 689 1532 CLT 599 57 38 47.5 0.0 0 0 17.5 0 9 29 2006 1 12 1425 1532 127 US 975 774 CLT 599 57 38 47.5 0.0 0 0 17.5 0 0
DinukaH2O commented 1 year ago

JIRA Issue Migration Info

Jira Issue: PUBDEV-935 Assignee: Spencer Aiello Reporter: Michal Malohlava State: Open Fix Version: N/A Attachments: Available (Count: 1) Development PRs: N/A

Attachments From Jira

Attachment Name: dataset_with_nas.csv Attached By: Michal Malohlava File Link:https://h2o-3-jira-github-migration.s3.amazonaws.com/PUBDEV-935/dataset_with_nas.csv