Open exalate-issue-sync[bot] opened 1 year ago
Matt Dowle commented: I retried and the same problem occurred at 35% in to reading the file. I have 16GB RAM of which about 5GB was being used at the time of failing according to htop. Seems like I had plenty of spare RAM. I could see 8 cores busy according to htop.
Spencer Aiello commented: heyo matt, is it possible for you to scp test.csv to one of the machines?
if you don't have credentials yet, grab some from Kevin :)
or can just scp it to my home dir on 181 e.g.
Tom Kraljevic commented: Matt, this is part user-error, part not getting a good message back. Java won't use all the mem on your system. It will use as much as you tell it to when you start up h2o.
In this case:
H2O cluster total memory: 3.46 GB
Spencer Aiello commented: I think we should have this one piggy-back onto PUBDEV-905
Brandon Hill commented: Agreed. The short term solution should at least reject the parse job. The long term solution should be PUBDEV-905. The difference being that PUBDEV-905 can try reading and compressing, and only give up when death is certain. Memory performance of the parse will be improving, so this will be feasible in time.
JIRA Issue Migration Info
Jira Issue: PUBDEV-1065 Assignee: Brandon Hill Reporter: Matt Dowle State: Open Fix Version: N/A Attachments: N/A Development PRs: N/A
Using latest h2o-dev as of today.
mdowle@MattsMac:~$ R
R version 3.2.0 (2015-04-16) -- "Full of Ingredients" Copyright (C) 2015 The R Foundation for Statistical Computing Platform: x86_64-pc-linux-gnu (64-bit)
R is free software and comes with ABSOLUTELY NO WARRANTY. You are welcome to redistribute it under certain conditions. Type 'license()' or 'licence()' for distribution details.
Natural language support but running in an English locale
R is a collaborative project with many contributors. Type 'contributors()' for more information and 'citation()' on how to cite R or R packages in publications.
Type 'demo()' for some demos, 'help()' for on-line help, or 'help.start()' for an HTML browser interface to help. Type 'q()' to quit R.
Your next step is to start H2O and get a connection object (named 'localH2O', for example):
For H2O package documentation, ask for help:
After starting H2O, you can use the Web UI at http://localhost:54321 For more information visit http://docs.0xdata.com
H2O is not running yet, starting it now...
Note: In case of errors look at the following log files: /tmp/Rtmpig7DSz/h2o_mdowle_started_from_r.out /tmp/Rtmpig7DSz/h2o_mdowle_started_from_r.err
Picked up JAVA_TOOL_OPTIONS: -javaagent:/usr/share/java/jayatanaag.jar java version "1.7.0_80" Java(TM) SE Runtime Environment (build 1.7.0_80-b15) Java HotSpot(TM) 64-Bit Server VM (build 24.80-b11, mixed mode)
Successfully connected to http://127.0.0.1:54321/
R is connected to H2O cluster: H2O cluster uptime: 966 milliseconds H2O cluster version: 0.3.0.99999 H2O cluster name: H2O_started_from_R H2O cluster total nodes: 1 H2O cluster total memory: 3.46 GB H2O cluster total cores: 8 H2O cluster allowed cores: 8 H2O cluster healthy: TRUE