WormBase / pseudoace

Modelling the WormBase ACeDB database in datomic.
4 stars 3 forks source link

OutOfMemoryError when attempting sort edn logs #19

Closed mgrbyte closed 8 years ago

mgrbyte commented 8 years ago

The import process I was running on the AWS dev machine died with OutOfMemoryError.

Cause

Interestingly, this only happens on the AWS machine using the OpenJDK. The same code running on an EBI server with Oracle Java does not error.

Tracback

Caused by: java.lang.OutOfMemoryError: Requested array size exceeds VM limit

Suggested resolution

I modified the code to use the builtin clojure.core/line-seq function, which is more efficient in that it incrementally reads lines from a stream (as opposed to with slurp + clojure.string/split-lines) but it still errors out, this time with: java.lang.OutOfMemoryError: Java heap space

I suggest we make the change suggested above, and use the Oracle JVM.

mgrbyte commented 8 years ago

Fixed by #21