UW-COSMOS / Cosmos

Knowledge base construction from raw scientific documents
37 stars 16 forks source link

Elasticsearch virtual memory error #167

Open RichardScottOZ opened 2 years ago

RichardScottOZ commented 2 years ago
es01                | {"type": "server", "timestamp": "2022-08-12T08:17:23,741Z", "level": "INFO", "component": "o.e.t.TransportService", "cluster.name": "es-docker-cluster", "node.name": "es01", "message": "publish_address {10.0.1.9:9300}, bound_addresses {0.0.0.0:9300}" }
es01                | {"type": "server", "timestamp": "2022-08-12T08:17:23,892Z", "level": "INFO", "component": "o.e.b.BootstrapChecks", "cluster.name": "es-docker-cluster", "node.name": "es01", "message": "bound or publishing to a non-loopback address, enforcing bootstrap checks" }
es01                | ERROR: [1] bootstrap checks failed
es01                | [1]: max virtual memory areas vm.max_map_count [65530] is too low, increase to at least [262144]
es01                | ERROR: Elasticsearch did not exit normally - check the logs at /usr/share/elasticsearch/logs/es-docker-cluster.log
es01                | {"type": "server", "timestamp": "2022-08-12T08:17:23,899Z", "level": "INFO", "component": "o.e.n.Node", "cluster.name": "es-docker-cluster", "node.name": "es01", "message": "stopping ..." }
es01                | {"type": "server", "timestamp": "2022-08-12T08:17:23,910Z", "level": "INFO", "component": "o.e.n.Node", "cluster.name": "es-docker-cluster", "node.name": "es01", "message": "stopped" }
es01                | {"type": "server", "timestamp": "2022-08-12T08:17:23,910Z", "level": "INFO", "component": "o.e.n.Node", "cluster.name": "es-docker-cluster", "node.name": "es01", "message": "closing ..." }
es01                | {"type": "server", "timestamp": "2022-08-12T08:17:23,921Z", "level": "INFO", "component": "o.e.n.Node", "cluster.name": "es-docker-cluster", "node.name": "es01", "message": "closed" }
es01                | {"type": "server", "timestamp": "2022-08-12T08:17:23,922Z", "level": "INFO", "component": "o.e.x.m.p.NativeController", "cluster.name": "es-docker-cluster", "node.name": "es01", "message": "Native controller process has stopped - no new native processes can be started" }

Is that suggesting server should have 256GB minimum to run this?

RichardScottOZ commented 2 years ago

Which would seem like something to run on a non-GPU machine - e.g. 'the already have server' section?

RichardScottOZ commented 2 years ago

Leading back to the quick advice on setup section.

iross commented 2 years ago

Is that suggesting server should have 256GB minimum to run this?

Nope, it can run on much much less -- if I remember correctly, these are just counts of memory maps available to the Elasticsearch JVM, as opposed to MB. It's a system setting, and I'll have to look at the ES documentation to see if there's a way for the ES instance to accept the lower system setting (accepting any potential performance or stability hits).

RichardScottOZ commented 2 years ago

Thanks - I was just running it on a 32GB RAM gpu as a cheap test - ironically gpu bit didn't work. :)

Instant search engine capability is certainly appealing.