Closed: sriprad closed this issue 3 years ago.
As mentioned in #1350, please share the details requested there; otherwise it will be impossible to debug / replicate.
I have the same problem. Following are my logs.

F:\github_download\semantic_search\haystack>docker-compose pull
[+] Running 13/22

F:\github_download\semantic_search\haystack>docker-compose up
[+] Running 3/3
pip install 'ray[default]'
. Please update your install command.
haystack-api_1 | "update your install command.", FutureWarning)
haystack-api_1 | pdftotext version 4.03 [www.xpdfreader.com]
haystack-api_1 | Copyright 1996-2021 Glyph & Cog, LLC
haystack-api_1 | [nltk_data] Downloading package punkt to /root/nltk_data...
haystack-api_1 | [nltk_data] Error downloading 'punkt' from
haystack-api_1 | [nltk_data] <https://raw.githubusercontent.com/nltk/nltk_data/gh-
haystack-api_1 | [nltk_data] pages/packages/tokenizers/punkt.zip>: <urlopen error
haystack-api_1 | [nltk_data] [Errno 0] Error>
haystack-api_1 | 08/23/2021 06:34:18 - INFO - elasticsearch - HEAD http://elasticsearch:9200/ [status:200 request:0.089s]
haystack-api_1 | 08/23/2021 06:34:18 - INFO - elasticsearch - HEAD http://elasticsearch:9200/document [status:200 request:0.009s]
haystack-api_1 | 08/23/2021 06:34:18 - INFO - elasticsearch - GET http://elasticsearch:9200/document [status:200 request:0.004s]
haystack-api_1 | 08/23/2021 06:34:18 - INFO - elasticsearch - PUT http://elasticsearch:9200/document/_mapping [status:200 request:0.022s]
haystack-api_1 | 08/23/2021 06:34:18 - INFO - elasticsearch - HEAD http://elasticsearch:9200/label [status:200 request:0.003s]
haystack-api_1 | 08/23/2021 06:34:18 - INFO - elasticsearch - HEAD http://elasticsearch:9200/ [status:200 request:0.004s]
haystack-api_1 | 08/23/2021 06:34:18 - INFO - elasticsearch - HEAD http://elasticsearch:9200/document [status:200 request:0.003s]
haystack-api_1 | 08/23/2021 06:34:18 - INFO - elasticsearch - GET http://elasticsearch:9200/document [status:200 request:0.002s]
haystack-api_1 | 08/23/2021 06:34:18 - INFO - elasticsearch - PUT http://elasticsearch:9200/document/_mapping [status:200 request:0.013s]
haystack-api_1 | 08/23/2021 06:34:18 - INFO - elasticsearch - HEAD http://elasticsearch:9200/label [status:200 request:0.004s]
haystack-api_1 | 08/23/2021 06:34:18 - INFO - farm.utils - Using device: CPU
haystack-api_1 | 08/23/2021 06:34:18 - INFO - farm.utils - Number of GPUs: 0
haystack-api_1 | 08/23/2021 06:34:18 - INFO - farm.utils - Distributed Training: False
haystack-api_1 | 08/23/2021 06:34:18 - INFO - farm.utils - Automatic Mixed Precision: None
haystack-api_1 | [2021-08-23 06:34:25 +0000] [10] [ERROR] Exception in worker process
haystack-api_1 | Traceback (most recent call last):
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/transformers/modeling_utils.py", line 1205, in from_pretrained
haystack-api_1 | state_dict = torch.load(resolved_archive_file, map_location="cpu")
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/torch/serialization.py", line 585, in load
haystack-api_1 | with _open_zipfile_reader(opened_file) as opened_zipfile:
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/torch/serialization.py", line 242, in init
haystack-api_1 | super(_open_zipfile_reader, self).init(torch._C.PyTorchFileReader(name_or_buffer))
haystack-api_1 | RuntimeError: [enforce fail at inline_container.cc:145] . PytorchStreamReader failed reading zip archive: failed finding central directory
haystack-api_1 |
haystack-api_1 | During handling of the above exception, another exception occurred:
haystack-api_1 |
haystack-api_1 | Traceback (most recent call last):
haystack-api_1 | File "/home/user/haystack/pipeline.py", line 400, in _load_or_get_component
haystack-api_1 | instance = BaseComponent.load_from_args(component_type=component_type, **component_params)
haystack-api_1 | File "/home/user/haystack/schema.py", line 284, in load_from_args
haystack-api_1 | instance = subclass(**kwargs)
haystack-api_1 | File "/home/user/haystack/reader/farm.py", line 113, in __init__
haystack-api_1 | strict=False)
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/farm/infer.py", line 268, in load
haystack-api_1 | task_type=task_type)
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/farm/modeling/adaptive_model.py", line 548, in convert_from_transformers
haystack-api_1 | processor=processor)
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/farm/conversion/transformers.py", line 91, in convert_from_transformers
haystack-api_1 | lm = LanguageModel.load(model_name_or_path, revision=revision)
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/farm/modeling/language_model.py", line 150, in load
haystack-api_1 | language_model = cls.subclasses[language_model_class].load(pretrained_model_name_or_path, **kwargs)
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/farm/modeling/language_model.py", line 654, in load
haystack-api_1 | roberta.model = RobertaModel.from_pretrained(str(pretrained_model_name_or_path), **kwargs)
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/transformers/modeling_utils.py", line 1208, in from_pretrained
haystack-api_1 | f"Unable to load weights from pytorch checkpoint file for '{pretrained_model_name_or_path}' "
haystack-api_1 | OSError: Unable to load weights from pytorch checkpoint file for 'deepset/roberta-base-squad2' at '/root/.cache/huggingface/transformers/eac3273a8097dda671e3bea1db32c616e74f36a306c65b4858171c98d6db83e9.084aa7284f3a51fa1c8f0641aa04c47d366fbd18711f29d0a995693cfdbc9c9e'If you tried to load a PyTorch model from a TF 2.0 checkpoint, please set from_tf=True.
haystack-api_1 |
haystack-api_1 | During handling of the above exception, another exception occurred:
haystack-api_1 |
haystack-api_1 | Traceback (most recent call last):
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/gunicorn/arbiter.py", line 589, in spawn_worker
haystack-api_1 | worker.init_process()
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/uvicorn/workers.py", line 66, in init_process
haystack-api_1 | super(UvicornWorker, self).init_process()
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/gunicorn/workers/base.py", line 134, in init_process
haystack-api_1 | self.load_wsgi()
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/gunicorn/workers/base.py", line 146, in load_wsgi
haystack-api_1 | self.wsgi = self.app.wsgi()
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/gunicorn/app/base.py", line 67, in wsgi
haystack-api_1 | self.callable = self.load()
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/gunicorn/app/wsgiapp.py", line 58, in load
haystack-api_1 | return self.load_wsgiapp()
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/gunicorn/app/wsgiapp.py", line 48, in load_wsgiapp
haystack-api_1 | return util.import_app(self.app_uri)
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/gunicorn/util.py", line 359, in import_app
haystack-api_1 | mod = importlib.import_module(module)
haystack-api_1 | File "/usr/local/lib/python3.7/importlib/init.py", line 127, in import_module
haystack-api_1 | return _bootstrap._gcd_import(name[level:], package, level)
haystack-api_1 | File "pip install 'ray[default]'
. Please update your install command.
haystack-api_1 | "update your install command.", FutureWarning)
haystack-api_1 | pdftotext version 4.03 [www.xpdfreader.com]
haystack-api_1 | Copyright 1996-2021 Glyph & Cog, LLCdocker stats log is following: CONTAINER ID NAME CPU % MEM USAGE / LIMIT MEM % NET I/O BLOCK I/O PIDS 61f306700043 haystack_haystack-api_1 0.05% 282MiB / 12.45GiB 2.21% 2.59MB / 95.3kB 0B / 0B 2 5cd1972a99f4 haystack_elasticsearch_1 0.26% 1.298GiB / 12.45GiB 10.43% 5.84kB / 3.71kB 0B / 0B 61 d7934023abd3 haystack_ui_1 2.57% 101.8MiB / 12.45GiB 0.80% 142kB / 203kB 0B / 0B 7
Hi @NeekHua, there are two things that catch my eye in your logs:
streamlit.caching.CacheKeyNotFoundError: Key not found in mem cache
Exception: Failed loading pipeline component 'Reader': Unable to load weights from pytorch checkpoint file for 'deepset/roberta-base-squad2' at '/root/.cache/huggingface/transformers/eac3273a8097dda671e3bea1db32c616e74f36a306c65b4858171c98d6db83e9.084aa7284f3a51fa1c8f0641aa04c47d366fbd18711f29d0a995693cfdbc9c9e'If you tried to load a PyTorch model from a TF 2.0 checkpoint, please set from_tf=True.
Is it possible that the problem lies in your cache? Perhaps something around permissions?
One way to test this would be to run a script that loads a huggingface model from cache. In Haystack, you could try running a script like this twice
from haystack.reader.farm import FARMReader

reader = FARMReader(model_name_or_path="deepset/roberta-base-squad2")
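If you want to take Haystack out of the equation entirely, here is a minimal sketch (my suggestion, not an official debugging script) that exercises the same Hugging Face cache directly through transformers:

from transformers import AutoModelForQuestionAnswering, AutoTokenizer

# The first run downloads the checkpoint into ~/.cache/huggingface/transformers;
# the second run should load it straight from the cache. If the second run fails with
# the same "failed finding central directory" error, the cached file itself is corrupted.
model = AutoModelForQuestionAnswering.from_pretrained("deepset/roberta-base-squad2")
tokenizer = AutoTokenizer.from_pretrained("deepset/roberta-base-squad2")
print("Loaded", model.config.model_type, "with", model.num_parameters(), "parameters")

If this only fails inside the container, that would point to the cache volume or its permissions rather than the model itself.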
I've encountered the same issue. Have you figured out a solution? @NeekHua
@julian-risch I encountered the same / similar issue when running docker-compose up. Is it possible to separate the components into several Docker images so that we can run them one by one? Thanks!
I am not sure whether I correctly understand your idea. In https://github.com/deepset-ai/haystack/blob/master/docker-compose.yml we already have separate components with three images: "deepset/haystack-cpu:latest", "deepset/elasticsearch-game-of-thrones", and "deepset/haystack-streamlit-ui:latest"
Maybe you can ignore my earlier comment, @julian-risch. However, I did encounter the following error message when running sudo docker-compose up. Please kindly advise how to handle it. Thanks a lot!
haystack-api_1 | During handling of the above exception, another exception occurred:
haystack-api_1 |
haystack-api_1 | Traceback (most recent call last):
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/gunicorn/arbiter.py", line 589, in spawn_worker
haystack-api_1 | worker.init_process()
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/uvicorn/workers.py", line 66, in init_process
haystack-api_1 | super(UvicornWorker, self).init_process()
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/gunicorn/workers/base.py", line 134, in init_process
haystack-api_1 | self.load_wsgi()
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/gunicorn/workers/base.py", line 146, in load_wsgi
haystack-api_1 | self.wsgi = self.app.wsgi()
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/gunicorn/app/base.py", line 67, in wsgi
haystack-api_1 | self.callable = self.load()
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/gunicorn/app/wsgiapp.py", line 58, in load
haystack-api_1 | return self.load_wsgiapp()
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/gunicorn/app/wsgiapp.py", line 48, in load_wsgiapp
haystack-api_1 | return util.import_app(self.app_uri)
haystack-api_1 | File "/usr/local/lib/python3.7/site-packages/gunicorn/util.py", line 359, in import_app
haystack-api_1 | mod = importlib.import_module(module)
haystack-api_1 | File "/usr/local/lib/python3.7/importlib/init.py", line 127, in import_module
haystack-api_1 | return _bootstrap._gcd_import(name[level:], package, level)
haystack-api_1 | File "
Hi @XingYiBao, I can't replicate your issue with the information you provided. What I can suggest is to run docker without sudo, and if it still doesn't work, to give us some more information about your setup, like:
- what happens when you run docker-compose up?
- did you modify the docker-compose.yml file in any way?
and any other information that might help.
Note: the root of the issue (Unable to load weights from pytorch checkpoint file for 'deepset/roberta-base-squad2' at '/root/.cache/huggingface/transformers) seems to come from transformers (https://github.com/huggingface/transformers), and it's an odd one: https://github.com/huggingface/transformers/blob/51ee20fc26381ca8aba4d4da9b410379302ee1d1/src/transformers/modeling_utils.py#L1361. If somebody can provide a way to reproduce this issue reliably I will definitely investigate.
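In the meantime, a quick sanity check anyone hitting this could run (a rough sketch; the cache path below is copied from the error message above, so substitute the hash from your own logs) is to verify that the cached checkpoint is a valid zip archive at all:

import zipfile

# Path taken from the "Unable to load weights" error above; replace with the hash from your own logs.
cached_file = "/root/.cache/huggingface/transformers/eac3273a8097dda671e3bea1db32c616e74f36a306c65b4858171c98d6db83e9.084aa7284f3a51fa1c8f0641aa04c47d366fbd18711f29d0a995693cfdbc9c9e"

# PyTorch checkpoints saved with the default zip serialization are regular zip archives,
# so False here means the download was truncated or corrupted; deleting the file forces a re-download.
print(zipfile.is_zipfile(cached_file))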
@ZanSara It seems it is an issue with the default docker-compose.yml. I made some changes and now it works. Anyway, thanks a lot!
Glad to hear! If you could share the changes you've made to the docker-compose.yml, we can fix the default one so that it works for everybody :slightly_smiling_face:
@ZanSara No problem. Below are the changes we made to the docker-compose.yml file: we set network_mode: bridge, because otherwise the default IP range created by Docker MAY conflict with your internal network IP addresses. The sample file after the changes is as below:
version: "3"
services:
haystack-api:
build:
context: .
dockerfile: Dockerfile
image: "/deepset/haystack-cpu:latest"
# Mount custom Pipeline YAML and custom Components.
# volumes:
# - ./rest_api/pipeline:/home/user/rest_api/pipeline
ports:
- 8000:8000
environment:
# See rest_api/pipelines.yaml for configurations of Search & Indexing Pipeline.
#- ELASTICSEARCHDOCUMENTSTORE_PARAMS_HOST=elasticsearch
#- DOCUMENTSTORE_PARAMS_HOST=elasticsearch
- DOCUMENTSTORE_PARAMS_HOST={ip address of elasticsearch host}
restart: always
#depends_on:
# - elasticsearch
command: "/bin/bash -c 'sleep 15 && gunicorn rest_api.application:app -b 0.0.0.0 -k uvicorn.workers.UvicornWorker --workers 1 --timeout 180'"
network_mode: "bridge"
elasticsearch:
# This will start an empty elasticsearch instance (so you have to add your documents yourself)
#image: "elasticsearch:7.9.2"
# If you want a demo image instead that is "ready-to-query" with some indexed Game of Thrones articles:
image: "/deepset/elasticsearch-game-of-thrones"
ports:
- 9200:9200
environment:
- discovery.type=single-node
network_mode: "bridge"
ui:
build:
context: ui
dockerfile: Dockerfile
image: "/deepset/haystack-streamlit-ui:latest"
ports:
- 8501:8501
environment:
- API_ENDPOINT=http://{your ip address}:8000
- EVAL_FILE=eval_labels_example.csv
network_mode: "bridge"
BTW, another SEPARATE issue: I strongly recommend that you specify the version for each Docker image so that it won't use the latest one, because the latest one may result in unexpected problems. For example, this is the docker-compose-gpu.yml file after I changed your original one:
version: "3"
services:
haystack-api:
build:
context: .
dockerfile: Dockerfile
image: "/deepset/haystack-gpu:0.10.0"
while the docker-all.repo.ebaotech.com/deepset/haystack-gpu:latest does NOT work.
Fantastic, thank you! (I took the liberty of adding a code block around the YAML for readability, I hope you don't mind).
Yes, the latest image might break sometimes, so pinning it to the latest release is a good idea. I'm going to discuss it with the rest of the team.
Closing for now. If somebody else has the same issue, feel free to re-open or to just open a new issue :slightly_smiling_face:
Describe the bug: I am trying to replicate the Docker (demo) setup with the webapp (Streamlit) version of the code. When I run docker-compose up I am getting the below error.
Error message:
raise ConnectionError(e, request=request)
ui_1 | requests.exceptions.ConnectionError: HTTPConnectionPool(host='haystack-api', port=8000): Max retries exceeded with url: /query (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x7f5cce1a2350>: Failed to establish a new connection: [Errno 111] Connection refused'))
System: