Open xiaolvtongxue-zt opened 5 months ago
你好,在使用数据库问题回答时,数据召回似乎正常,但rerank时报错了。 在chat_data-chatollama-1 的日志日这样的:
Knowledge base file created with ID: 2 URL: /api/models/chat User: null Chat with knowledge base with id: 1 Knowledge base 新闻数据 with embedding "nomic-embed-text:latest" Creating embeddings for Ollama served model: nomic-embed-text:latest Creating Chroma vector store Initializing ParentDocumentRetriever with RedisDocstore Redis client options: { host: 'redis', port: 6379, username: undefined, password: undefined } Chat with OpenAI, host: http://192.168.0.74:7869/v1 User query: 嫦娥六号是第几次实现月球轨道交会对接? Reformulated query: 嫦娥六号是第几次实现月球轨道交会对接? Relevant documents: [ Document { pageContent: '{"data":null,"code":451,"name":"SecurityCompromiseError","status":45102,"message":"Domain www.sohu.com blocked until Fri Jun 07 2024 03:59:17 GMT+0000 (Coordinated Universal Time) due to previous abuse found on http://www.sohu.com/a/784279579_461392: DDoS attack suspected: Too many requests: 2001","readableMessage":"SecurityCompromiseError: Domain www.sohu.com blocked until Fri Jun 07 2024 03:59:17 GMT+0000 (Coordinated Universal Time) due to previous abuse found on http://www.sohu.com/a/784279579_461392: DDoS attack suspected: Too many requests: 2001"}', metadata: { source: 'https://www.sohu.com/a/784271789_114988', title: '', loc: [Object] } }, � Document { pageContent: '{"data":null,"code":451,"name":"SecurityCompromiseError","status":45102,"message":"Domain www.sohu.com blocked until Fri Jun 07 2024 03:59:17 GMT+0000 (Coordinated Universal Time) due to previous abuse found on http://www.sohu.com/a/784279579_461392: DDoS attack suspected: Too many requests: 2001","readableMessage":"SecurityCompromiseError: Domain www.sohu.com blocked until Fri Jun 07 2024 03:59:17 GMT+0000 (Coordinated Universal Time) due to previous abuse found on http://www.sohu.com/a/784279579_461392: DDoS attack suspected: Too many requests: 2001"}', metadata: { source: 'https://www.sohu.com/?strategyid=00001 ', title: '', loc: [Object] } }, � Document { pageContent: '{"data":null,"code":451,"name":"SecurityCompromiseError","status":45102,"message":"Domain www.sohu.com blocked until Fri Jun 07 2024 03:59:17 GMT+0000 (Coordinated Universal Time) due to previous abuse found on http://www.sohu.com/a/784279579_461392: DDoS attack suspected: Too many requests: 2001","readableMessage":"SecurityCompromiseError: Domain www.sohu.com blocked until Fri Jun 07 2024 03:59:17 GMT+0000 (Coordinated Universal Time) due to previous abuse found on http://www.sohu.com/a/784279579_461392: DDoS attack suspected: Too many requests: 2001"}', metadata: { source: 'https://www.sohu.com/a/784271789_114988', title: '', loc: [Object] } }, � Document { pageContent: '{"data":null,"path":"url","code":400,"name":"ParamValidationError","status":40001,"message":"Invalid protocol about:","readableMessage":"ParamValidationError(url): Invalid protocol about:"}', metadata: { source: 'about:blank#comment_area', title: '', loc: [Object] } } ] Cohere Rerank Options: { apiKey: 'xxxxx', baseUrl: 'http://peanutshell:8000/v1', model: 'ms-marco-MiniLM-L-6-v2', topN: 4 } ERROR [nuxt] [request error] [unhandled] [500] Status code: 500 Body: "Internal Server Error" at CohereClient.<anonymous> (./node_modules/.pnpm/cohere-ai@7.9.3/node_modules/cohere-ai/Client.js:481:27) at Generator.next (<anonymous>) at fulfilled (./node_modules/.pnpm/cohere-ai@7.9.3/node_modules/cohere-ai/Client.js:31:58) at process.processTicksAndRejections (node:internal/process/task_queues:95:5)
在[chat_data-peanutshell-1]花生壳的日志是这样的:
File "/usr/local/lib/python3.11/site-packages/requests/adapters.py", line 507, in send raise ConnectTimeout(e, request=request) requests.exceptions.ConnectTimeout: (MaxRetryError("HTTPSConnectionPool(host='huggingface.co', port=443): Max retries exceeded with url: /cross-encoder/ms-marco-MiniLM-L-6-v2/resolve/main/config.json (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x7f5c5f89ac90>, 'Connection to huggingface.co timed out. (connect timeout=10)'))"), '(Request ID: 8539d890-1012-4a77-9ec4-e449865ce6f7)') The above exception was the direct cause of the following exception: Traceback (most recent call last): File "/usr/local/lib/python3.11/site-packages/transformers/utils/hub.py", line 399, in cached_file resolved_file = hf_hub_download( ^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/huggingface_hub/utils/_validators.py", line 114, in _inner_fn return fn(*args, **kwargs) ^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/huggingface_hub/file_download.py", line 1221, in hf_hub_download return _hf_hub_download_to_cache_dir( ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/huggingface_hub/file_download.py", line 1325, in _hf_hub_download_to_cache_dir _raise_on_head_call_error(head_call_error, force_download, local_files_only) File "/usr/local/lib/python3.11/site-packages/huggingface_hub/file_download.py", line 1826, in _raise_on_head_call_error raise LocalEntryNotFoundError( huggingface_hub.utils._errors.LocalEntryNotFoundError: An error happened while trying to locate the file on the Hub and we cannot find the requested files in the local cache. Please check your connection and try again or make sure your Internet connection is on. The above exception was the direct cause of the following exception: Traceback (most recent call last): File "/usr/local/lib/python3.11/site-packages/uvicorn/protocols/http/httptools_impl.py", line 411, in run_asgi result = await app( # type: ignore[func-returns-value] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/uvicorn/middleware/proxy_headers.py", line 69, in __call__ return await self.app(scope, receive, send) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/fastapi/applications.py", line 1054, in __call__ await super().__call__(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/applications.py", line 123, in __call__ await self.middleware_stack(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/middleware/errors.py", line 186, in __call__ raise exc File "/usr/local/lib/python3.11/site-packages/starlette/middleware/errors.py", line 164, in __call__ await self.app(scope, receive, _send) File "/usr/local/lib/python3.11/site-packages/starlette/middleware/exceptions.py", line 65, in __call__ await wrap_app_handling_exceptions(self.app, conn)(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 64, in wrapped_app raise exc File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 53, in wrapped_app await app(scope, receive, sender) File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 756, in __call__ await self.middleware_stack(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 776, in app await route.handle(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 297, in handle await self.app(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 77, in app await wrap_app_handling_exceptions(app, request)(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 64, in wrapped_app raise exc File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 53, in wrapped_app await app(scope, receive, sender) File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 72, in app response = await func(request) ^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/fastapi/routing.py", line 278, in app raw_response = await run_endpoint_function( ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/fastapi/routing.py", line 193, in run_endpoint_function return await run_in_threadpool(dependant.call, **values) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/starlette/concurrency.py", line 42, in run_in_threadpool return await anyio.to_thread.run_sync(func, *args) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/anyio/to_thread.py", line 56, in run_sync return await get_async_backend().run_sync_in_worker_thread( ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/anyio/_backends/_asyncio.py", line 2144, in run_sync_in_worker_thread return await future ^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/anyio/_backends/_asyncio.py", line 851, in run result = context.run(func, *args) ^^^^^^^^^^^^^^^^^^^^^^^^ File "/app/endpoints/__init__.py", line 24, in rerank service = CrossEncoderRerankService(modelName=model_name) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/app/services/cross_encoder/cross_encoder_rerank_service.py", line 17, in __init__ self.cross_encoder = CrossEncoder(model_name=modelName, local_files_only=False) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/sentence_transformers/cross_encoder/CrossEncoder.py", line 72, in __init__ self.config = AutoConfig.from_pretrained( ^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/transformers/models/auto/configuration_auto.py", line 934, in from_pretrained config_dict, unused_kwargs = PretrainedConfig.get_config_dict(pretrained_model_name_or_path, **kwargs) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/transformers/configuration_utils.py", line 632, in get_config_dict config_dict, kwargs = cls._get_config_dict(pretrained_model_name_or_path, **kwargs) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/transformers/configuration_utils.py", line 689, in _get_config_dict resolved_config_file = cached_file( ^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/transformers/utils/hub.py", line 442, in cached_file raise EnvironmentError( OSError: We couldn't connect to 'https://huggingface.co' to load this file, couldn't find it in the cached files and it looks like cross-encoder/ms-marco-MiniLM-L-6-v2 is not the path to a directory containing a file named config.json. Checkout your internet connection or see how to run the library in offline mode at 'https://huggingface.co/docs/transformers/installation#offline-mode'.
问题是:
- 既然需要重新从HF上下载模型?我们在拉取镜像的时候,有8G的大小,难道不是已经把模型拉取下来了??这个问题可以解决吗??比如说:如果模型已经拉取下来了,如何修改拉取模型的路径?如果没有拉取下来,是否可以手动去下载,后续放在某一个指定的路径下????
这个问题我遇到过,解决了,需要在花生壳哪里的docker文件,改成国内镜像网站https://hf-mirror.com/可以解决这个问题,这个是花生壳哪里部署问题,根据花生壳演示操作,他会下载模型,或者科学上网一下,但是看你也是ubuntu,需要all_proxy一下
你好,在使用数据库问题回答时,数据召回似乎正常,但rerank时报错了。 在chat_data-chatollama-1 的日志日这样的:
Knowledge base file created with ID: 2 URL: /api/models/chat User: null Chat with knowledge base with id: 1 Knowledge base 新闻数据 with embedding "nomic-embed-text:latest" Creating embeddings for Ollama served model: nomic-embed-text:latest Creating Chroma vector store Initializing ParentDocumentRetriever with RedisDocstore Redis client options: { host: 'redis', port: 6379, username: undefined, password: undefined } Chat with OpenAI, host: http://192.168.0.74:7869/v1 User query: 嫦娥六号是第几次实现月球轨道交会对接? Reformulated query: 嫦娥六号是第几次实现月球轨道交会对接? Relevant documents: [ Document { pageContent: '{"data":null,"code":451,"name":"SecurityCompromiseError","status":45102,"message":"Domain www.sohu.com blocked until Fri Jun 07 2024 03:59:17 GMT+0000 (Coordinated Universal Time) due to previous abuse found on http://www.sohu.com/a/784279579_461392: DDoS attack suspected: Too many requests: 2001","readableMessage":"SecurityCompromiseError: Domain www.sohu.com blocked until Fri Jun 07 2024 03:59:17 GMT+0000 (Coordinated Universal Time) due to previous abuse found on http://www.sohu.com/a/784279579_461392: DDoS attack suspected: Too many requests: 2001"}', metadata: { source: 'https://www.sohu.com/a/784271789_114988', title: '', loc: [Object] } }, � Document { pageContent: '{"data":null,"code":451,"name":"SecurityCompromiseError","status":45102,"message":"Domain www.sohu.com blocked until Fri Jun 07 2024 03:59:17 GMT+0000 (Coordinated Universal Time) due to previous abuse found on http://www.sohu.com/a/784279579_461392: DDoS attack suspected: Too many requests: 2001","readableMessage":"SecurityCompromiseError: Domain www.sohu.com blocked until Fri Jun 07 2024 03:59:17 GMT+0000 (Coordinated Universal Time) due to previous abuse found on http://www.sohu.com/a/784279579_461392: DDoS attack suspected: Too many requests: 2001"}', metadata: { source: 'https://www.sohu.com/?strategyid=00001 ', title: '', loc: [Object] } }, � Document { pageContent: '{"data":null,"code":451,"name":"SecurityCompromiseError","status":45102,"message":"Domain www.sohu.com blocked until Fri Jun 07 2024 03:59:17 GMT+0000 (Coordinated Universal Time) due to previous abuse found on http://www.sohu.com/a/784279579_461392: DDoS attack suspected: Too many requests: 2001","readableMessage":"SecurityCompromiseError: Domain www.sohu.com blocked until Fri Jun 07 2024 03:59:17 GMT+0000 (Coordinated Universal Time) due to previous abuse found on http://www.sohu.com/a/784279579_461392: DDoS attack suspected: Too many requests: 2001"}', metadata: { source: 'https://www.sohu.com/a/784271789_114988', title: '', loc: [Object] } }, � Document { pageContent: '{"data":null,"path":"url","code":400,"name":"ParamValidationError","status":40001,"message":"Invalid protocol about:","readableMessage":"ParamValidationError(url): Invalid protocol about:"}', metadata: { source: 'about:blank#comment_area', title: '', loc: [Object] } } ] Cohere Rerank Options: { apiKey: 'xxxxx', baseUrl: 'http://peanutshell:8000/v1', model: 'ms-marco-MiniLM-L-6-v2', topN: 4 } ERROR [nuxt] [request error] [unhandled] [500] Status code: 500 Body: "Internal Server Error" at CohereClient.<anonymous> (./node_modules/.pnpm/cohere-ai@7.9.3/node_modules/cohere-ai/Client.js:481:27) at Generator.next (<anonymous>) at fulfilled (./node_modules/.pnpm/cohere-ai@7.9.3/node_modules/cohere-ai/Client.js:31:58) at process.processTicksAndRejections (node:internal/process/task_queues:95:5)
在[chat_data-peanutshell-1]花生壳的日志是这样的:
File "/usr/local/lib/python3.11/site-packages/requests/adapters.py", line 507, in send raise ConnectTimeout(e, request=request) requests.exceptions.ConnectTimeout: (MaxRetryError("HTTPSConnectionPool(host='huggingface.co', port=443): Max retries exceeded with url: /cross-encoder/ms-marco-MiniLM-L-6-v2/resolve/main/config.json (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x7f5c5f89ac90>, 'Connection to huggingface.co timed out. (connect timeout=10)'))"), '(Request ID: 8539d890-1012-4a77-9ec4-e449865ce6f7)') The above exception was the direct cause of the following exception: Traceback (most recent call last): File "/usr/local/lib/python3.11/site-packages/transformers/utils/hub.py", line 399, in cached_file resolved_file = hf_hub_download( ^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/huggingface_hub/utils/_validators.py", line 114, in _inner_fn return fn(*args, **kwargs) ^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/huggingface_hub/file_download.py", line 1221, in hf_hub_download return _hf_hub_download_to_cache_dir( ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/huggingface_hub/file_download.py", line 1325, in _hf_hub_download_to_cache_dir _raise_on_head_call_error(head_call_error, force_download, local_files_only) File "/usr/local/lib/python3.11/site-packages/huggingface_hub/file_download.py", line 1826, in _raise_on_head_call_error raise LocalEntryNotFoundError( huggingface_hub.utils._errors.LocalEntryNotFoundError: An error happened while trying to locate the file on the Hub and we cannot find the requested files in the local cache. Please check your connection and try again or make sure your Internet connection is on. The above exception was the direct cause of the following exception: Traceback (most recent call last): File "/usr/local/lib/python3.11/site-packages/uvicorn/protocols/http/httptools_impl.py", line 411, in run_asgi result = await app( # type: ignore[func-returns-value] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/uvicorn/middleware/proxy_headers.py", line 69, in __call__ return await self.app(scope, receive, send) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/fastapi/applications.py", line 1054, in __call__ await super().__call__(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/applications.py", line 123, in __call__ await self.middleware_stack(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/middleware/errors.py", line 186, in __call__ raise exc File "/usr/local/lib/python3.11/site-packages/starlette/middleware/errors.py", line 164, in __call__ await self.app(scope, receive, _send) File "/usr/local/lib/python3.11/site-packages/starlette/middleware/exceptions.py", line 65, in __call__ await wrap_app_handling_exceptions(self.app, conn)(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 64, in wrapped_app raise exc File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 53, in wrapped_app await app(scope, receive, sender) File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 756, in __call__ await self.middleware_stack(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 776, in app await route.handle(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 297, in handle await self.app(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 77, in app await wrap_app_handling_exceptions(app, request)(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 64, in wrapped_app raise exc File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 53, in wrapped_app await app(scope, receive, sender) File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 72, in app response = await func(request) ^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/fastapi/routing.py", line 278, in app raw_response = await run_endpoint_function( ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/fastapi/routing.py", line 193, in run_endpoint_function return await run_in_threadpool(dependant.call, **values) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/starlette/concurrency.py", line 42, in run_in_threadpool return await anyio.to_thread.run_sync(func, *args) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/anyio/to_thread.py", line 56, in run_sync return await get_async_backend().run_sync_in_worker_thread( ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/anyio/_backends/_asyncio.py", line 2144, in run_sync_in_worker_thread return await future ^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/anyio/_backends/_asyncio.py", line 851, in run result = context.run(func, *args) ^^^^^^^^^^^^^^^^^^^^^^^^ File "/app/endpoints/__init__.py", line 24, in rerank service = CrossEncoderRerankService(modelName=model_name) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/app/services/cross_encoder/cross_encoder_rerank_service.py", line 17, in __init__ self.cross_encoder = CrossEncoder(model_name=modelName, local_files_only=False) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/sentence_transformers/cross_encoder/CrossEncoder.py", line 72, in __init__ self.config = AutoConfig.from_pretrained( ^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/transformers/models/auto/configuration_auto.py", line 934, in from_pretrained config_dict, unused_kwargs = PretrainedConfig.get_config_dict(pretrained_model_name_or_path, **kwargs) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/transformers/configuration_utils.py", line 632, in get_config_dict config_dict, kwargs = cls._get_config_dict(pretrained_model_name_or_path, **kwargs) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/transformers/configuration_utils.py", line 689, in _get_config_dict resolved_config_file = cached_file( ^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/transformers/utils/hub.py", line 442, in cached_file raise EnvironmentError( OSError: We couldn't connect to 'https://huggingface.co' to load this file, couldn't find it in the cached files and it looks like cross-encoder/ms-marco-MiniLM-L-6-v2 is not the path to a directory containing a file named config.json. Checkout your internet connection or see how to run the library in offline mode at 'https://huggingface.co/docs/transformers/installation#offline-mode'.
问题是:
- 既然需要重新从HF上下载模型?我们在拉取镜像的时候,有8G的大小,难道不是已经把模型拉取下来了??这个问题可以解决吗??比如说:如果模型已经拉取下来了,如何修改拉取模型的路径?如果没有拉取下来,是否可以手动去下载,后续放在某一个指定的路径下????
这个问题我遇到过,解决了,需要在花生壳哪里的docker文件,改成国内镜像网站https://hf-mirror.com/可以解决这个问题,这个是花生壳哪里部署问题,根据花生壳演示操作,他会下载模型,或者科学上网一下,但是看你也是ubuntu,需要all_proxy一下
你好,我按照你的去进行了修改,但似乎依旧是这个报错。后续所幸直接改源码。
将原始模型下载下来,放在在花生壳原文件下:
然后,在源码调用时写死路径。
这样做似乎可以了。中间也试了一下,可以进行正常回答了。
但似乎这个好像并不是一直可以l聊天,有时候也会报错。我想我这种办法应该是不太对的。还是希望作者能给个回复,从根源解决。 报错时的log时:(正常聊天没问题(没有知识库)。)
ERROR [nuxt] [request error] [unhandled] [500] 500 Internal Server Error
at Function.generate (./node_modules/.pnpm/openai@4.33.0/node_modules/openai/src/error.ts:95:14)
at OpenAI.makeStatusError (./node_modules/.pnpm/openai@4.33.0/node_modules/openai/src/core.ts:383:21)
at OpenAI.makeRequest (./node_modules/.pnpm/openai@4.33.0/node_modules/openai/src/core.ts:446:24)
at process.processTicksAndRejections (node:internal/process/task_queues:95:5)
at async ./node_modules/.pnpm/@langchain+openai@0.0.15/node_modules/@langchain/openai/dist/chat_models.js:650:29
at async RetryOperation._fn (./node_modules/.pnpm/p-retry@4.6.2/node_modules/p-retry/index.js:50:12)
@satrong
你好,在使用数据库问题回答时,数据召回似乎正常,但rerank时报错了。 在chat_data-chatollama-1 的日志日这样的:
Knowledge base file created with ID: 2 URL: /api/models/chat User: null Chat with knowledge base with id: 1 Knowledge base 新闻数据 with embedding "nomic-embed-text:latest" Creating embeddings for Ollama served model: nomic-embed-text:latest Creating Chroma vector store Initializing ParentDocumentRetriever with RedisDocstore Redis client options: { host: 'redis', port: 6379, username: undefined, password: undefined } Chat with OpenAI, host: http://192.168.0.74:7869/v1 User query: 嫦娥六号是第几次实现月球轨道交会对接? Reformulated query: 嫦娥六号是第几次实现月球轨道交会对接? Relevant documents: [ Document { pageContent: '{"data":null,"code":451,"name":"SecurityCompromiseError","status":45102,"message":"Domain www.sohu.com blocked until Fri Jun 07 2024 03:59:17 GMT+0000 (Coordinated Universal Time) due to previous abuse found on http://www.sohu.com/a/784279579_461392: DDoS attack suspected: Too many requests: 2001","readableMessage":"SecurityCompromiseError: Domain www.sohu.com blocked until Fri Jun 07 2024 03:59:17 GMT+0000 (Coordinated Universal Time) due to previous abuse found on http://www.sohu.com/a/784279579_461392: DDoS attack suspected: Too many requests: 2001"}', metadata: { source: 'https://www.sohu.com/a/784271789_114988', title: '', loc: [Object] } }, � Document { pageContent: '{"data":null,"code":451,"name":"SecurityCompromiseError","status":45102,"message":"Domain www.sohu.com blocked until Fri Jun 07 2024 03:59:17 GMT+0000 (Coordinated Universal Time) due to previous abuse found on http://www.sohu.com/a/784279579_461392: DDoS attack suspected: Too many requests: 2001","readableMessage":"SecurityCompromiseError: Domain www.sohu.com blocked until Fri Jun 07 2024 03:59:17 GMT+0000 (Coordinated Universal Time) due to previous abuse found on http://www.sohu.com/a/784279579_461392: DDoS attack suspected: Too many requests: 2001"}', metadata: { source: 'https://www.sohu.com/?strategyid=00001 ', title: '', loc: [Object] } }, � Document { pageContent: '{"data":null,"code":451,"name":"SecurityCompromiseError","status":45102,"message":"Domain www.sohu.com blocked until Fri Jun 07 2024 03:59:17 GMT+0000 (Coordinated Universal Time) due to previous abuse found on http://www.sohu.com/a/784279579_461392: DDoS attack suspected: Too many requests: 2001","readableMessage":"SecurityCompromiseError: Domain www.sohu.com blocked until Fri Jun 07 2024 03:59:17 GMT+0000 (Coordinated Universal Time) due to previous abuse found on http://www.sohu.com/a/784279579_461392: DDoS attack suspected: Too many requests: 2001"}', metadata: { source: 'https://www.sohu.com/a/784271789_114988', title: '', loc: [Object] } }, � Document { pageContent: '{"data":null,"path":"url","code":400,"name":"ParamValidationError","status":40001,"message":"Invalid protocol about:","readableMessage":"ParamValidationError(url): Invalid protocol about:"}', metadata: { source: 'about:blank#comment_area', title: '', loc: [Object] } } ] Cohere Rerank Options: { apiKey: 'xxxxx', baseUrl: 'http://peanutshell:8000/v1', model: 'ms-marco-MiniLM-L-6-v2', topN: 4 } ERROR [nuxt] [request error] [unhandled] [500] Status code: 500 Body: "Internal Server Error" at CohereClient.<anonymous> (./node_modules/.pnpm/cohere-ai@7.9.3/node_modules/cohere-ai/Client.js:481:27) at Generator.next (<anonymous>) at fulfilled (./node_modules/.pnpm/cohere-ai@7.9.3/node_modules/cohere-ai/Client.js:31:58) at process.processTicksAndRejections (node:internal/process/task_queues:95:5)
在[chat_data-peanutshell-1]花生壳的日志是这样的:
File "/usr/local/lib/python3.11/site-packages/requests/adapters.py", line 507, in send raise ConnectTimeout(e, request=request) requests.exceptions.ConnectTimeout: (MaxRetryError("HTTPSConnectionPool(host='huggingface.co', port=443): Max retries exceeded with url: /cross-encoder/ms-marco-MiniLM-L-6-v2/resolve/main/config.json (Caused by ConnectTimeoutError(<urllib3.connection.HTTPSConnection object at 0x7f5c5f89ac90>, 'Connection to huggingface.co timed out. (connect timeout=10)'))"), '(Request ID: 8539d890-1012-4a77-9ec4-e449865ce6f7)') The above exception was the direct cause of the following exception: Traceback (most recent call last): File "/usr/local/lib/python3.11/site-packages/transformers/utils/hub.py", line 399, in cached_file resolved_file = hf_hub_download( ^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/huggingface_hub/utils/_validators.py", line 114, in _inner_fn return fn(*args, **kwargs) ^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/huggingface_hub/file_download.py", line 1221, in hf_hub_download return _hf_hub_download_to_cache_dir( ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/huggingface_hub/file_download.py", line 1325, in _hf_hub_download_to_cache_dir _raise_on_head_call_error(head_call_error, force_download, local_files_only) File "/usr/local/lib/python3.11/site-packages/huggingface_hub/file_download.py", line 1826, in _raise_on_head_call_error raise LocalEntryNotFoundError( huggingface_hub.utils._errors.LocalEntryNotFoundError: An error happened while trying to locate the file on the Hub and we cannot find the requested files in the local cache. Please check your connection and try again or make sure your Internet connection is on. The above exception was the direct cause of the following exception: Traceback (most recent call last): File "/usr/local/lib/python3.11/site-packages/uvicorn/protocols/http/httptools_impl.py", line 411, in run_asgi result = await app( # type: ignore[func-returns-value] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/uvicorn/middleware/proxy_headers.py", line 69, in __call__ return await self.app(scope, receive, send) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/fastapi/applications.py", line 1054, in __call__ await super().__call__(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/applications.py", line 123, in __call__ await self.middleware_stack(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/middleware/errors.py", line 186, in __call__ raise exc File "/usr/local/lib/python3.11/site-packages/starlette/middleware/errors.py", line 164, in __call__ await self.app(scope, receive, _send) File "/usr/local/lib/python3.11/site-packages/starlette/middleware/exceptions.py", line 65, in __call__ await wrap_app_handling_exceptions(self.app, conn)(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 64, in wrapped_app raise exc File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 53, in wrapped_app await app(scope, receive, sender) File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 756, in __call__ await self.middleware_stack(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 776, in app await route.handle(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 297, in handle await self.app(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 77, in app await wrap_app_handling_exceptions(app, request)(scope, receive, send) File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 64, in wrapped_app raise exc File "/usr/local/lib/python3.11/site-packages/starlette/_exception_handler.py", line 53, in wrapped_app await app(scope, receive, sender) File "/usr/local/lib/python3.11/site-packages/starlette/routing.py", line 72, in app response = await func(request) ^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/fastapi/routing.py", line 278, in app raw_response = await run_endpoint_function( ^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/fastapi/routing.py", line 193, in run_endpoint_function return await run_in_threadpool(dependant.call, **values) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/starlette/concurrency.py", line 42, in run_in_threadpool return await anyio.to_thread.run_sync(func, *args) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/anyio/to_thread.py", line 56, in run_sync return await get_async_backend().run_sync_in_worker_thread( ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/anyio/_backends/_asyncio.py", line 2144, in run_sync_in_worker_thread return await future ^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/anyio/_backends/_asyncio.py", line 851, in run result = context.run(func, *args) ^^^^^^^^^^^^^^^^^^^^^^^^ File "/app/endpoints/__init__.py", line 24, in rerank service = CrossEncoderRerankService(modelName=model_name) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/app/services/cross_encoder/cross_encoder_rerank_service.py", line 17, in __init__ self.cross_encoder = CrossEncoder(model_name=modelName, local_files_only=False) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/sentence_transformers/cross_encoder/CrossEncoder.py", line 72, in __init__ self.config = AutoConfig.from_pretrained( ^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/transformers/models/auto/configuration_auto.py", line 934, in from_pretrained config_dict, unused_kwargs = PretrainedConfig.get_config_dict(pretrained_model_name_or_path, **kwargs) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/transformers/configuration_utils.py", line 632, in get_config_dict config_dict, kwargs = cls._get_config_dict(pretrained_model_name_or_path, **kwargs) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/transformers/configuration_utils.py", line 689, in _get_config_dict resolved_config_file = cached_file( ^^^^^^^^^^^^ File "/usr/local/lib/python3.11/site-packages/transformers/utils/hub.py", line 442, in cached_file raise EnvironmentError( OSError: We couldn't connect to 'https://huggingface.co' to load this file, couldn't find it in the cached files and it looks like cross-encoder/ms-marco-MiniLM-L-6-v2 is not the path to a directory containing a file named config.json. Checkout your internet connection or see how to run the library in offline mode at 'https://huggingface.co/docs/transformers/installation#offline-mode'.
问题是:
- 既然需要重新从HF上下载模型?我们在拉取镜像的时候,有8G的大小,难道不是已经把模型拉取下来了??这个问题可以解决吗??比如说:如果模型已经拉取下来了,如何修改拉取模型的路径?如果没有拉取下来,是否可以手动去下载,后续放在某一个指定的路径下????
这个问题我遇到过,解决了,需要在花生壳哪里的docker文件,改成国内镜像网站https://hf-mirror.com/可以解决这个问题,这个是花生壳哪里部署问题,根据花生壳演示操作,他会下载模型,或者科学上网一下,但是看你也是ubuntu,需要all_proxy一下
你好,我按照你的去进行了修改,但似乎依旧是这个报错。后续所幸直接改源码。
- 将原始模型下载下来,放在在花生壳原文件下:
- 然后,在源码调用时写死路径。
- 这样做似乎可以了。中间也试了一下,可以进行正常回答了。
- 但似乎这个好像并不是一直可以l聊天,有时候也会报错。我想我这种办法应该是不太对的。还是希望作者能给个回复,从根源解决。 报错时的log时:(正常聊天没问题(没有知识库)。)
ERROR [nuxt] [request error] [unhandled] [500] 500 Internal Server Error at Function.generate (./node_modules/.pnpm/openai@4.33.0/node_modules/openai/src/error.ts:95:14) at OpenAI.makeStatusError (./node_modules/.pnpm/openai@4.33.0/node_modules/openai/src/core.ts:383:21) at OpenAI.makeRequest (./node_modules/.pnpm/openai@4.33.0/node_modules/openai/src/core.ts:446:24) at process.processTicksAndRejections (node:internal/process/task_queues:95:5) at async ./node_modules/.pnpm/@langchain+openai@0.0.15/node_modules/@langchain/openai/dist/chat_models.js:650:29 at async RetryOperation._fn (./node_modules/.pnpm/p-retry@4.6.2/node_modules/p-retry/index.js:50:12)
@satrong
我看代码不可以这样做,他是需要放在一个隐藏文件,我没有试过,这个图你看看
是的,我在这里也进行了修改,但依旧是报了原始的那个错误~很奇怪~~@heimy2000
看不懂😂
我也遇到了一样的错误,按照 @heimy2000 给的图片,改了penutshell的docker环境变量之后就好了
直接删除peanutshell就好了,从docker-compose中去掉 #- COHERE_API_KEY=xxxxx
#- COHERE_BASE_URL=http://peanutshell:8000/v1 就能用,就是少了rerank
是的,我在这里也进行了修改,但依旧是报了原始的那个错误~很奇怪~~@heimy2000
docker没用镜像
从源码看,没有这部分peanutshell设置 就是不用rerank操作了 你清除环境变量 重启下试试发自我的 iPhone在 2024年6月30日,21:39,heimy2000 @.***> 写道:
@.***
docker没用镜像
—Reply to this email directly, view it on GitHub, or unsubscribe.You are receiving this because you commented.Message ID: @.***>
遇到了一样的问题 rerank的时候下载模型需要能访问huggingface.co,即便部署的时候改hf镜像站是不起作用的。试了很多办法 好像都是不行。
不知道哪位大哥有好的解决方案 或者打包个不需要访问huggingface的镜像出来呢
你好,在使用数据库问题回答时,数据召回似乎正常,但rerank时报错了。 在chat_data-chatollama-1 的日志日这样的:
在[chat_data-peanutshell-1]花生壳的日志是这样的:
问题是: