Closed zhuwenxing closed 2 months ago
/assign @congqixia /unassign
This looks the same as #28857; could you please verify? /assign @zhuwenxing
It is still reproduced in 2.3-20231205-3d7bcb5e-amd64
[2023-12-05T21:39:43.253Z] [2023-12-05 21:38:35 - DEBUG - ci_test]: (api_request) : [Collection.flush] args: [], kwargs: {'timeout': 180} (api_request.py:62)
[2023-12-05T21:39:43.253Z] [2023-12-05 21:38:38 - DEBUG - ci_test]: (api_response) : None (api_request.py:37)
[2023-12-05T21:39:43.253Z] [2023-12-05 21:38:38 - DEBUG - ci_test]: (api_request) : [Collection.flush] args: [], kwargs: {'timeout': 180} (api_request.py:62)
[2023-12-05T21:39:43.253Z] [2023-12-05 21:38:38 - DEBUG - ci_test]: (api_response) : None (api_request.py:37)
[2023-12-05T21:39:43.253Z] [2023-12-05 21:38:38 - INFO - ci_test]: assert flush: 3.036179542541504, entities: 5000 (test_all_collections_after_chaos.py:71)
[2023-12-05T21:39:43.253Z] [2023-12-05 21:38:39 - INFO - ci_test]: index info: [{'collection': 'SearchChecker__WDMottvy', 'field': 'float_vector', 'index_name': 'index__vqpJ58XD', 'index_param': {'index_type': 'HNSW', 'metric_type': 'L2', 'params': {'M': 48, 'efConstruction': 500}}}] (test_all_collections_after_chaos.py:87)
[2023-12-05T21:39:43.253Z] [2023-12-05 21:38:39 - DEBUG - ci_test]: (api_request) : [Collection.load] args: [None, 1, 180], kwargs: {} (api_request.py:62)
[2023-12-05T21:39:43.253Z] [2023-12-05 21:38:41 - DEBUG - ci_test]: (api_response) : None (api_request.py:37)
[2023-12-05T21:39:43.253Z] [2023-12-05 21:38:41 - DEBUG - ci_test]: (api_request) : [Collection.search] args: [[[0.12861101059653923, 0.0073168426706904524, 0.09216541040468074, 0.07176322089673397, 0.09538922667491329, 0.03271475734423939, 0.0783028088693867, 0.07102629789407297, 0.14599988454247495, 0.016754264535591545, 0.1441581835040213, 0.13765911133929298, 0.03877683399555063, 0.02677501214728548, 0......., kwargs: {} (api_request.py:62)
[2023-12-05T21:39:43.253Z] [2023-12-05 21:38:41 - ERROR - pymilvus.decorators]: RPC error: [search], <MilvusException: (code=503, message=failed to search: segment lacks[segment=446119342078766866]: channel not available[channel=by-dev-rootcoord-dml_5_446119342078766854v0])>, <Time:{'RPC start': '2023-12-05 21:38:41.827668', 'RPC error': '2023-12-05 21:38:41.833847'}> (decorators.py:128)
[2023-12-05T21:39:43.253Z] [2023-12-05 21:38:41 - ERROR - ci_test]: Traceback (most recent call last):
[2023-12-05T21:39:43.253Z] File "/home/jenkins/agent/workspace/tests/python_client/utils/api_request.py", line 32, in inner_wrapper
[2023-12-05T21:39:43.253Z] res = func(*args, **_kwargs)
[2023-12-05T21:39:43.253Z] File "/home/jenkins/agent/workspace/tests/python_client/utils/api_request.py", line 63, in api_request
[2023-12-05T21:39:43.253Z] return func(*arg, **kwargs)
[2023-12-05T21:39:43.253Z] File "/usr/local/lib/python3.8/dist-packages/pymilvus/orm/collection.py", line 782, in search
[2023-12-05T21:39:43.253Z] resp = conn.search(
[2023-12-05T21:39:43.253Z] File "/usr/local/lib/python3.8/dist-packages/pymilvus/decorators.py", line 129, in handler
[2023-12-05T21:39:43.253Z] raise e from e
[2023-12-05T21:39:43.253Z] File "/usr/local/lib/python3.8/dist-packages/pymilvus/decorators.py", line 125, in handler
[2023-12-05T21:39:43.253Z] return func(*args, **kwargs)
[2023-12-05T21:39:43.253Z] File "/usr/local/lib/python3.8/dist-packages/pymilvus/decorators.py", line 164, in handler
[2023-12-05T21:39:43.253Z] return func(self, *args, **kwargs)
[2023-12-05T21:39:43.253Z] File "/usr/local/lib/python3.8/dist-packages/pymilvus/decorators.py", line 104, in handler
[2023-12-05T21:39:43.253Z] raise e from e
[2023-12-05T21:39:43.253Z] File "/usr/local/lib/python3.8/dist-packages/pymilvus/decorators.py", line 68, in handler
[2023-12-05T21:39:43.253Z] return func(*args, **kwargs)
[2023-12-05T21:39:43.253Z] File "/usr/local/lib/python3.8/dist-packages/pymilvus/client/grpc_handler.py", line 748, in search
[2023-12-05T21:39:43.253Z] return self._execute_search(request, timeout, round_decimal=round_decimal, **kwargs)
[2023-12-05T21:39:43.253Z] File "/usr/local/lib/python3.8/dist-packages/pymilvus/client/grpc_handler.py", line 709, in _execute_search
[2023-12-05T21:39:43.253Z] raise e from e
[2023-12-05T21:39:43.253Z] File "/usr/local/lib/python3.8/dist-packages/pymilvus/client/grpc_handler.py", line 702, in _execute_search
[2023-12-05T21:39:43.253Z] check_status(response.status)
[2023-12-05T21:39:43.253Z] File "/usr/local/lib/python3.8/dist-packages/pymilvus/client/utils.py", line 54, in check_status
[2023-12-05T21:39:43.253Z] raise MilvusException(status.code, status.reason, status.error_code)
[2023-12-05T21:39:43.253Z] pymilvus.exceptions.MilvusException: <MilvusException: (code=503, message=failed to search: segment lacks[segment=446119342078766866]: channel not available[channel=by-dev-rootcoord-dml_5_446119342078766854v0])>
[2023-12-05T21:39:43.253Z] (api_request.py:45)
[2023-12-05T21:39:43.253Z] [2023-12-05 21:38:41 - ERROR - ci_test]: (api_response) : <MilvusException: (code=503, message=failed to search: segment lacks[segment=446119342078766866]: channel not available[channel=by-dev-rootcoord-dml_5_446119342078766854v0])> (api_request.py:46)
log: artifacts-querynode-pod-kill-8596-server-logs.tar.gz artifacts-querynode-pod-kill-8596-pytest-logs.tar.gz
/unassign /assign @congqixia
My mistake, the fix PR is still blocked by a UT issue.
Not reproduced in 2.3-20231208-ecbb6160
@zhuwenxing Got it. Thanks!
It was reproduced again in 2.3-20231212-286dce0d-amd64
failed job: https://qa-jenkins.milvus.io/blue/organizations/jenkins/chaos-test-for-release-cron/detail/chaos-test-for-release-cron/8792/pipeline
log:
This issue is still reproduced in 2.3-20231215-97d71c25
failed job: https://qa-jenkins.milvus.io/blue/organizations/jenkins/chaos-test-kafka-for-release-cron/detail/chaos-test-kafka-for-release-cron/9157/pipeline
log: artifacts-querynode-pod-kill-9157-pytest-logs.tar.gz artifacts-querynode-pod-kill-9157-server-logs.tar.gz
Two test cases failed: one is still "segment lacks", but the other is "failed to search: node offline[node=18]".
The distribution went wrong after the querynode chaos.
There were lots of set actions generated, but no trace of them was found in the querynode.
Could you please help investigate this?
/assign @yah01
@yah01 Reproduced again with image master-20240130-fda6f8d0: https://qa-jenkins.milvus.io/blue/organizations/jenkins/chaos-test-kafka-cron/detail/chaos-test-kafka-cron/11091/pipeline/289
[2024-01-31T21:26:44.257Z] [2024-01-31 21:25:35 - ERROR - pymilvus.decorators]: RPC error: [search], <MilvusException: (code=503, message=failed to search: segment lacks[segment=447410166423819541]; segment lacks[segment=447410166425624829]: channel not available[channel=by-dev-rootcoord-dml_11_447410166424219065v0])>, <Time:{'RPC start': '2024-01-31 21:25:35.575838', 'RPC error': '2024-01-31 21:25:35.579349'}> (decorators.py:134)
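For triage across runs, the error strings quoted in this thread can be broken down into the segment IDs, channel name, and offline node IDs they mention. The sketch below is a hypothetical helper, not part of pymilvus or the test suite; the function name `summarize_search_error` and the regular expressions are assumptions inferred only from the messages shown above, so other message layouts may not match.

```python
import re

# Hypothetical triage helper: the patterns below are inferred from the error
# strings quoted in this thread, not from the Milvus source code.
SEGMENT_RE = re.compile(r"segment lacks\[segment=(\d+)\]")
CHANNEL_RE = re.compile(r"channel not available\[channel=([^\]]+)\]")
NODE_RE = re.compile(r"node offline\[node=(\d+)\]")


def summarize_search_error(message: str) -> dict:
    """Collect segment IDs, channel names, and offline node IDs from a message."""
    return {
        "segments": [int(s) for s in SEGMENT_RE.findall(message)],
        "channels": CHANNEL_RE.findall(message),
        "offline_nodes": [int(n) for n in NODE_RE.findall(message)],
    }


# The message from the 2024-01-31 run quoted above.
msg = (
    "failed to search: segment lacks[segment=447410166423819541]; "
    "segment lacks[segment=447410166425624829]: channel not available"
    "[channel=by-dev-rootcoord-dml_11_447410166424219065v0]"
)
summary = summarize_search_error(msg)
print(summary)
```

Grouping failures by the extracted channel or node ID may make it easier to tell whether repeated runs hit the same shard or a different one each time.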
@zhuwenxing It seems this issue is not up to date. I'll close it; please file a new one if the same problem reoccurs.
Is there an existing issue for this?
Environment
Current Behavior
Expected Behavior
No response
Steps To Reproduce
No response
Milvus Log
failed job: https://qa-jenkins.milvus.io/blue/organizations/jenkins/chaos-test-for-release-cron/detail/chaos-test-for-release-cron/8470/pipeline
log:
artifacts-querynode-pod-failure-8470-pytest-logs.tar.gz
artifacts-querynode-pod-failure-8470-server-logs.tar.gz
Anything else?
4am chaos-testing