marqo-ai / marqo

Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
https://www.marqo.ai/
Apache License 2.0
4.48k stars 184 forks source link

[BUG] Unable to load model NV-EMBED-V2 #972

Open icon2341 opened 1 day ago

icon2341 commented 1 day ago

Describe the bug A clear and concise description of what the bug is.

To Reproduce Steps to reproduce the behavior:

  1. Follow commands from marqo quick setup on docker
  2. run with gpu all
  3. specify settings:

# print(reviews) settings = { "model": "hf-hub:nvidia/NV-Embed-v2", "modelProperties": { "name": "hf-hub:nvidia/NV-Embed-v2", "dimensions": 4096, "type": "hf", }, }

  1. run create index with settings

Expected behavior Model loads and begins embedding results

Screenshots raise MarqoWebError(message=response_msg, code=code, error_type=error_type, marqo.errors.MarqoWebError: MarqoWebError: MarqoWebError Error message: {'message': "Problem vectorising query. Reason: Unable to load model=hf-hub:nvidia/NV-Embed-v2 on device=cuda with normalization=True. If you are trying to load a custom model, please check that model_properties={'name': 'hf-hub:nvidia/NV-Embed-v2', 'dimensions': 4096, 'type': 'hf'} is correct and Marqo has access to the weights file.", 'code': 'bad_request', 'type': 'invalid_request', 'link': 'https://docs.marqo.ai/2.12/Guides/Models-Reference/list_of_models/', 'telemetry': {'timesMs': {'add_documents.create_vectors': 0.32188600016525015, 'POST /indexes/my-first-index/documents': 15.058888000112347}}} status_code: 400, type: invalid_request, code: bad_request, link:

Desktop (please complete the following information):

Additional context Add any other context about the problem here.

wanliAlex commented 1 day ago

Hi, thanks for the detailed information. I am looking into this.

wanliAlex commented 1 day ago

Hey, this model requires custom code thus it is not supported in Marqo. We will add the compatibility of models in our documentation. Alternatively, you can try other text models in our model registry.