run-llama / llama-hub

A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain
https://llamahub.ai/
MIT License
3.43k stars 728 forks source link

[Bug]: AzStorageBlobReader / Cannot instantiate this tokenizer from a slow version #867

Open alvarojauna opened 6 months ago

alvarojauna commented 6 months ago

Bug Description

from llama_index import download_loader

AzStorageBlobReader = download_loader("AzStorageBlobReader")

loader = AzStorageBlobReader( container_name='', connection_string='DefaultEndpointsProtocol=https;AccountName==;EndpointSuffix=core.windows.net' )

documents = loader.load_data()

Could not find image processor class in the image processor config or the model config. Loading based on pattern matching with the model's feature extractor configuration.

ValueError Traceback (most recent call last) in <cell line: 10>() 8 ) 9 ---> 10 documents = loader.load_data()

9 frames /usr/local/lib/python3.10/dist-packages/transformers/tokenization_utils_fast.py in init(self, *args, **kwargs) 100 101 if from_slow and slow_tokenizer is None and self.slow_tokenizer_class is None: --> 102 raise ValueError( 103 "Cannot instantiate this tokenizer from a slow version. If it's based on sentencepiece, make sure you " 104 "have sentencepiece installed."

ValueError: Cannot instantiate this tokenizer from a slow version. If it's based on sentencepiece, make sure you have sentencepiece installed.

Why does this happend?

Version

last one

Steps to Reproduce

that one

Relevant Logs/Tracbacks

No response